Background: The traditional method used to estimate tree biomass is allometry. In this method, models are tested and equations fitted by regression usually applying ordinary least squares, though other analogous methods are also used for this purpose. Due to the nature of tree biomass data, the assumptions of regression are not always accomplished, bringing uncertainties to the inferences. This article demonstrates that the Data Mining (DM) technique can be used as an alternative to traditional regression approach to estimate tree biomass in the Atlantic Forest, providing better results than allometry, and demonstrating simplicity, versatility and flexibility to apply to a wide range of conditions. Results: Various DM approaches were examined regarding distance, number of neighbors and weighting, by using 180 trees coming from environmental restoration plantations in the Atlantic Forest biome. The best results were attained using the Chebishev distance, 1/d weighting and 5 neighbors. Increasing number of neighbors did not improve estimates. We also analyze the effect of the size of data set and number of variables in the results. The complete data set and the maximum number of predicting variables provided the best fitting. We compare DM to Schumacher-Hall model and the results showed a gain of up to 16.5 % in reduction of the standard error of estimate. Conclusion: It was concluded that Data Mining can provide accurate estimates of tree biomass and can be successfully used for this purpose in environmental restoration plantations in the Atlantic Forest. This technique provides lower standard error of estimate than the Schumacher-Hall model and has the advantage of not requiring some statistical assumptions as do the regression models. Flexibility, versatility and simplicity are attributes of DM that corroborates its great potential for similar applications.
Forests contribute to climate change mitigation by storing carbon in tree biomass. The amount of carbon stored in this carbon pool is estimated by using either allometric equations or biomass expansion factors. Both of the methods provide estimate of the carbon stock based on the biometric parameters of a model tree. This study calls attention to the potential advantages of the data mining technique known as instance-based classification, which is not used currently for this purpose. The analysis of the data on the carbon storage in 30 trees of Brazilian pine (Araucaria angustifolia) shows that the instance-based classification provides as relevant estimates as the conventional methods do. The coefficient of correlation between the estimated and measured values of carbon storage in tree biomass does not vary significantly with the choice of the method. The use of some other measures of method performance leads to the same result. In contrast to the convention methods the instance-based classification does not presume any specific form of the function relating carbon storage to the biometric parameters of the tree. Since the best form of such function is difficult to find, the instance-based classification could outperform the conventional methods in some cases, or simply get rid of the questions about the choice of the allometric equations.
Este trabalho objetivou ajustar equações para estimar a biomassa total de plantas de bambu, do gênero Guadua, bem como comparar o ajuste de equações por regressão linear com a técnica de mineração de dados. Foram utilizados 38 colmos de bambu, nos quais foram mensuradas as variáveis diâmetro à altura do peito (dap), diâmetro do colo do colmo e altura do colmo, seguido da determinação de massa total por método destrutivo. A biomassa determinada em 25 colmos foi utilizada para ajuste de equações pelo método dos mínimos quadrados e 13 colmos serviram para a validação da melhor equação. As frações de biomassa por compartimento diferem significativamente (p < 0,05) entre si. A maior fração da biomassa corresponde ao colmo, representando 69,2% do total, seguida pela dos rizomas, dos galhos e da folhagem, com 15,7; 10,8 e 4,2%, respectivamente. A melhor equação ajustada para estimar a biomassa total apresentou coeficiente de determinação de 0,93 e erro padrão da estimativa de 15%. Já a técnica de mineração de dados apresentou coeficiente de determinação de 0,81, com erro padrão de 23,8%. Pode-se estimar acuradamente a biomassa de Guadua por regressão linear e por mineração dos dados. Neste trabalho, o método de regressão apresentou melhor desempenho. A limitação de dados pode ser o fator determinante para o pior desempenho da técnica de mineração de dados, pois requer uma massa de dados mais ampla para funcionar satisfatoriamente.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.