2021
DOI: 10.3390/f12020216
|View full text |Cite
|
Sign up to set email alerts
|

Combination of Feature Selection and CatBoost for Prediction: The First Application to the Estimation of Aboveground Biomass

Abstract: Increasing numbers of explanatory variables tend to result in information redundancy and “dimensional disaster” in the quantitative remote sensing of forest aboveground biomass (AGB). Feature selection of model factors is an effective method for improving the accuracy of AGB estimates. Machine learning algorithms are also widely used in AGB estimation, although little research has addressed the use of the categorical boosting algorithm (CatBoost) for AGB estimation. Both feature selection and regression for AG… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 124 publications
(56 citation statements)
references
References 65 publications
0
34
0
2
Order By: Relevance
“…In fact, small changes in the learning sample can cause dramatic changes in the built tree derived from individual tree-based models, and so the estimated results can be unstable and inaccurate. This is the reason why most recent studies have adopted bagging and boosting ensemble algorithms [25,26].…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In fact, small changes in the learning sample can cause dramatic changes in the built tree derived from individual tree-based models, and so the estimated results can be unstable and inaccurate. This is the reason why most recent studies have adopted bagging and boosting ensemble algorithms [25,26].…”
Section: Resultsmentioning
confidence: 99%
“…Machine learning methods have been widely applied to develop AGB prediction models from different remote sensing data sources such as optical satellite imagery, UAV stereo-imagery, airborne hyperspectral images, ALS (airborne laser scanning), and spaceborne SAR [15,16,25,26,36,37].…”
Section: Discussionmentioning
confidence: 99%
“…In this study, the CB and the XGB are well‐known ML algorithms and successfully applied for the estimation of various biophysical parameters (Ha et al, 2021b; Luo et al, 2021; Naghibi et al, 2020; Pham et al, 2020). However, the implementation of these models for Chl‐a estimation in a highly eutrophic reservoir is limited; therefore, our results might suggest novel findings for model selection and their performance as well.…”
Section: Discussionmentioning
confidence: 99%
“…During the past six years, the random forest usage has steadily increased. By using Google Trends service, it was possible to collect historic data that was used to visualize the search interest for five prediction related terms: (a) linear regression-one of the most used prediction algorithms, (b) random forest, (c) XGBoost-a regularizing gradient boosting algorithm enforced by a strong community of data scientists [63], (d) CatBoost-one of the newest gradient boosting prediction algorithms [64], (e) prediction model-a generic term used to describe any of the previous mentioned algorithms (Figure 3). As it can be observed in Figure 3, the search interest value for the 'linear regression' term is the highest (which is to be expected due to its age and worldwide popularity).…”
Section: Model Developmentmentioning
confidence: 99%