DOI: 10.31274/etd-180810-6083
|View full text |Cite
|
Sign up to set email alerts
|

Random forest robustness, variable importance, and tree aggregation

Abstract: In classification problems, random forests are often used to estimate the probability of a new case falling into each of C possible categories. These probabilities are routinely of interest, for example, in risk analysis. Different methods have been proposed for both growing random forests and aggregating predictions from individual trees, but comparative studies are limited. In this paper, we compare and contrast prominent random forest techniques, with particular emphasis on the aggregation of tree predictio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(13 citation statements)
references
References 17 publications
0
13
0
Order By: Relevance
“…Random Forest (RF) is a tree-based ML technique proposed by [122]. Theoretically, RF model is an ensemble of multiple decision trees and uses the majority vote/decision of the trees as the final RF model [123,124]. The algorithm becomes more robust when more decision trees are constructed.…”
Section: Machine Learning (Ml) For Pm 25 Estimationmentioning
confidence: 99%
“…Random Forest (RF) is a tree-based ML technique proposed by [122]. Theoretically, RF model is an ensemble of multiple decision trees and uses the majority vote/decision of the trees as the final RF model [123,124]. The algorithm becomes more robust when more decision trees are constructed.…”
Section: Machine Learning (Ml) For Pm 25 Estimationmentioning
confidence: 99%
“…Second, we consider potential cultural differences in the meaning of the studied constructs by establishing equivalence of factor scores through (partial) strong invariance (see ( 21 )). Finally, to determine the efficacy of our independent variables in explaining contact avoidance, hygiene maintenance, and COVID-19 policy support in each country, we applied random forest-based regression algorithms appropriate for complex data sets with possible nonlinear and interactive relationships between variables ( 22 , 23 ).…”
Section: Introductionmentioning
confidence: 99%
“…In this study, the random forest regression method was employed to evaluate the importance of each LBSN feature. The calculating procedures were as follows [22]:…”
Section: Feature Selection Using Random Forest Regression Methodsmentioning
confidence: 99%