2019
DOI: 10.1016/j.envpol.2018.11.034
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of random forest regression and multiple linear regression for predicting indoor fine particulate matter concentrations in a highly polluted city

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
72
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 124 publications
(74 citation statements)
references
References 37 publications
2
72
0
Order By: Relevance
“…We adopted a two-step process, removing highly intercorrelated variables for reducing the computational complexity of the algorithm and the dimension of the input data. Several related publications have shown that the predictive power of RFs may benefit from variable selection [50,71]. Finally, in terms of the most appropriate diversity index to be used with remote sensing data, the results indicate that, in our site, a remotely sensed spectral signal correlates better with Shannon's Index than Simpson's diversity, since, across all sensors, the Sentinel-2 MSI model provided the highest coefficient of determination.…”
Section: Discussionmentioning
confidence: 58%
See 2 more Smart Citations
“…We adopted a two-step process, removing highly intercorrelated variables for reducing the computational complexity of the algorithm and the dimension of the input data. Several related publications have shown that the predictive power of RFs may benefit from variable selection [50,71]. Finally, in terms of the most appropriate diversity index to be used with remote sensing data, the results indicate that, in our site, a remotely sensed spectral signal correlates better with Shannon's Index than Simpson's diversity, since, across all sensors, the Sentinel-2 MSI model provided the highest coefficient of determination.…”
Section: Discussionmentioning
confidence: 58%
“…Unlike to the majority of the previous studies, we did not use a linear regression approach for assessing the relationship between remote sensing data and field the measurements of tree species diversity. RF regression does not require any data distribution assumptions and can detect interactions and higher order relationships between independent variables without a priori specification of these terms [50]. Furthermore, RF are particularly appealing due to their ability to generalize even under a limited training samples regime, as often is the case in remote sensing applications [48].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Every bootstrap sample for each CART is randomly selected from pre-datasets, and the features used are also extracted randomly from all features in a certain proportion. Specifically, every CART can train a nonlinear fitting model to estimate NSSR with the defined bootstrap sample; the output NSSR of RF is an average of the outputs of an individual CART [44]. Hence, because of its 'bagging' thought, RF algorithms typically yield a reduced bias of the estimations and, in general, good accuracies.…”
Section: Brief Introduction Of Machine Learning Algorithmsmentioning
confidence: 99%
“…Many papers compared linear regression with random forest algorithms in other elds. As we can see, the random forest algorithm has not performed better in all eld and aspects (26)(27)(28), this reveals that random forest method can only take advantage over linear regression in some data models. So a more signi cant number of multi-centre data are needed to validate our outcomes in the eld of GFR estimation.…”
Section: Discussionmentioning
confidence: 92%