2019
DOI: 10.3390/rs11040375
|View full text |Cite
|
Sign up to set email alerts
|

Estimation of Poverty Using Random Forest Regression with Multi-Source Data: A Case Study in Bangladesh

Abstract: Spatially explicit and reliable data on poverty is critical for both policy makers and researchers. However, such data remain scarce particularly in developing countries. Current research is limited in using environmental data from different sources in isolation to estimate poverty despite the fact that poverty is a complex phenomenon which cannot be quantified either theoretically or practically by one single data type. This study proposes a random forest regression (RFR) model to estimate poverty at 10 km × … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
87
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 128 publications
(88 citation statements)
references
References 50 publications
1
87
0
Order By: Relevance
“…This indicates that the model may have a tendency to underestimate the higher values and overestimate the lowers. One possible explanation for this phenomenon could be the inherent limitations of the random forest, whose final result is obtained by averaging the results of multiple decision trees, which may lead to a decreasing variance of the model's results and an unobtainable prediction value that exceeds the range of observed values [31,62]. Most of the standardized residuals fall into the range −0.5~0.5, signifying good performance of models on most road sections.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…This indicates that the model may have a tendency to underestimate the higher values and overestimate the lowers. One possible explanation for this phenomenon could be the inherent limitations of the random forest, whose final result is obtained by averaging the results of multiple decision trees, which may lead to a decreasing variance of the model's results and an unobtainable prediction value that exceeds the range of observed values [31,62]. Most of the standardized residuals fall into the range −0.5~0.5, signifying good performance of models on most road sections.…”
Section: Resultsmentioning
confidence: 99%
“…To bridge the research gap, this study aimed to explore the ability of POI to estimate traffic collisions by categories of collisions, types of roads, and periods of the day. In particular, this study introduced the nighttime lights (NTL) dataset [25][26][27][28][29][30][31], a type of remote sensing data, into the crash prediction models. The aim was to explore the ability of the two data sources to map urban road safety, since both of them are easily obtained and are widely acknowledged for reflecting human activities and urban structure.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, learning-based approaches attempt to predict whether a given pixel is a road or not, according to the context around the target pixel [9,10,[36][37][38][39][40]. The extraction is similar to the task of salient objects extraction or segmentation [41][42][43][44][45][46][47].…”
Section: Related Workmentioning
confidence: 99%
“…Zhao et al [98] used a random forest regression (RFR) model by combining features extracted from multiple data sources, including National Polar-orbiting Partnership Visible Infrared Imaging Radiometer Suite (NPP-VIIRS) Day/Night Band (DNB) nighttime light (NTL) data, Google satellite imagery and land cover map, road map and division headquarter location data to estimate poverty based on household wealth index (WI) at a 10-km spatial resolution. The authors trained the RFR model using data in Bangladesh and applied the model to both Bangladesh and Nepal to evaluate the model's accuracy.…”
Section: Use Of Satellite Imaging Data To Predict Energy Povertymentioning
confidence: 99%