2017
DOI: 10.1002/pop4.169
|View full text |Cite
|
Sign up to set email alerts
|

Is Random Forest a Superior Methodology for Predicting Poverty? An Empirical Assessment

Abstract: Random forest (RF) is in many fields of research a common method for data‐driven predictions. Within economics and prediction of poverty, RF is rarely used. Comparing out‐of‐sample predictions in surveys for the same year in six countries shows that RF is often more accurate than current common practice (multiple imputations with variables selected by Stepwise and Lasso), suggesting that this method could contribute to better poverty predictions. However, none of the methods consistently provides accurate pred… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
20
0

Year Published

2018
2018
2025
2025

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 32 publications
(22 citation statements)
references
References 17 publications
2
20
0
Order By: Relevance
“…This paper provides a comprehensive analysis on the drivers of poverty via the application of various estimation methods, such as probit, tobit, and machine learning algorithms. While it is widely recognised in the poverty literature that traditional methods such as probit or tobit can contribute to understanding the main factors influencing poverty status, the application of machine learning techniques in this context is limited to only a few countries (Sohnesen and Stender, 2016;Zhao et al, 2019). Because machine learning, unlike probit or tobit, offers a nonparametric and flexible way to model poverty status, we hope to develop a superior povertyidentification technique that not only results in higher predictive power but also reveals more clearly the underlying empirical relationships considered.…”
Section: Motivation Behind Empirical Methodsmentioning
confidence: 99%
“…This paper provides a comprehensive analysis on the drivers of poverty via the application of various estimation methods, such as probit, tobit, and machine learning algorithms. While it is widely recognised in the poverty literature that traditional methods such as probit or tobit can contribute to understanding the main factors influencing poverty status, the application of machine learning techniques in this context is limited to only a few countries (Sohnesen and Stender, 2016;Zhao et al, 2019). Because machine learning, unlike probit or tobit, offers a nonparametric and flexible way to model poverty status, we hope to develop a superior povertyidentification technique that not only results in higher predictive power but also reveals more clearly the underlying empirical relationships considered.…”
Section: Motivation Behind Empirical Methodsmentioning
confidence: 99%
“…While the selection of variables based on LASSO, present better results compared to RF in terms of reducing the inclusion error (non-poor households classified as poor). Sohnesen & Stender (2017) also, use LASSO and RF to predict poverty using data of a period of one year. The results indicate that RF is a good predictor of poverty and obtains more robust estimates compared to Linear Regression estimators.…”
Section: Related Work Social Science and Machine Learningmentioning
confidence: 99%
“…McBride and Nichols (2016) seminal application of machine learning to poverty targeting is the first to use an approach that is explicitly designed to improve out-of-sample prediction. However, along with Sohnesen and Stender (2017), they employ a black-box random forest methodology. 29 Mark Schreiner's insightful and widely-used approach to designing proxy-means tests (e.g.…”
Section: Pragmatic Drought-contingent Targeting Strategymentioning
confidence: 99%