2020
DOI: 10.1214/19-ba1195
|View full text |Cite
|
Sign up to set email alerts
|

Bayesian Regression Tree Models for Causal Inference: Regularization, Confounding, and Heterogeneous Effects (with Discussion)

Abstract: This paper presents a novel nonlinear regression model for estimating heterogeneous treatment effects, geared specifically towards situations with small effect sizes, heterogeneous effects, and strong confounding by observables. Standard nonlinear regression models, which may work quite well for prediction, have two notable weaknesses when used to estimate heterogeneous treatment effects. First, they can yield badly biased estimates of treatment effects when fit to data with strong confounding. The Bayesian ca… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

3
238
0
4

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 238 publications
(245 citation statements)
references
References 103 publications
3
238
0
4
Order By: Relevance
“…Both the model for potential outcomes and PS (i.e., π(X i ) = E(Z i |X i )) are fitted by BART with the default setting. Direct/ps-BART incorporates the estimate of PS in the specification of the outcome model, implicitly inducing a covariate-dependent prior to the regression function, which even outperforms the BART model ( [25,40]. .…”
Section: Additional Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Both the model for potential outcomes and PS (i.e., π(X i ) = E(Z i |X i )) are fitted by BART with the default setting. Direct/ps-BART incorporates the estimate of PS in the specification of the outcome model, implicitly inducing a covariate-dependent prior to the regression function, which even outperforms the BART model ( [25,40]. .…”
Section: Additional Methodsmentioning
confidence: 99%
“…Recently, more flexible semi-parametric and non-parametric models have demonstrated good performance [14]. In particular, BART seems well suited to uncover the true treatment effect when the treatment assignment is confounded [14,[25][26][27].…”
Section: Introductionmentioning
confidence: 99%
“…The latter authors introduced XGBoost, an especially fast and scalable version that has gone on to dominate in a number of public data science competitions (Gorman 2017). Bayesian versions of boosting have also been developed (Chipman et al 2010) with extensions to causal inference (Hahn et al 2017;Hill 2011), survival analysis (Sparapani et al 2016), and high-dimensional modeling (Linero 2018;Linero and Yang 2018).…”
Section: Boostingmentioning
confidence: 99%
“…For example, Hill () proposed using Bayesian additive regression trees (BART) to flexibly model the response surface. To mitigate the regression‐induced confounding issue (Hahn et al, ) resulting in biased estimates of treatment effects in regularized models such as BART used in Hill (), Hahn et al () proposed a Bayesian random forest model which represents the regression as a sum of two components: one is a function of covariates and estimated propensity score (PS) as prognostic effect and the other is a function on treatment effects that depends on covariates and estimated PS. The model also allows the estimates of heterogeneous treatment effects.…”
Section: Introductionmentioning
confidence: 99%