2021
DOI: 10.1097/ede.0000000000001332
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning for Causal Inference: On the Use of Cross-fit Estimators

Abstract: Background: Modern causal inference methods allow machine learning to be used to weaken parametric modeling assumptions. However, the use of machine learning may result in complications for inference. Doubly robust cross-fit estimators have been proposed to yield better statistical properties. Methods: We conducted a simulation study to assess the performance of several different estimators for the average causal effect. The data generating mechanisms for the simulated treatment and outcome included log-transf… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
36
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 51 publications
(37 citation statements)
references
References 45 publications
(72 reference statements)
1
36
0
Order By: Relevance
“…However, even if TMLE is less prone to errors due to misspecification than alternative methods (eg, inverse probability weighting) there is some question regarding the validity of the robustness of inference produced by TMLE in nonparametric settings 49 . This is an area of ongoing work (ie, double/debiased machine learning, cross‐validated TMLE and cross‐fit estimators) 44,50,51 . Furthermore, TMLE and the SuperLearner were originally developed in R 35,46 .…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…However, even if TMLE is less prone to errors due to misspecification than alternative methods (eg, inverse probability weighting) there is some question regarding the validity of the robustness of inference produced by TMLE in nonparametric settings 49 . This is an area of ongoing work (ie, double/debiased machine learning, cross‐validated TMLE and cross‐fit estimators) 44,50,51 . Furthermore, TMLE and the SuperLearner were originally developed in R 35,46 .…”
Section: Discussionmentioning
confidence: 99%
“…Using these approaches may require larger sample sizes to avoid finite‐sample bias 16,54 . However, recent developments support the use of cross‐fit double‐robust estimators for data adaptive estimation 44,50 . Tutorials introducing the use and derivation of the functional delta method and influence curve for applied researchers are needed.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Curth and van der Schaar (2021) focus directly on meta-learning algorithms for estimation of heterogeneous treatment effects, but refrain from studying sample-splitting and cross-fitting procedures and rely fully on the full-sample estimation. In this regard, Zivich and Breskin (2021) study the performance of treatment effect estimators based on cross-fitting, including some meta-learners as well. Similarly to Knaus et al (2021) they find the DR-learner with an ensemble machine learning base learners together with cross-fitting to perform the best among all considered estimators, both in comparison to cases without cross-fitting and to parametric base learners.…”
Section: Related Workmentioning
confidence: 99%
“…Similarly to Knaus et al (2021) they find the DR-learner with an ensemble machine learning base learners together with cross-fitting to perform the best among all considered estimators, both in comparison to cases without cross-fitting and to parametric base learners. However, Zivich and Breskin (2021) study exclusively the estimation of average effects without examining convergence performance of the estimators, considering only a single sample size of 3 000 observations. 3 Recently, Jacob (2020) focuses on the estimation of heterogeneous treatment effects under various cross-fitting schemes for selected meta-learning algorithms.…”
Section: Related Workmentioning
confidence: 99%