2017
DOI: 10.3102/1076998617731518
|View full text |Cite
|
Sign up to set email alerts
|

Rebar: Reinforcing a Matching Estimator With Predictions From High-Dimensional Covariates

Abstract: In causal matching designs, some control subjects are often left unmatched, and some covariates are often left unmodeled. This article introduces “rebar,” a method using high-dimensional modeling to incorporate these commonly discarded data without sacrificing the integrity of the matching design. After constructing a match, a researcher uses the unmatched control subjects—the remnant—to fit a machine learning model predicting control potential outcomes as a function of the full covariate matrix. The resulting… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
8
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 74 publications
0
8
0
Order By: Relevance
“…Thus, these schools are outside of our evaluation sample. Following Sales, Hansen, and Rowan (2014), we start by estimating the following linear regression model using the remnant data:…”
Section: Technical Appendixmentioning
confidence: 99%
See 3 more Smart Citations
“…Thus, these schools are outside of our evaluation sample. Following Sales, Hansen, and Rowan (2014), we start by estimating the following linear regression model using the remnant data:…”
Section: Technical Appendixmentioning
confidence: 99%
“…Thus, these schools are outside of our evaluation sample. Following Sales, Hansen, and Rowan (2014), we start by estimating the following linear regression model using the remnant data: In Equation (C1), Y sdt is a Grade 3 math test score for school s in district d in year t , and X s and X d are defined as above. 22 After estimating Equation (C1), we store the coefficient estimates true α ^ 1 t and true α ^ 2 t and construct the following residualized test score outcome for each school in our analytic sample in each year: In Equation (C2), Y sdt is the Grade 3 test score for school s in district d in year t for a school that adopted one of the four primary curricula.…”
Section: Data Appendixmentioning
confidence: 99%
See 2 more Smart Citations
“…In this analysis, true Y ^ C ( R i ) cannot itself be influenced by Maria since it is based on a model fit to pre-Maria death counts (cf. Sales, Hansen, & Rowan, 2018). Hence, the effect of Maria on E (i.e., E T E C ) is exactly equal to its effect on Y .…”
Section: Introductionmentioning
confidence: 99%