2021
DOI: 10.48550/arxiv.2107.02780
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Causal Inference with Corrupted Data: Measurement Error, Missing Values, Discretization, and Differential Privacy

Abstract: Even the most carefully curated economic data sets have variables that are noisy, missing, discretized, or privatized. The standard workflow for empirical research involves data cleaning followed by data analysis that typically ignores the bias and variance consequences of data cleaning. We formulate a semiparametric model for causal inference with corrupted data to encompass both data cleaning and data analysis. We propose a new end-to-end procedure for data cleaning, estimation, and inference with data clean… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2
1

Relationship

2
3

Authors

Journals

citations
Cited by 8 publications
(15 citation statements)
references
References 37 publications
(82 reference statements)
0
15
0
Order By: Relevance
“…Such a condition is necessary as well for generalization, e.g., if every entry of E[S (k) | E] is equal to 0, then no meaningful model β (k) can learned. Such an assumption has also been explored in Agarwal et al (2021a,b); Agarwal and Singh (2021). In particular, in Agarwal et al (2021b) the authors provide a data-driven hypothesis test to verify when such a condition holds.…”
Section: Discussion Of Assumptionsmentioning
confidence: 94%
See 3 more Smart Citations

Causal Matrix Completion

Agarwal,
Dahleh,
Shah
et al. 2021
Preprint
Self Cite
“…Such a condition is necessary as well for generalization, e.g., if every entry of E[S (k) | E] is equal to 0, then no meaningful model β (k) can learned. Such an assumption has also been explored in Agarwal et al (2021a,b); Agarwal and Singh (2021). In particular, in Agarwal et al (2021b) the authors provide a data-driven hypothesis test to verify when such a condition holds.…”
Section: Discussion Of Assumptionsmentioning
confidence: 94%
“…In Section 3, we propose a causal framework for matrix completion that draws inspiration from the rich and growing literature in econometrics on panel data and matrix completion; some relevant works include Amjad et al (2018Amjad et al ( , 2019 Agarwal and Singh (2021). As is common in matrix completion, these works impose a (approximate) low-rank factor model on the signal matrix (i.e., A), also known as an interactive fixed effects model, to capture structure across units and time (i.e., the rows and columns of the matrix, respectively).…”
Section: Panel Data and Matrix Completionmentioning
confidence: 99%
See 2 more Smart Citations

Causal Matrix Completion

Agarwal,
Dahleh,
Shah
et al. 2021
Preprint
Self Cite
“…Despite the additional complexity of the longitudinal setting, I recover equally strong results. In particular, [54,52,14,28,101,35,31,51,29,2] study static local parameters and [81,50,40,29] study static proximal parameters. Like the latter, I arrive at mean square rates and projected mean square rates of confounding bridges (also called nonparametric instrumental variable regressions).…”
Section: Localization and Proxiesmentioning
confidence: 99%