2019
DOI: 10.1080/01621459.2019.1609973
|View full text |Cite
|
Sign up to set email alerts
|

Combining Multiple Observational Data Sources to Estimate Causal Effects

Abstract: The era of big data has witnessed an increasing availability of multiple data sources for statistical analyses. We consider estimation of causal effects combining big main data with unmeasured confounders and smaller validation data with supplementary information on these confounders. Under the unconfoundedness assumption with completely observed confounders, the smaller validation data allow for constructing consistent estimators for causal effects, but the big main data can only give error-prone estimators i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
43
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 57 publications
(43 citation statements)
references
References 86 publications
0
43
0
Order By: Relevance
“…Such penalty functions can be potentially useful for handling high dimensional covariate problems. Also, the proposed method can be used for causal inference, including estimation of average treatment effect from observational studies (Morgan and Winship, 2014;Yang and Ding, 2020). Developing tools for causal inference using the kernel ridge regression-based propensity score method will be an important extension of this research.…”
Section: Discussionmentioning
confidence: 99%
“…Such penalty functions can be potentially useful for handling high dimensional covariate problems. Also, the proposed method can be used for causal inference, including estimation of average treatment effect from observational studies (Morgan and Winship, 2014;Yang and Ding, 2020). Developing tools for causal inference using the kernel ridge regression-based propensity score method will be an important extension of this research.…”
Section: Discussionmentioning
confidence: 99%
“…Motivated by the observation that the treatment effects on the secondary outcome should be similar in the RCT and observational data if X is sufficient for Assumption 2, [45] developed a control function method for using differences in the estimated causal effects on the secondary outcome between the two samples to adjust estimation of the treatment effect on the primary outcome. [46] considered the scenario where a small validation dataset with all confounders (Y, A, X 1 , U, S = 1) and a big main dataset with unmeasured confounders (Y, A, X 1 , S = 0) are available. Both are random samples of the target population hence external validity is satisfied.…”
Section: Correcting For Bias In Observational Study Using Validation ...mentioning
confidence: 99%
“…In presence of this type of auxiliary data, methods based on the generalized method of moment, generalized regression, weight calibration, constrained maximum likelihood, empirical likelihood, etc., have been proposed to borrow auxiliary information to power up the main study. [1][2][3][4][5][6][7][8][9][10][11][12][13] In this article, we consider a different type of auxiliary data that is also widely seen in applications, that is, an auxiliary measurement collected in the same study but served as the outcome in a secondary analysis. Usually, such kind of auxiliary measurement is highly associated with the primary outcome, and how to incorporate this secondary information to enhance estimation precision for the main analysis is of high interest.…”
Section: Introductionmentioning
confidence: 99%