2016
DOI: 10.1073/pnas.1511656113
|View full text |Cite
|
Sign up to set email alerts
|

Modeling confounding by half-sibling regression

Abstract: We describe a method for removing the effect of confounders to reconstruct a latent quantity of interest. The method, referred to as "half-sibling regression," is inspired by recent work in causal inference using additive noise models. We provide a theoretical justification, discussing both independent and identically distributed as well as time series data, respectively, and illustrate the potential of the method in a challenging astronomy application.machine learning | causal inference | astronomy | exoplane… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
63
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 60 publications
(63 citation statements)
references
References 17 publications
0
63
0
Order By: Relevance
“…Finally, Schölkopf et al (13) present "Modeling confounding by half-sibling regression," showing how to remove the effect of confounders in large-scale data, and give an application to astronomy.…”
mentioning
confidence: 99%
“…Finally, Schölkopf et al (13) present "Modeling confounding by half-sibling regression," showing how to remove the effect of confounders in large-scale data, and give an application to astronomy.…”
mentioning
confidence: 99%
“…Our confounder correction strategy is largely inspired by existing methods in genomics [11,36] and astrophysics [37]. In genomics, the RUV methods (removing unwanted variance) identify "control" genes or samples, which are not affected by case-control labels, and perform principal component analysis (PCA) on the control data to ascertain biases introduced by technical covariates.…”
Section: Discussionmentioning
confidence: 99%
“…The half-sibling regression [37] also adopts a similar idea, but without PCA, measurements of control variates are directly included in a regression model. Sparse modeling of genetic effects proves to be indispensable in high-dimensional multivariate analysis.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…A general approach to detecting and controlling for confounds remains an open area of research in machine learning (see Refs. and for promising advances in this area). Rather than relying solely on a predictive approach, we contend that domain knowledge can also be used to explain an algorithm's performance, and applied to determine which confounds are plausible, as in the examples below.…”
Section: New Developments—the Machine‐learning Paradigm and “Big Data”mentioning
confidence: 99%