Causal Matrix Completion

Agarwal, Anish; Dahleh, Munther; Shah, Devavrat; Shen, Dennis

doi:10.48550/arxiv.2109.15154

Cited by 3 publications

(4 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, we used the same code to analyze the six influenza studies (with rank between 6 and 9) and the HIV-1 Catnap dataset containing hundreds of studies (with rank 23). We also found that nuclear norm minimization performed better than other imputation approaches (e.g., mean imputation or kNN regression; Figure S12 and STAR Methods) and also outperformed a recently published method of causal matrix completion developed to specifically complete data with values that are missing in non-random (structured) patterns (Agarwal et al, 2021). That said, many other forms of matrix completion exist, and our approach could be further refined by incorporating side information such as virus sequence or antibody isotype (Radhakrishnan et al, 2022).…”

Section: Llmentioning

confidence: 55%

“…Moreover, we tested a recently-published method of causal matrix completion that was developed to specifically deal with scenarios in which data are not missing at random, and in which the availability of data may be correlated with the outcome of the experiment (Agarwal et al, 2021). This could be relevant, for instance, in matrix completion of the HIV-1 Catnap data through a given date, since measurements made by the community were not randomly chosen, but rather chosen to be informative for a specific study.…”

Section: Comparison With Other Matrix Completion Algorithmsmentioning

confidence: 99%

See 1 more Smart Citation

Extrapolating missing antibody-virus measurements across serological studies

Einav

Cleary

2022

Cell Systems

View full text Add to dashboard Cite

Section: Llmentioning

confidence: 55%

Section: Comparison With Other Matrix Completion Algorithmsmentioning

confidence: 99%

Extrapolating missing antibody-virus measurements across serological studies

Einav

Cleary

2022

Cell Systems

View full text Add to dashboard Cite

“…Recently, there has been a growing literature that approaches causal inference from a matrix completion perspective. Proposals include approximating the control unit matrix using nuclear-norm minimization (Athey et al, 2021), using singular value decomposition (Amjad et al, 2018), or by finding nearest neighbors (Agarwal et al, 2021) for missing entries of a matrix to best match control units and the treated unit of interest.…”

Section: Introductionmentioning

confidence: 99%

Optimal Recovery for Causal Inference

Ferwana¹,

Varshney²

2022

Preprint

View full text Add to dashboard Cite

It is crucial to successfully quantify causal effects of a policy intervention to determine whether the policy achieved the desired outcomes. We present a deterministic approach to a classical method of policy evaluation, synthetic control (Abadie and Gardeazabal, 2003), to estimate the unobservable outcome of a treatment unit using ellipsoidal optimal recovery (EOpR). EOpR provides policy evaluators with "worst-case" outcomes and "typical" outcomes to help in decision making. It is an approximation-theoretic technique that also relates to the theory of principal components, which recovers unknown observations given a learned signal class and a set of known observations. We show that EOpR can improve pre-treatment fit and bias of the post-treatment estimation relative to other econometrics methods. Beyond recovery of the unit of interest, an advantage of EOpR is that it produces worst-case estimates over the estimations produced by the recovery. We assess our approach on artificially-generated data, on datasets commonly used in the econometrics literature, and also derive results in the context of the COVID-19 pandemic. Such an approach is novel in the econometrics literature for causality and policy evaluation.

show abstract

“…This problem can be reduced to matrix completion, where rows index users and columns index items; each missing user-item entry corresponds to the potential rating a user would give to that item had they rated it. To motivate the importance of studying the missingness mechanism, we showcase two experiments (details inAgarwal et al (2021b)), one with MCAR and the other with MNAR data in Figures 1.6a and 1.7a. We use three matrix completion algorithms to recover the distribution of true ratings given a subset of revealed ratings: (i) Universal singular value thresholding (USVT) a popular spectral based method; (ii) Softimpute (softImpute), a popular optimization based method; (iii) "synthetic nearest neighbors" (SNN), our proposed method.…”

mentioning

confidence: 99%

Causal Inference for Social and Engineering Systems

Agarwal

2022

SIGMETRICS Perform. Eval. Rev.

View full text Add to dashboard Cite

What will happen to Y if we do A? A variety of meaningful social and engineering questions can be formulated this way: What will happen to a patient's health if they are given a new therapy? What will happen to a country's economy if policy-makers legislate a new tax? What will happen to a data center's latency if a new congestion control protocol is used? We explore how to answer such counterfactual questions using observational data-which is increasingly available due to digitization and pervasive sensors-and/or very limited experimental data. The two key challenges are: (i) counterfactual prediction in the presence of latent confounders; (ii) estimation with modern datasets which are high-dimensional, noisy, and sparse. The key framework we introduce is connecting causal inference with tensor completion. In particular, we represent the various potential outcomes (i.e., counterfactuals) of interest through an order-3 tensor. The key theoretical results presented are: (i) Formal identification results establishing under what missingness patterns, latent confounding, and structure on the tensor is recovery of unobserved potential outcomes possible. (ii) Introducing novel estimators to recover these unobserved potential outcomes and proving they are finite-sample consistent and asymptotically normal. Finally, we discuss connections between matrix/tensor completion and time series analysis and reinforcement learning; we believe this could serve as a basis to do counterfactual forecasting, and building data-driven simulators for reinforcement learning.

show abstract

Causal Matrix Completion

Cited by 3 publications

References 32 publications

Extrapolating missing antibody-virus measurements across serological studies

Extrapolating missing antibody-virus measurements across serological studies

Optimal Recovery for Causal Inference

Causal Inference for Social and Engineering Systems

Contact Info

Product

Resources

About