Estimating Categorical Counterfactuals via Deep Twin Networks

Vlontzos, Athanasios; Kainz, Bernhard; Gilligan-Lee, Ciarán M.

doi:10.21203/rs.3.rs-1684942/v1

Cited by 4 publications

(6 citation statements)

References 11 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In terms of evaluation, given that CI is not a well-defined task in language processing, the results may be questioned due to their strict dependence on subjective human criteria. This is a clear point of general improvement (beyond the specific purposes of this work) toward the fair assessment of other related CI approaches such as the Twin Networks method to estimate the probabilities of causation (Vlontzos, A., Kainz, B., and Gilligan-Lee, C. M., 2021), the causal regularization of neural networks to improve their interpretability (Bahadori, M. T., Chalupka, K., Choi, E., Chen, R., Stewart, W. F., and Sun, J., 2017; Shen, Z., Cui, P., Kuang, K., Li, B., and Chen, P., 2018), or the learning of causally disentangled representations using Variational Autoencoders (Suter, R., Miladinović, D., Schölkopf, B., and Bauer, S.,, 2019;Yang, M., Liu, F., Chen, Z., Shen, X., Hao, J., and Wang, J., 2020).…”

Section: Discussionmentioning

confidence: 91%

Towards Learning Causal Representations of Technical Word Embeddings for Smart Troubleshooting

Trilla

Mijatović²,

Vilasís-Cardona³

2022

IJPHM

View full text Add to dashboard Cite

This work explores how the causality inference paradigm may be applied to troubleshoot the root causes of failures through language processing and Deep Learning. To do so, the causality hierarchy has been taken for reference: associative, interventional, and retrospective levels of causality have thus been researched within textual data in the form of a failure analysis ontology and a set of written records on Return On Experience. A novel approach to extracting linguistic knowledge has been devised through the joint embedding of two contextualized Bag-Of-Words models, which defines both a probabilistic framework and a distributed representation of the underlying causal semantics. This method has been applied to the maintenance of rolling stock bogies, and the results indicate that the inference of causality has been partially attained with the currently available technical documentation (consensus over 70%). However, there is still some disagreement between root causes and problems that leads to confusion and uncertainty. In consequence, the proposed approach may be used as a strategy to detect lexical imprecision, make writing recommendations in the form of standard reporting guidelines, and ultimately help produce clearer diagnosis materials to increase the safety of the railway service.

show abstract

Section: Discussionmentioning

confidence: 91%

Towards Learning Causal Representations of Technical Word Embeddings for Smart Troubleshooting

Trilla

Mijatović²,

Vilasís-Cardona³

2022

IJPHM

View full text Add to dashboard Cite

show abstract

“…(Pawlowski et al, 2020) developed a normalizing flow model to perform the abduction step in an abduction-action-prediction counterfactual inference task and are able to generate plausible brain MRI volumes. assume a different approach and develop a generative model based on Deep Twin Networks (Vlontzos et al, 2021a). Performing counterfactual inference in the latent space embeddings, the authors are able to generate realistic Ultrasound Videos with different Left Ventricle Ejection Fractions.…”

Section: Generative Methodsmentioning

confidence: 99%

“…Indeed, P (Y x = y ′ | E = e) is given by (Pearl, 2009) u P (Y x (u) = y ′ )P (u|e) . There are two main ways to resolve this type of questions; the Abduction-Action-Prediction paradigm and the Twin Network paradigm shown respectively in ML literature among others in (Castro et al, 2020b;Vlontzos et al, 2021a). In short given SCM M with latent distribution P (U ) and evidence e, the conditional probability P (Y x | e) is evaluated as follows: 1) Abduction: Infer the posterior of the latent variables with evidence e to obtain P (U | e), 2) Action: Apply do(x) to obtain submodel M x , 3) Prediction: Compute the probability of Y in the submodel M x with P (U | e).…”

Section: Structural Causal Modelsmentioning

confidence: 99%

A Review of Causality for Learning Algorithms in Medical Image Analysis

Vlontzos

Rueckert

Kainz

2022

Melba

View full text Add to dashboard Cite

Medical image analysis is a vibrant research area that offers doctors and medical practitioners invaluable insight and the ability to accurately diagnose and monitor disease. Machine learning provides an additional boost for this area. However, machine learning for medical image analysis is particularly vulnerable to natural biases like domain shifts that affect algorithmic performance and robustness. In this paper we analyze machine learning for medical image analysis within the framework of Technology Readiness Levels and review how causal analysis methods can fill a gap when creating robust and adaptable medical image analysis algorithms.<br>We review methods using causality in medical imaging AI/ML and find that causal analysis has the potential to mitigate critical problems for clinical translation but that uptake and clinical downstream research has been limited so far.

show abstract

“…Counterfactuals can be estimated with (i) a three-step procedure [53] ( abduction–action–prediction ) which has been recently enhanced with deep learning [15,92] using generative models such as normalizing flows [93], variational autoencoders [94] and diffusion probabilistic models [95] or (ii) twin networks [96] which augment the original SCM resulting in both factual and counterfactual variables represented simultaneously. Deep twin networks [97] leverage neural networks to further improve flexibility of the causal mechanisms. We note that quantifying the effect of interventions usually assumes that causal models are given either explicitly [15,98] or learned via causal discovery [99].…”

Section: Research Directions In Causal Machine Learningmentioning

confidence: 99%

Causal machine learning for healthcare and precision medicine

et al. 2022

View full text Add to dashboard Cite

Causal machine learning (CML) has experienced increasing popularity in healthcare. Beyond the inherent capabilities of adding domain knowledge into learning systems, CML provides a complete toolset for investigating how a system would react to an intervention (e.g. outcome given a treatment). Quantifying effects of interventions allows actionable decisions to be made while maintaining robustness in the presence of confounders. Here, we explore how causal inference can be incorporated into different aspects of clinical decision support systems by using recent advances in machine learning. Throughout this paper, we use Alzheimer’s disease to create examples for illustrating how CML can be advantageous in clinical scenarios. Furthermore, we discuss important challenges present in healthcare applications such as processing high-dimensional and unstructured data, generalization to out-of-distribution samples and temporal relationships, that despite the great effort from the research community remain to be solved. Finally, we review lines of research within causal representation learning, causal discovery and causal reasoning which offer the potential towards addressing the aforementioned challenges.

show abstract

Estimating Categorical Counterfactuals via Deep Twin Networks

Cited by 4 publications

References 11 publications

Towards Learning Causal Representations of Technical Word Embeddings for Smart Troubleshooting

Towards Learning Causal Representations of Technical Word Embeddings for Smart Troubleshooting

A Review of Causality for Learning Algorithms in Medical Image Analysis

Causal machine learning for healthcare and precision medicine

Contact Info

Product

Resources

About