2018
DOI: 10.1007/s41060-018-0097-y
|View full text |Cite|
|
Sign up to set email alerts
|

Constraint-based causal discovery with mixed data

Abstract: We address the problem of constraint-based causal discovery with mixed data types, such as (but not limited to) continuous, binary, multinomial, and ordinal variables. We use likelihood-ratio tests based on appropriate regression models and show how to derive symmetric conditional independence tests. Such tests can then be directly used by existing constraint-based methods with mixed data, such as the PC and FCI algorithms for learning Bayesian networks and maximal ancestral graphs, respectively. In experiment… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 34 publications
(22 citation statements)
references
References 38 publications
0
22
0
Order By: Relevance
“…We first tested the mixed-type data extension of MIIC network reconstruction method on benchmark mixed-type data. Datasets were generated based on non-linear bayesian rules using the R script provided as Supplementary code; an example of non-Gaussian mixed-type distribution dataset is shown in S10 [25], also designed to analyze mixed-type data. Precision, Comparisons with fully continuous datasets, S13 Fig, were also performed with additional methods, CAM [26], kPC, rank-PC and rank-FCI [27] algorithms, S14 and S15 Figs, and confirm the better performance of MIIC over alternative continuous or mixed-type network learning methods.…”
Section: Application To Benchmark Synthetic Datamentioning
confidence: 99%
See 1 more Smart Citation
“…We first tested the mixed-type data extension of MIIC network reconstruction method on benchmark mixed-type data. Datasets were generated based on non-linear bayesian rules using the R script provided as Supplementary code; an example of non-Gaussian mixed-type distribution dataset is shown in S10 [25], also designed to analyze mixed-type data. Precision, Comparisons with fully continuous datasets, S13 Fig, were also performed with additional methods, CAM [26], kPC, rank-PC and rank-FCI [27] algorithms, S14 and S15 Figs, and confirm the better performance of MIIC over alternative continuous or mixed-type network learning methods.…”
Section: Application To Benchmark Synthetic Datamentioning
confidence: 99%
“…Another confounding example, X Z ! Y, taken from [25] with a uniform categorical Z with three levels, X and Y being continuous, for N = 1, 000 observations. With Z i the binary variable corresponding to the i-th dummy variable of Z, we defined X = −Z 1 + Z 2 + 0.2� X which is centered around either −1 if Z = 1, 0 if Z = 3 or 1 if Z = 2; and Y = Z 1 + Z 2 + 0.2� Y , � � N ð0; 1Þ which is centered around either 0 if Z = 3 or 1 if Z = 1 or Z = 2.…”
Section: S8 Fig Pairwise Dependence and Conditional Independence Betmentioning
confidence: 99%
“…Meanwhile, there has also been more and more evidence demonstrating the possibility of discovering causal relationships by combining both experimental and observational data (Cooper and Yoo, 2013 ; Hauser and Bühlmann, 2015 ; Meinshausen et al, 2016 ). Other notable direction involves learning from mixed data types (continuous and discrete variables) (Andrews et al, 2018 ; Tsagris et al, 2018 ). In principle, our approach can be naturally adapted to handle mixed variable types, as long as an appropriate conditional independence test is employed.…”
Section: Background and Related Workmentioning
confidence: 99%
“…We also extend this method to make it applicable on heterogeneous data. Handling this type of data is a frequent issue in practice, and constitutes an active area of research [12,13].…”
Section: Introductionmentioning
confidence: 99%