2016 IEEE 16th International Conference on Data Mining (ICDM) 2016
DOI: 10.1109/icdm.2016.0015
|View full text |Cite
|
Sign up to set email alerts
|

Causal Inference by Compression

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
31
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
3
3
1

Relationship

3
4

Authors

Journals

citations
Cited by 27 publications
(31 citation statements)
references
References 17 publications
0
31
0
Order By: Relevance
“…However, there is one particular area which we would like to mention specifically, since that area had hardly seen any MDL applications before 2007 whereas nowadays such applications are flourishing: this is the field of data mining. Some representative publications are Vreeken et al [2011], Koutra et al [2015], Budhathoki et al [2018]. Most of this work centers on the use of two-part codes, but sometimes NML and other sophisticated universal distributions/codes are used as well [Tatti and Vreeken, 2008].…”
Section: Discussionmentioning
confidence: 99%
“…However, there is one particular area which we would like to mention specifically, since that area had hardly seen any MDL applications before 2007 whereas nowadays such applications are flourishing: this is the field of data mining. Some representative publications are Vreeken et al [2011], Koutra et al [2015], Budhathoki et al [2018]. Most of this work centers on the use of two-part codes, but sometimes NML and other sophisticated universal distributions/codes are used as well [Tatti and Vreeken, 2008].…”
Section: Discussionmentioning
confidence: 99%
“…Mooij instantiates the first practical compression-based approach [18] using the Minimum Message Length. Budhathoki and Vreeken approximate K(X) and K(Y | X) through MDL, and propose Origo, a decision tree based approach for causal inference on multivariate binary data [1]. Marx and Vreeken [17] proposed Slope, an MDL based method employing local and global regression for univariate numeric data.…”
Section: Related Workmentioning
confidence: 99%
“…Our approach is based on algorithmic information theory. That is, we follow the postulate that if X → Y , it will be easier-in terms of Kolmogorov complexityto first describe X, and then describe Y given X, than vice-versa [11,33,1]. Kolmogorov complexity is not computable, but can be approximated through the Minimum Description Length (MDL) principle [26,5], which we use to instantiate this framework.…”
Section: Introductionmentioning
confidence: 99%
“…Vreeken [29] proposed to approximate the Kolmogorov complexity for numeric data using the cumulative residual entropy, and gave an instantiation for multivariate continuous-valued data. Perhaps most related to SLOPE is ORIGO [4], which uses MDL to infer causal direction on binary data, whereas we focus on univariate numeric data.…”
Section: Related Workmentioning
confidence: 99%