An active area of research in the fields of machine learning and statistics is the development of causal discovery algorithms, the purpose of which is to infer the causal relations that hold among a set of variables from the correlations that these exhibit . We apply some of these algorithms to the correlations that arise for entangled quantum systems. We show that they cannot distinguish correlations that satisfy Bell inequalities from correlations that violate Bell inequalities, and consequently that they cannot do justice to the challenges of explaining certain quantum correlations causally. Nonetheless, by adapting the conceptual tools of causal inference, we can show that any attempt to provide a causal explanation of nonsignalling correlations that violate a Bell inequality must contradict a core principle of these algorithms, namely, that an observed statistical independence between variables should not be explained by fine-tuning of the causal parameters. In particular, we demonstrate the need for such fine-tuning for most of the causal mechanisms that have been proposed to underlie Bell correlations, including superluminal causal influences, superdeterminism (that is, a denial of freedom of choice of settings), and retrocausal influences which do not introduce causal cycles. 4 Other work in the field of machine learning has appealed to statistical features besides CI relations, but not the features of correlations that are relevant for Bellʼs theorem. Peters et al [8] demonstrate that if one is promised an additive noise model, then features of the joint distribution can often distinguish cause from effect in the case of a distribution on a pair of variables, where there are no CI relations to guide the analysis. Other approaches have appealed to the complexity of conditional distributions [3][4][5]. 5 A defining feature of a common cause is that if the statistical dependence between two variables is to be explained entirely by a common cause, then it must be the case that conditioning on the common cause makes the variables statistically independent. As we will see, this feature is built into the framework of causal models. Statements of Reichenbachʼs principle often assert it explicitly. New J. Phys. 17 (2015) 033002 C J Wood and R W Spekkens New J. Phys. 17 (2015) 033002 C J Wood and R W Spekkens