Causal inference on DAGs

Confounding! This scientist performed a miracle graph surgery intervention and you won’t believe what happened next

2016-10-26 — 2025-02-24

algebra

causal

graphical models

how do science

machine learning

networks

probability

statistics

Suspiciously similar content

Making valid statistical inference, in the sense of making inference that is compatible with our understanding of the causal relationships that exist in the world (not just the correlations in our data). Graphical models and related techniques for doing it. Avoiding the danger of folk statistics. Observational studies, confounding, adjustment criteria, d-separation, identifiability, interventions, moral equivalence… Avoidance of Ecological fallacy/ Simpson’s paradox.

The gold standard, of course, is to work out if A causes B by doing an experiment where no input but A changes, then observing B, which is what a controlled trial is. In practice this is unattainable because it would usually require cloning the entire state of the universe and running multiple copies in parallel. Statistically it can be nearly as good to do the experiment where we change A and all other influences apart from A are at least uncorrelated with A, which is more usually what we do — a randomised controlled trial. In many circumstances though, (budget restrictions, ethical constraints, bad experimental design…) we cannot do these ideal experiments, and a mathematical crutch is needed to get us the next-best outcome, which is to control for the things that we must (and not control for the things we must not).

In classic-flavoured causal inference, we use graphical models with the additional assumption that $A \to B$ may be read as “A causes a change in B”. This is what happens if you use a Structural Equation Model (also known as hierarchical models) to impose a causal structure on the observations. The result is a particular type of graph, a Directed Acyclic Graph (DAG) which, informally put, summarises what can possibly affect what in the model. Slightly more formally, it summarises what cannot (conditionally) affect what. C&C conditional treatment effect estimation by potential outcomes.

With this tool in hand, I can answer the question of when I can use my crappy observational data, collected without a good experimental design for whatever reason, to do interventional inference? There is a lot of research in this area; I should summarise the salient bits for myself.

1 do-calculus

2 Stochastic interventions

Stochastic interventions (Correa and Bareinboim 2020) also known as soft interventions (Massidda et al. 2023).

TBC

3 Mechanised causality

What if some of the nodes in the graph are active, meaning they will change their policies to change the outcome? This can still be handled within the graphical model framework; see Causality and agency.

4 What can go wrong?

What can I actually identify? For a start, if we are resorting to this more difficult methodology, that already suggests that we might be trying to use data which was collected with no regard to our actual statistical needs, and thus we might really stretch to imagine that we can actually find appropriate instruments in the data. Here is an essay on that theme.

OMFG Exogenous Variation! Or, Can You Find Good Nails When You Find an Indonesian Politics Hammer quotes Angus Deaton

we have at least some control over the light but choose to let it fall where it may and then proclaim that whatever it illuminates is what we were looking for all along

5 Teaching

Yanir Seroussi’s Causal inference resources recommends

Causal Diagrams: Draw Your Assumptions Before Your Conclusions. A high-level introduction to causal diagrams by Miguel Hernán. Highly recommended for those who want to get a conceptual overview of how causal diagrams work and why they’re useful.

A/B Testing by Google: Online Experiment Design and Analysis. Experimentation is key to causal inference, with the online world offering an accessible ground for running experiments. This short course is worth doing if you’re involved in online experiments in any way.

Miguel Hernán and Jamie Robins’ causal inference book (Miguel A. Hernán and Robins 2020) is available in free draft form online. See Yanir Seroussi’s review.

Jonas Peters’ notes from his teaching in 2015 (I think I took this course).

Samantha Kleinberg wrote two classes, introductory and advanced. The latter is notable for handling time-dependent causality.

Tutorial: David Sontag and Uri Shalit, Causal inference from observational studies. Mastering Metrics: The Path from Cause to Effect A resource list for causality in statistics, data science and physics.

Brady Neal’s Introduction to Causal Inference includes his draft textbook.

Felix Elwert’s summary is good (Elwert 2013).

Chapter 3 of (some edition of) Pearl’s book is available as an author’s preprint: Part 1, 2, 3, 4, 5, 6.

Stanford encyclopaedia of philosophy entry.

Various classic introductions (Pearl 2012, 1998; Elwert 2013; Morgan and Winship 2015; Rohrer 2018). Notably not recommended as a pedagogic experience (Koller and Friedman 2009) (although as a reference text it is great and will make you smarter).

The dagitty intro is an interactive guide via visualizations. Likewise, the ggdag bias structure vignette shows off the useful explanation diagrams available in ggdag and is also a good introduction to selection bias and causal DAGs themselves.

Amit Sharma’s tutorial at KDD. See also Emily Riederer’s Causal design patterns for data analysts Spurious correlation induced by sampling bias.

Still confused? Overwhelmed? I am. How about a diagram?

Figure 6: Brady Neal recommends Which causal inference book you should read.

6 Instrumental variables

See instrumental variables.

7 Incoming

Causal Reinforcement Learning
Stephen Malina — Deriving the front-door criterion with the do-calculus
causalscience.org aims to bring academia and industry together to advance causal inference in practice.
Difference-in-differences, Average Treatment Effects and the Importance of Mechanisms: Part 2
Scott Cunningham, Causal Inference The Mixtape (Cunningham 2021)
Nick Huntington-Klein, The Effect: An Introduction to Research Design and Causality (Huntington-Klein 2022)

8 References

Aalen, Røysland, Gran, et al. 2016. “Can We Believe the DAGs? A Comment on the Relationship Between Causal DAGs and Mechanisms.” Statistical Methods in Medical Research.

Achab, Bacry, Gaïffas, et al. 2017. “Uncovering Causality from Multivariate Hawkes Integrated Cumulants.” In PMLR.

Allen, Barrett, Horsman, et al. 2017. “Quantum Common Causes and Quantum Causal Models.” Physical Review X.

Aragam, Gu, and Zhou. 2017. “Learning Large-Scale Bayesian Networks with the Sparsebn Package.” arXiv:1703.04025 [Cs, Stat].

Aral, Muchnik, and Sundararajan. 2009. “Distinguishing Influence-Based Contagion from Homophily-Driven Diffusion in Dynamic Networks.” Proceedings of the National Academy of Sciences.

Arjovsky, Bottou, Gulrajani, et al. 2020. “Invariant Risk Minimization.”

Arnold, Castillo, and Sarabia. 1999. Conditional Specification of Statistical Models.

Athey, and Wager. 2019. “Estimating Treatment Effects with Causal Forests: An Application.” arXiv:1902.07409 [Stat].

Bahadori, Chalupka, Choi, et al. 2017. “Neural Causal Regularization Under the Independence of Mechanisms Assumption.” arXiv:1702.02604 [Cs, Stat].

Bareinboim, and Pearl. 2016. “Causal Inference and the Data-Fusion Problem.” Proceedings of the National Academy of Sciences.

Bareinboim, Tian, and Pearl. 2014. “Recovering from Selection Bias in Causal and Statistical Inference.” In AAAI.

Barnum, Barrett, Clark, et al. 2010. “Entropy and Information Causality in General Probabilistic Theories.” New Journal of Physics.

Besserve, Mehrjou, Sun, et al. 2019. “Counterfactuals Uncover the Modular Structure of Deep Generative Models.” In arXiv:1812.03253 [Cs, Stat].

Blom, Bongers, and Mooij. 2020. “Beyond Structural Causal Models: Causal Constraints Models.” In Uncertainty in Artificial Intelligence.

Blom, and Mooij. 2020. “Robust Model Predictions via Causal Ordering.” In.

Bloniarz, Liu, Zhang, et al. 2015. “Lasso Adjustments of Treatment Effect Estimates in Randomized Experiments.” arXiv:1507.03652 [Math, Stat].

Bonchi, Gullo, Mishra, et al. 2018. “Probabilistic Causal Analysis of Social Influence.” In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. CIKM ’18.

Bongers, Forré, Peters, et al. 2021. “Foundations of Structural Causal Models with Cycles and Latent Variables.” The Annals of Statistics.

Bongers, and Mooij. 2018. “From Random Differential Equations to Structural Causal Models: The Stochastic Case.” arXiv:1803.08784 [Cs, Stat].

Bongers, Peters, Schölkopf, et al. 2016. “Structural Causal Models: Cycles, Marginalizations, Exogenous Reparametrizations and Reductions.” arXiv:1611.06221 [Cs, Stat].

Bottou, Peters, Quiñonero-Candela, et al. 2013. “Counterfactual Reasoning and Learning Systems.” arXiv:1209.2355 [Cs, Math, Stat].

Braunstein, and Ingrosso. 2016. “Inference of Causality in Epidemics on Temporal Contact Networks.” Scientific Reports.

Bright, Malinsky, and Thompson. 2016. “Causally Interpreting Intersectionality Theory.” Philosophy of Science.

Brito, and Pearl. 2002. “A New Identification Condition for Recursive Models With Correlated Errors.” Structural Equation Modeling: A Multidisciplinary Journal.

———. 2012. “Generalized Instrumental Variables.” arXiv:1301.0560 [Cs].

Brodersen, Gallusser, Koehler, et al. 2015. “Inferring Causal Impact Using Bayesian Structural Time-Series Models.” The Annals of Applied Statistics.

Bühlmann. 2013. “Causal Statistical Inference in High Dimensions.” Mathematical Methods of Operations Research.

———. 2020. “Invariance, Causality and Robustness.” Statistical Science.

Bühlmann, Kalisch, and Meier. 2014. “High-Dimensional Statistics with a View Toward Applications in Biology.” Annual Review of Statistics and Its Application.

Bühlmann, Peters, Ernest, et al. 2014. “Predicting Causal Effects in High-Dimensional Settings.”

Bühlmann, Rütimann, and Kalisch. 2013. “Controlling False Positive Selections in High-Dimensional Regression and Causal Inference.” Statistical Methods in Medical Research.

Chalak, and White. 2012. “Causality, Conditional Independence, and Graphical Separation in Settable Systems.” Neural Computation.

Chau, Ton, González, et al. 2021. “BayesIMP: Uncertainty Quantification for Causal Data Fusion.”

Chaves, Lemos, and Pienaar. 2018. “Causal Modeling the Delayed-Choice Experiment.” Physical Review Letters.

Chen, and Pearl. 2012. “Regression and Causation: A Critical Examination of Econometric Textbooks.”

Christiansen, Pfister, Jakobsen, et al. 2020. “A Causal Framework for Distribution Generalization.”

Claassen, Mooij, and Heskes. 2014. “Proof Supplement - Learning Sparse Causal Models Is Not NP-Hard (UAI2013).” arXiv:1411.1557 [Stat].

Colombo, Maathuis, Kalisch, et al. 2012. “Learning High-Dimensional Directed Acyclic Graphs with Latent and Selection Variables.” The Annals of Statistics.

Cornish, Taufiq, Doucet, et al. 2023. “Causal Falsification of Digital Twins.”

Correa, and Bareinboim. 2020. “A Calculus for Stochastic Interventions:Causal Effect Identification and Surrogate Experiments.” Proceedings of the AAAI Conference on Artificial Intelligence.

Cunningham. 2021. Causal Inference: The Mixtape.

Dash. 2003. “Caveats For Causal Reasoning With Equilibrium Models.”

Dawid. 2021. “Decision-Theoretic Foundations for Statistical Causality.” Journal of Causal Inference.

De Luna, Waernbaum, and Richardson. 2011. “Covariate Selection for the Nonparametric Estimation of an Average Treatment Effect.” Biometrika.

Dhir, Ashman, Requeima, et al. 2024. “A Meta-Learning Approach to Bayesian Causal Discovery.” In.

Didelez. 2015. “Causal Reasoning for Events in Continuous Time: A Decision–Theoretic Approach.” In.

Duong, Gupta, and Nguyen. 2024. “Causal Discovery via Bayesian Optimization.” In.

Duvenaud, Eaton, Murphy, et al. 2010. “Causal Learning Without DAGs.” In NIPS Causality: Objectives and Assessment.

Eichler. 2001. “Granger-Causality Graphs for Multivariate Time Series.” Granger-Causality Graphs for Multivariate Time Series.

Elwert. 2013. “Graphical Causal Models.” In Handbook of Causal Analysis for Social Research. Handbooks of Sociology and Social Research.

Entner, Hoyer, and Spirtes. 2013. “Data-Driven Covariate Selection for Nonparametric Estimation of Causal Effects.” In Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics.

Ernest, and Bühlmann. 2014. “Marginal Integration for Fully Robust Causal Inference.” arXiv:1405.1868 [Stat].

Fernández-Loría, and Provost. 2021. “Causal Decision Making and Causal Effect Estimation Are Not the Same… and Why It Matters.” arXiv:2104.04103 [Cs, Stat].

Fixx. 1977. Games for the superintelligent.

Foreman-Mackey, Montet, Hogg, et al. 2015. “A Systematic Search for Transiting Planets in the K2 Data.” The Astrophysical Journal.

Fu, and Zhou. 2013. “Learning Sparse Causal Gaussian Networks With Experimental Intervention: Regularization and Coordinate Descent.” Journal of the American Statistical Association.

Gebharter, and Retzlaff. 2020. “A New Proposal How to Handle Counterexamples to Markov Causation à La Cartwright, or: Fixing the Chemical Factory.” Synthese.

Geiger, Ibeling, Zur, et al. 2024. “Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability.”

Gelman. 2010. “Causality and Statistical Learning.” American Journal of Sociology.

Gelman, and Meng. 2004. Applied Bayesian Modeling and Causal Inference From Incomplete-Data Perspectives.

Gendron, Witbrock, and Dobbie. 2023. “A Survey of Methods, Challenges and Perspectives in Causality.”

Genewein, McGrath, Déletang, et al. 2020. “Algorithms for Causal Reasoning in Probability Trees.”

Geng, Liu, Liu, et al. 2019. “Evaluation of Causal Effects and Local Structure Learning of Causal Networks.” Annual Review of Statistics and Its Application.

Glymour. 1998. “What Went Wrong? Reflections on Science by Observation and the Bell Curve.” Philosophy of Science.

Gordon, Moakler, and Zettelmeyer. 2023. “Predictive Incrementality by Experimentation (PIE) for Ad Measurement.”

Gu, Fu, and Zhou. 2014. “Adaptive Penalized Estimation of Directed Acyclic Graphs From Categorical Data.” arXiv:1403.2310 [Stat].

Guo, Tóth, Schölkopf, et al. 2022. “Causal de Finetti: On the Identification of Invariant Causal Structure in Exchangeable Data.”

Hansen, and Sokol. 2014. “Causal Interpretation of Stochastic Differential Equations.” Electronic Journal of Probability.

Hernán, Miguel A. 2016. “Does Water Kill? A Call for Less Casual Causal Inferences.” Annals of Epidemiology.

———. 2018. “The C-Word: Scientific Euphemisms Do Not Improve Causal Inference From Observational Data.” American Journal of Public Health.

Hernán, Miguel, and Robins. 2019a. Causal Inference Vol 3.

———. 2019b. Causal Inference Vol 2.

———. 2019c. Causal Inference Vol 1.

Hernán, Miguel A, and Robins. 2020. Causal Inference: What If.

Hinton, Osindero, and Bao. 2005. “Learning Causally Linked Markov Random Fields.” In Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics.

Hoyer, Janzing, Mooij, et al. 2009. “Nonlinear Causal Discovery with Additive Noise Models.” In Advances in Neural Information Processing Systems 21.

Huang, and Kleinberg. 2015. “Fast and Accurate Causal Inference from Time Series Data.” In.

Hult, and Zachariah. 2020. “Inference of Causal Effects When Adjustment Sets Are Unknown.” arXiv:2012.08154 [Cs, Stat].

Huntington-Klein. 2022. The Effect: An Introduction to Research Design and Causality.

Hyttinen, Eberhardt, and Järvisalo. n.d. “Do-Calculus When the True Graph Is Unknown.”

Imbens, and Menzel. 2021. “A Causal Bootstrap.” The Annals of Statistics.

Janzing, Mooij, Zhang, et al. 2012. “Information-Geometric Approach to Inferring Causal Directions.” Artificial Intelligence.

Janzing, and Schölkopf. 2010. “Causal Inference Using the Algorithmic Markov Condition.” IEEE Transactions on Information Theory.

Johansson, Fredrik, Shalit, and Sontag. 2016. “Learning Representations for Counterfactual Inference.” In International Conference on Machine Learning.

Johansson, Fredrik D., Shalit, and Sontag. 2018. “Learning Representations for Counterfactual Inference.” arXiv:1605.03661 [Cs, Stat].

Jordan, Michael Irwin. 1999. Learning in Graphical Models.

Jordan, Michael I., Wang, and Zhou. 2022. “Empirical Gateaux Derivatives for Causal Inference.”

Jordan, Michael I., and Weiss. 2002a. “Graphical Models: Probabilistic Inference.” The Handbook of Brain Theory and Neural Networks.

———. 2002b. “Probabilistic Inference in Graphical Models.” Handbook of Neural Networks and Brain Theory.

Jørgensen, Gresele, and Weichwald. 2025. “What Is Causal about Causal Models and Representations?”

Kalainathan, Goudet, and Dutta. 2020. “Causal Discovery Toolbox: Uncovering Causal Relationships in Python.” Journal of Machine Learning Research.

Kalisch, and Bühlmann. 2007. “Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm.” Journal of Machine Learning Research.

Kallus. 2020. “Generalized Optimal Matching Methods for Causal Inference.” Journal of Machine Learning Research.

Kennedy. 2015. “Semiparametric Theory and Empirical Processes in Causal Inference.” arXiv Preprint arXiv:1510.04740.

Kilbertus, Rojas Carulla, Parascandolo, et al. 2017. “Avoiding Discrimination Through Causal Reasoning.” In Advances in Neural Information Processing Systems 30.

Kim, and Pearl. 1983. “A Computational Model for Causal and Diagnostic Reasoning in Inference Systems.” In IJCAI.

Kleinberg. 2012. Causality, Probability, and Time.

———. 2015. Why: A Guide to Finding and Using Causes.

Kocaoglu, Snyder, Dimakis, et al. 2017. “CausalGAN: Learning Causal Implicit Generative Models with Adversarial Training.” arXiv:1709.02023 [Cs, Math, Stat].

Kohavi, Tang, and Xu. 2020. Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing.

Kohler, Kreuter, and Stuart. 2019. “Nonprobability Sampling and Causal Analysis.” Annual Review of Statistics and Its Application.

Koller, and Friedman. 2009. Probabilistic Graphical Models : Principles and Techniques.

Künzel, Sekhon, Bickel, et al. 2019. “Metalearners for Estimating Heterogeneous Treatment Effects Using Machine Learning.” Proceedings of the National Academy of Sciences.

Lauritzen, Steffen L. 1996. Graphical Models. Oxford Statistical Science Series.

———. 2000. “Causal Inference from Graphical Models.” In Complex Stochastic Systems.

Lauritzen, S. L., and Spiegelhalter. 1988. “Local Computations with Probabilities on Graphical Structures and Their Application to Expert Systems.” Journal of the Royal Statistical Society. Series B (Methodological).

Lee, and Bareinboim. 2021. “Causal Identification with Matrix Equations.” In.

———. n.d. “Causal Effect Identifiability Under Partial-Observability.”

Li, and Liu. 2024. “Efficient and Trustworthy Causal Discovery with Latent Variables and Complex Relations.” In.

Locatello, Bauer, Lucic, et al. 2019. “Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations.” In Proceedings of the 36th International Conference on Machine Learning.

Lopez-Paz, Nishihara, Chintala, et al. 2016. “Discovering Causal Signals in Images.” arXiv:1605.08179 [Cs, Stat].

Louizos, Shalit, Mooij, et al. 2017. “Causal Effect Inference with Deep Latent-Variable Models.” In Advances in Neural Information Processing Systems 30.

Maathuis, and Colombo. 2013. “A Generalized Backdoor Criterion.” arXiv Preprint arXiv:1307.5636.

Maathuis, Colombo, Kalisch, et al. 2010. “Predicting Causal Effects in Large-Scale Systems from Observational Data.” Nature Methods.

Maathuis, Kalisch, and Bühlmann. 2009. “Estimating High-Dimensional Intervention Effects from Observational Data.” The Annals of Statistics.

Malinsky, Shpitser, and Richardson. 2019. “A Potential Outcomes Calculus for Identifying Conditional Path-Specific Effects.” arXiv:1903.03662 [Stat].

Marbach, Prill, Schaffter, et al. 2010. “Revealing Strengths and Weaknesses of Methods for Gene Network Inference.” Proceedings of the National Academy of Sciences.

Marks, Rager, Michaud, et al. 2024. “Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models.” In.

Massidda, Geiger, Icard, et al. 2023. “Causal Abstraction with Soft Interventions.” In Proceedings of the Second Conference on Causal Learning and Reasoning.

Meinshausen. 2018. “Causality from a Distributional Robustness Point of View.” In 2018 IEEE Data Science Workshop (DSW).

Messerli. 2012. “Chocolate Consumption, Cognitive Function, and Nobel Laureates.” New England Journal of Medicine.

Mihalkova, and Mooney. 2007. “Bottom-up Learning of Markov Logic Network Structure.” In Proceedings of the 24th International Conference on Machine Learning.

Mogensen, Malinsky, and Hansen. 2018. “Causal Learning for Partially Observed Stochastic Dynamical Systems.” In UAI2018.

Montanari. 2011. “Lecture Notes for Stat 375 Inference in Graphical Models.”

Mooij, Peters, Janzing, et al. 2016. “Distinguishing Cause from Effect Using Observational Data: Methods and Benchmarks.” Journal of Machine Learning Research.

Morgan, and Winship. 2015. Counterfactuals and Causal Inference.

Msaouel. 2022. “The Big Data Paradox in Clinical Practice.” Cancer Investigation.

Murphy. 2012. Machine learning: a probabilistic perspective. Adaptive computation and machine learning series.

Murray, Swanson, and Hernán. 2019. “Guidelines for Estimating Causal Effects in Pragmatic Randomized Trials.” arXiv:1911.06030 [Stat].

Neal. 2020. “Introduction to Causal Inference from a Machine Learning Perspective.” Course Lecture Notes (Draft).

Neapolitan. 2003. Learning Bayesian Networks.

Ng, Fang, Zhu, et al. 2020. “Masked Gradient-Based Causal Structure Learning.” arXiv:1910.08527 [Cs, Stat].

Ng, Zhu, Chen, et al. 2019. “A Graph Autoencoder Approach to Causal Structure Learning.” In Advances In Neural Information Processing Systems.

Nilsson, Bonander, Strömberg, et al. 2021. “A Directed Acyclic Graph for Interactions.” International Journal of Epidemiology.

Noel, and Nyhan. 2011. “The ‘Unfriending’ Problem: The Consequences of Homophily in Friendship Retention for Causal Estimates of Social Influence.” Social Networks.

Ormaniec, Sussex, Lorch, et al. 2024. “Standardizing Structural Causal Models.” In.

Ortega, Kunesch, Delétang, et al. 2021. “Shaking the Foundations: Delusions in Sequence Models for Interaction and Control.” arXiv:2110.10819 [Cs].

Pawlowski, Paterek, Kaszlikowski, et al. 2009. “Information Causality as a Physical Principle.” Nature.

Pearl. 1982. “Reverend Bayes on Inference Engines: A Distributed Hierarchical Approach.” In Proceedings of the Second AAAI Conference on Artificial Intelligence. AAAI’82.

———. 1986. “Fusion, Propagation, and Structuring in Belief Networks.” Artificial Intelligence.

———. 1998. “Graphical Models for Probabilistic and Causal Reasoning.” In Quantified Representation of Uncertainty and Imprecision. Handbook of Defeasible Reasoning and Uncertainty Management Systems.

———. 2008. Probabilistic reasoning in intelligent systems: networks of plausible inference. The Morgan Kaufmann series in representation and reasoning.

———. 2009a. “Causal Inference in Statistics: An Overview.” Statistics Surveys.

———. 2009b. Causality: Models, Reasoning and Inference.

———. 2010. “The Foundations of Causal Inference.” Sociological Methodology.

———. 2011. “Simpson’s Paradox: An Anatomy.”

———. 2012. “The Do-Calculus Revisited Judea Pearl Keynote Lecture, August 17, 2012 UAI-2012 Conference, Catalina, CA.” Edited by Nando de Freitas and Kevin Murphy.

Pearl, and Bareinboim. 2014. “External Validity: From Do-Calculus to Transportability Across Populations.” Statistical Science.

Pearl, Glymour, and Jewell. 2016. Causal Inference in Statistics: A Primer.

Peters. 2015. “Causality Lecture Notes.”

Peters, Bühlmann, and Meinshausen. 2015. “Causal Inference Using Invariant Prediction: Identification and Confidence Intervals.” arXiv:1501.01332 [Stat].

Peters, Janzing, and Schölkopf. 2017. Elements of Causal Inference: Foundations and Learning Algorithms. Adaptive Computation and Machine Learning Series.

Peters, Mooij, Janzing, et al. 2014. “Causal Discovery with Continuous Additive Noise Models.” The Journal of Machine Learning Research.

Prashant, Ng, Zhang, et al. 2024. “Differentiable Causal Discovery for Latent Hierarchical Causal Models.” In.

Raginsky. 2011. “Directed Information and Pearl’s Causal Calculus.” In 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

Rakesh, Guo, Moraffah, et al. 2018. “Linked Causal Variational Autoencoder for Inferring Paired Spillover Effects.” In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. CIKM ’18.

Rehkopf, Glymour, and Osypuk. 2016. “The Consistency Assumption for Causal Inference in Social Epidemiology: When a Rose Is Not a Rose.” Current Epidemiology Reports.

Reizinger, Guo, Huszár, et al. 2024. “Identifiable Exchangeable Mechanisms for Causal Structure and Representation Learning.” In.

Richardson, Thomas S., and Robins. 2013. “Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality.”

Richardson, Thomas, and Spirtes. 2002. “Ancestral Graph Markov Models.” Annals of Statistics.

Robins. 1997. “Causal Inference from Complex Longitudinal Data.” In Latent Variable Modeling and Applications to Causality. Lecture Notes in Statistics.

Rohrer. 2018. “Thinking Clearly About Correlations and Causation: Graphical Causal Models for Observational Data:” Advances in Methods and Practices in Psychological Science.

Rothenhäusler, Meinshausen, Bühlmann, et al. 2020. “Anchor Regression: Heterogeneous Data Meets Causality.” arXiv:1801.06229 [Stat].

Rotnitzky, and Smucler. 2020. “Efficient Adjustment Sets for Population Average Causal Treatment Effect Estimation in Graphical Models.” Journal of Machine Learning Research.

Rubenstein, Bongers, Schölkopf, et al. 2018. “From Deterministic ODEs to Dynamic Structural Causal Models.” In Uncertainty in Artificial Intelligence.

Rubenstein, Weichwald, Bongers, et al. 2017. “Causal Consistency of Structural Equation Models.” In Uncertainty in Artificial Intelligence.

Rubin, and Waterman. 2006. “Estimating the Causal Effects of Marketing Interventions Using Propensity Score Methodology.” Statistical Science.

Sauer, and VanderWeele. 2013. Use of Directed Acyclic Graphs.

Schölkopf. 2022. “Causality for Machine Learning.” In Probabilistic and Causal Inference: The Works of Judea Pearl.

Schölkopf, Hogg, Wang, et al. 2015. “Removing Systematic Errors for Exoplanet Search via Latent Causes.” arXiv:1505.03036 [Astro-Ph, Stat].

Schölkopf, Janzing, Peters, et al. 2012. “On Causal and Anticausal Learning.” In ICML 2012.

Schölkopf, Locatello, Bauer, et al. 2021. “Toward Causal Representation Learning.” Proceedings of the IEEE.

Schölkopf, Muandet, Fukumizu, et al. 2015. “Computing Functions of Random Variables via Reproducing Kernel Hilbert Space Representations.” arXiv:1501.06794 [Cs, Stat].

Schulam, and Saria. 2017. “Reliable Decision Support Using Counterfactual Models.” In Proceedings of the 31st International Conference on Neural Information Processing Systems. NIPS’17.

Shalizi, and McFowland III. 2016. “Controlling for Latent Homophily in Social Networks Through Inferring Latent Locations.” arXiv:1607.06565 [Physics, Stat].

Shalizi, and Thomas. 2011. “Homophily and Contagion Are Generically Confounded in Observational Social Network Studies.” Sociological Methods & Research.

Sharma, and Kiciman. 2020. “DoWhy: An End-to-End Library for Causal Inference.”

Shipley. 2016. Cause and Correlation in Biology: A User’s Guide to Path Analysis, Structural Equations and Causal Inference with R.

Shpitser, and Pearl. 2008. “Complete Identification Methods for the Causal Hierarchy.” The Journal of Machine Learning Research.

Shpitser, and Tchetgen. 2014. “Causal Inference with a Graphical Hierarchy of Interventions.” arXiv:1411.2127 [Stat].

Shrier, and Platt. 2008. “Reducing Bias Through Directed Acyclic Graphs.” BMC Medical Research Methodology.

Smith, David A., and Eisner. 2008. “Dependency Parsing by Belief Propagation.” In Proceedings of the Conference on Empirical Methods in Natural Language Processing.

Smith, Bonnie, Ogburn, McGue, et al. 2020. “Causal Effects in Twin Studies: The Role of Interference.” arXiv:2007.04511 [Stat].

Spirtes, Glymour, and Scheines. 2001. Causation, Prediction, and Search. Adaptive Computation and Machine Learning.

Subbaswamy, Schulam, and Saria. 2019. “Preventing Failures Due to Dataset Shift: Learning Predictive Models That Transport.” In The 22nd International Conference on Artificial Intelligence and Statistics.

Suzuki, Shinozaki, and Yamamoto. 2020. “Causal Diagrams: Pitfalls and Tips.” Journal of Epidemiology.

Textor, Idelberger, and Liśkiewicz. 2015. “Learning from Pairwise Marginal Independencies.” arXiv:1508.00280 [Cs].

Textor, and Liśkiewicz. 2011. “Adjustment Criteria in Causal Diagrams: An Algorithmic Perspective.” In Proceedings of the Twenty-Seventh Conference on Uncertainty in Artificial Intelligence. UAI’11.

Tschantz, Sen, and Datta. 2019. “Differential Privacy as a Causal Property.” arXiv:1710.05899 [Cs].

Tu, Zhang, Ackermann, et al. 2018. “Causal Discovery in the Presence of Missing Data.” arXiv:1807.04010 [Cs, Stat].

van der Zander, and Liśkiewicz. 2016. “Separators and Adjustment Sets in Markov Equivalent DAGs.” In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence. AAAI’16.

van der Zander, Liśkiewicz, and Textor. 2014. “Constructing Separators and Adjustment Sets in Ancestral Graphs.” In Proceedings of the UAI 2014 Conference on Causal Inference: Learning and Prediction - Volume 1274. CI’14.

van der Zander, Textor, and Liskiewicz. 2015. “Efficiently Finding Conditional Instruments for Causal Inference.” In Proceedings of the 24th International Conference on Artificial Intelligence. IJCAI’15.

Vansteelandt, Bekaert, and Claeskens. 2012. “On Model Selection and Model Misspecification in Causal Inference.” Statistical Methods in Medical Research.

Veitch, and Zaveri. 2020. “Sense and Sensitivity Analysis: Simple Post-Hoc Analysis of Bias Due to Unobserved Confounding.”

Visweswaran, and Cooper. 2014. “Counting Markov Blanket Structures.” arXiv:1407.2483 [Cs, Stat].

Walker. 2014. “The Effect of Unmeasured Confounders on the Ability to Estimate a True Performance or Selection Gradient (and Other Partial Regression Coefficients).” Evolution.

Wang, Dun, Hogg, Foreman-Mackey, et al. 2017. “A Pixel-Level Model for Event Discovery in Time-Domain Imaging.” arXiv:1710.02428 [Astro-Ph].

Wang, Jiawei, Lu, Cao, et al. 2024. “Neural Causal Graph for Interpretable and Intervenable Classification.” In.

Wang, Yuhao, Solus, Yang, et al. 2017. “Permutation-Based Causal Inference Algorithms with Interventions.”

Weichwald, and Peters. 2020. “Causality in Cognitive Neuroscience: Concepts, Challenges, and Distributional Robustness.” arXiv:2002.06060 [q-Bio, Stat].

Westfall, and Yarkoni. 2016. “Statistically Controlling for Confounding Constructs Is Harder Than You Think.” PLOS ONE.

Wong. 2020. “Computational Causal Inference.” arXiv:2007.10979 [Cs, Stat].

Wright. 1934. “The Method of Path Coefficients.” The Annals of Mathematical Statistics.

Wu, Yulun, McConnell, and Iriondo. 2024. “Counterfactual Generative Modeling with Variational Causal Inference.” In.

Wu, Anpeng, Qiu, Chen, et al. 2024. “Causal Graph Transformer for Treatment Effect Estimation Under Unknown Interference.” In.

Yadav, Prunelli, Hoff, et al. 2016. “Causal Inference in Observational Data.” arXiv:1611.04660 [Cs, Stat].

Yang, Liu, Chen, et al. 2021. “CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models.” In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Yao, Rancati, Cadei, et al. 2024. “Unifying Causal Representation Learning with the Invariance Principle.” In.

Yedidia, Freeman, and Weiss. 2003. “Understanding Belief Propagation and Its Generalizations.” In Exploring Artificial Intelligence in the New Millennium.

Zečević, Dhami, Veličković, et al. 2021. “Relating Graph Neural Networks to Structural Causal Models.”

Zhang, Peters, Janzing, et al. 2012. “Kernel-Based Conditional Independence Test and Application in Causal Discovery.” arXiv:1202.3775 [Cs, Stat].

Zheng, Aragam, Ravikumar, et al. 2018. “DAGs with NO TEARS: Continuous Optimization for Structure Learning.” In Advances in Neural Information Processing Systems 31.