Causal inference on DAGs

Confounding! This scientist performed miracle graph surgery during an intervention and you won’t believe what happened next


Inferring the optimal intervention requires accounting for which arrows are independent of which

Inferring cause and effect from nature. Graphical models and related techniques for doing it. Avoiding the danger of folk statistics. Observational studies, confounding, adjustment criteria, d-separation, identifiability, interventions, moral equivalence…

The most well-trodden path here is using directed graphical models with the additional assumption that \(A\rightarrow B\) may be read as “A causes a change in B”. C&C instrumental variables and propensity score matching. When you are talking Structural Equation models, this boils down to more or less some extra interpretation imposed on hierarchical models. Avoidance of Ecological fallacy/ Simpson’s paradox.

When can I use my crappy observational data, collected without a good experimental design for whatever reason, to do interventional inference? There is a lot of research in this. I should summarise the salient bits for myself. In fact I did; I led a reading group on this.

See also quantum causal graphical models, and the use of classical causal graphical models to eliminate hidden quantum causes.

Spurious correlation induced by sampling bias.

Gwern on Causality:

I speculate that in realistic causal networks or DAGs, the number of possible correlations grows faster than the number of possible causal relationships. So confounds really are that common, and since people do not think in DAGs, the imbalance also explains overconfidence.

Learning materials

Miguel Hernán and Jamie Robins’ new causal inference book, has a free draft online. See Yanir Seroussi’s review. Jonas Peters’ notes from his teaching in 2015 (I may have taken this course; can’t recall exactly).

Samantha Kleinberg has a book notable for its handling for time-dependent causality.

Tutorial: David Sontag and Uri Shalit, Causal inference from observational studies.

Lord’s paradox.

Felix Elwert’s summary. (Elwert 2013)

Chapter 3 of (some edition of) Pearl’s book is available as an author’s preprint: Part 1, 2, 3, 4, 5, 6.

Stanford encyclopaedia of philosophy entry.

Various classic introductions (Pearl 2012, 1998; Elwert 2013; Morgan and Winship 2015; Rohrer 2018). Notably not recommended on pedagogic grounds (Koller and Friedman 2009).

The dagitty intro is an interactive guide via visualizations. Likewise, the ggdag bias structure vignette shows of the useful explanation diagrams available in ggdag and is also a good introduction to selection bias and causal dags themselves.

Amit Sharma’s tutorial at KDD.

Counterfactuals

🏗

External validity

See external validity.

Propensity scores

Rubin and Waterman (2006) comes recommended by Shalizi as:

A good description of Rubin et al.’s methods for causal inference, adapted to the meanest understanding. […] Rubin and Waterman do a very good job of explaining, in a clear and concrete problem, just how and why the newer techniques of causal inference are valuable, with just enough technical detail that it doesn’t seem like magic.

Causal Graph inference from data

Uh oh. You don’t know what causes what? Or specifically, you can’t eliminate a whole bunch of potential causal arrows a priori? Much more work.

Here is a seminar I noticed on this theme, which is also a lightspeed introduction to some difficulties.

Guido Consonni, Objective Bayes Model Selection of Gaussian Essential Graphs with Observational and Interventional Data.

Graphical models based on Directed Acyclic Graphs (DAGs) represent a powerful tool for investigating dependencies among variables. It is well known that one cannot distinguish between DAGs encoding the same set of conditional independencies (Markov equivalent DAGs) using only observational data. However, the space of all DAGs can be partitioned into Markov equivalence classes, each being represented by a unique Essential Graph (EG), also called Completed Partially Directed Graph (CPDAG). In some fields, in particular genomics, one can have both observational and interventional data, the latter being produced after an exogenous perturbation of some variables in the system, or from randomized intervention experiments. Interventions destroy the original causal structure, and modify the Markov property of the underlying DAG, leading to a finer partition of DAGs into equivalence classes, each one being represented by an Interventional Essential Graph (I-EG) (Hauser and Buehlmann). In this talk we consider Bayesian model selection of EGs under the assumption that the variables are jointly Gaussian. In particular, we adopt an objective Bayes approach, based on the notion of fractional Bayes factor, and obtain a closed form expression for the marginal likelihood of an EG. Next we construct a Markov chain to explore the EG space under a sparsity constraint, and propose an MCMC algorithm to approximate the posterior distribution over the space of EGs. Our methodology, which we name Objective Bayes Essential graph Search (OBES), allows to evaluate the inferential uncertainty associated to any features of interest, for instance the posterior probability of edge inclusion. An extension of OBES to deal simultaneously with observational and interventional data is also presented: this involves suitable modifications of the likelihood and prior, as well as of the MCMC algorithm. We conclude by presenting results for simulated and real experiments (protein-signalling data).

This is joint work with Federico Castelletti, Stefano Peluso and Marco Della

Causal time series DAGS

As with other time series methods, has its own issues.

🏗 find out how Causal impact works. (Based on Brodersen et al. (2015).)

The CausalImpact R package implements an approach to estimating the causal effect of a designed intervention on a time series. For example, how many additional daily clicks were generated by an advertising campaign? Answering a question like this can be difficult when a randomized experiment is not available. The package aims to address this difficulty using a structural Bayesian time-series model to estimate how the response metric might have evolved after the intervention if the intervention had not occurred.

Drawing graphical models

See diagramming graphical models.

Tools

Many. See, e.g. CausalDiscoveryToolbox, ijmbarr/causalgraphicalmodels: Causal Graphical Models in Python dagR does R. Recent and backed by Microsfot, DoWhy is a python toolbox.

Aalen, OO, K Røysland, JM Gran, R Kouyos, and T Lange. 2016. “Can We Believe the DAGs? A Comment on the Relationship Between Causal DAGs and Mechanisms.” Statistical Methods in Medical Research 25 (5): 2294–314. https://doi.org/10.1177/0962280213520436.
Achab, Massil, Emmanuel Bacry, Stéphane Gaïffas, Iacopo Mastromatteo, and Jean-Francois Muzy. 2017. “Uncovering Causality from Multivariate Hawkes Integrated Cumulants.” In PMLR. http://arxiv.org/abs/1607.06333.
Allen, John-Mark A., Jonathan Barrett, Dominic C. Horsman, Ciarán M. Lee, and Robert W. Spekkens. 2017. “Quantum Common Causes and Quantum Causal Models.” Physical Review X 7 (3): 031021. https://doi.org/10.1103/PhysRevX.7.031021.
Aragam, Bryon, Jiaying Gu, and Qing Zhou. 2017. “Learning Large-Scale Bayesian Networks with the Sparsebn Package.” March 11, 2017. http://arxiv.org/abs/1703.04025.
Aral, Sinan, Lev Muchnik, and Arun Sundararajan. 2009. “Distinguishing Influence-Based Contagion from Homophily-Driven Diffusion in Dynamic Networks.” Proceedings of the National Academy of Sciences 106 (51): 21544–49. https://doi.org/10.1073/pnas.0908800106.
Arnold, Barry C., Enrique Castillo, and Jose M. Sarabia. 1999. Conditional Specification of Statistical Models. Springer Science & Business Media. https://books.google.com.au/books?hl=en&lr=&id=lKeKu_HtMdQC&oi=fnd&pg=PA1&dq=arnold+castillo+sarabia+conditional+specification+of+statistical+models&ots=gxWoVEdsde&sig=p0BJlEeB5yQ052m5YhfQ_A6Kmoo.
Bahadori, Mohammad Taha, Krzysztof Chalupka, Edward Choi, Robert Chen, Walter F. Stewart, and Jimeng Sun. 2017. “Neural Causal Regularization Under the Independence of Mechanisms Assumption.” February 8, 2017. http://arxiv.org/abs/1702.02604.
Bareinboim, Elias, and Judea Pearl. 2016. “Causal Inference and the Data-Fusion Problem.” Proceedings of the National Academy of Sciences 113 (27): 7345–52. https://doi.org/10.1073/pnas.1510507113.
Bareinboim, Elias, Jin Tian, and Judea Pearl. 2014. “Recovering from Selection Bias in Causal and Statistical Inference.” In AAAI, 2410–16. http://ftp.cs.ucla.edu/pub/stat_ser/r425.pdf.
Besserve, Michel, Arash Mehrjou, Rémy Sun, and Bernhard Schölkopf. 2019. “Counterfactuals Uncover the Modular Structure of Deep Generative Models.” December 12, 2019. http://arxiv.org/abs/1812.03253.
Blom, Tineke, Stephan Bongers, and Joris M. Mooij. 2020. “Beyond Structural Causal Models: Causal Constraints Models.” In Uncertainty in Artificial Intelligence, 585–94. PMLR. http://proceedings.mlr.press/v115/blom20a.html.
Bloniarz, Adam, Hanzhong Liu, Cun-Hui Zhang, Jasjeet Sekhon, and Bin Yu. 2015. “Lasso Adjustments of Treatment Effect Estimates in Randomized Experiments.” July 13, 2015. http://arxiv.org/abs/1507.03652.
Bonchi, Francesco, Francesco Gullo, Bud Mishra, and Daniele Ramazzotti. 2018. “Probabilistic Causal Analysis of Social Influence.” In Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 1003–12. CIKM ’18. New York, NY, USA: ACM. https://doi.org/10.1145/3269206.3271756.
Bongers, Stephan, Patrick Forré, Jonas Peters, Bernhard Schölkopf, and Joris M. Mooij. 2020. “Foundations of Structural Causal Models with Cycles and Latent Variables.” May 5, 2020. http://arxiv.org/abs/1611.06221.
Bongers, Stephan, and Joris M. Mooij. 2018. “From Random Differential Equations to Structural Causal Models: The Stochastic Case.” March 27, 2018. http://arxiv.org/abs/1803.08784.
Bongers, Stephan, Jonas Peters, Bernhard Schölkopf, and Joris M. Mooij. 2016. “Structural Causal Models: Cycles, Marginalizations, Exogenous Reparametrizations and Reductions.” November 18, 2016. http://arxiv.org/abs/1611.06221.
Bottou, Léon, Jonas Peters, Joaquin Quiñonero-Candela, Denis X. Charles, D. Max Chickering, Elon Portugaly, Dipankar Ray, Patrice Simard, and Ed Snelson. 2013. “Counterfactual Reasoning and Learning Systems.” July 27, 2013. http://arxiv.org/abs/1209.2355.
Brodersen, Kay H., Fabian Gallusser, Jim Koehler, Nicolas Remy, and Steven L. Scott. 2015. “Inferring Causal Impact Using Bayesian Structural Time-Series Models.” The Annals of Applied Statistics 9 (1): 247–74. https://doi.org/10.1214/14-AOAS788.
Bühlmann, Peter. 2013. “Causal Statistical Inference in High Dimensions.” Mathematical Methods of Operations Research 77 (3): 357–70. https://doi.org/10.1007/s00186-012-0404-7.
Bühlmann, Peter, Markus Kalisch, and Lukas Meier. 2014. “High-Dimensional Statistics with a View Toward Applications in Biology.” Annual Review of Statistics and Its Application 1 (1): 255–78. https://doi.org/10.1146/annurev-statistics-022513-115545.
Bühlmann, Peter, Jonas Peters, Jan Ernest, and Marloes Maathuis. 2014. “Predicting Causal Effects in High-Dimensional Settings.” http://springmeeting2014.sfds.asso.fr/wp-content/uploads/2014/04/buhlmann.pdf.
Bühlmann, Peter, Philipp Rütimann, and Markus Kalisch. 2013. “Controlling False Positive Selections in High-Dimensional Regression and Causal Inference.” Statistical Methods in Medical Research 22 (5): 466–92. http://smm.sagepub.com/content/22/5/466.short.
Chalak, Karim, and Halbert White. 2012. “Causality, Conditional Independence, and Graphical Separation in Settable Systems.” Neural Computation 24 (7): 1611–68. http://ieeexplore.ieee.org/abstract/document/6797310/.
Chaves, Rafael, Gabriela Barreto Lemos, and Jacques Pienaar. 2018. “Causal Modeling the Delayed-Choice Experiment.” Physical Review Letters 120 (19): 190401. https://doi.org/10.1103/PhysRevLett.120.190401.
Chen, B, and J Pearl. 2012. “Regression and Causation: A Critical Examination of Econometric Textbooks.”
Claassen, Tom, Joris M. Mooij, and Tom Heskes. 2014. “Proof Supplement - Learning Sparse Causal Models Is Not NP-Hard (UAI2013).” November 6, 2014. http://arxiv.org/abs/1411.1557.
Colombo, Diego, Marloes H. Maathuis, Markus Kalisch, and Thomas S. Richardson. 2012. “Learning High-Dimensional Directed Acyclic Graphs with Latent and Selection Variables.” The Annals of Statistics 40 (1): 294–321. http://projecteuclid.org/euclid.aos/1333567191.
De Luna, Xavier, Ingeborg Waernbaum, and Thomas S. Richardson. 2011. “Covariate Selection for the Nonparametric Estimation of an Average Treatment Effect.” Biometrika, October, asr041. https://doi.org/10.1093/biomet/asr041.
Didelez, Vanessa. n.d. “Causal Reasoning for Events in Continuous Time: A DecisionTheoretic Approach.” Accessed July 18, 2015. http://www.homepages.ucl.ac.uk/~ucgtrbd/uai2015_causal/papers/didelez.pdf.
Duvenaud, David K., Daniel Eaton, Kevin P. Murphy, and Mark W. Schmidt. 2010. “Causal Learning Without DAGs.” In NIPS Causality: Objectives and Assessment, 177–90. http://jmlr.org/proceedings/papers/v6/duvenaud10a/duvenaud10a.pdf.
Eichler, Michael. 2001. “Granger-Causality Graphs for Multivariate Time Series.” Granger-Causality Graphs for Multivariate Time Series. http://archiv.ub.uni-heidelberg.de/volltextserver/20749/1/beitrag.64.pdf.
Elwert, Felix. 2013. “Graphical Causal Models.” In Handbook of Causal Analysis for Social Research, edited by Stephen L. Morgan, 245–73. Handbooks of Sociology and Social Research. Dordrecht: Springer Netherlands. https://doi.org/10.1007/978-94-007-6094-3_13.
Entner, Doris, Patrik Hoyer, and Peter Spirtes. 2013. “Data-Driven Covariate Selection for Nonparametric Estimation of Causal Effects.” In Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, 256–64. http://jmlr.org/proceedings/papers/v31/entner13a.html.
Ernest, Jan, and Peter Bühlmann. 2014. “Marginal Integration for Fully Robust Causal Inference.” May 8, 2014. http://arxiv.org/abs/1405.1868.
Fixx, James F. 1977. Games for the Superintelligent. London: Muller.
Foreman-Mackey, Daniel, Benjamin T. Montet, David W. Hogg, Timothy D. Morton, Dun Wang, and Bernhard Schölkopf. 2015. “A Systematic Search for Transiting Planets in the K2 Data.” The Astrophysical Journal 806 (2): 215. https://doi.org/10.1088/0004-637X/806/2/215.
Fu, Fei, and Qing Zhou. 2013. “Learning Sparse Causal Gaussian Networks With Experimental Intervention: Regularization and Coordinate Descent.” Journal of the American Statistical Association 108 (501): 288–300. https://doi.org/10.1080/01621459.2012.754359.
Gebharter, Alexander, and Nina Retzlaff. 2020. “A New Proposal How to Handle Counterexamples to Markov Causation à La Cartwright, or: Fixing the Chemical Factory.” Synthese 197 (4): 1467–86. https://doi.org/10.1007/s11229-018-02014-7.
Gelman, Andrew. 2010. “Causality and Statistical Learning.” American Journal of Sociology 117 (3): 955–66. https://doi.org/10.1086/662659.
Gelman, Andrew, and Xiao-Li Meng. 2004. Applied Bayesian Modeling and Causal Inference From Incomplete-Data Perspectives. John Wiley & Sons.
Genewein, Tim, Tom McGrath, Grégoire Déletang, Vladimir Mikulik, Miljan Martic, Shane Legg, and Pedro A. Ortega. 2020. “Algorithms for Causal Reasoning in Probability Trees.” October 23, 2020. http://arxiv.org/abs/2010.12237.
Geng, Zhi, Yue Liu, Chunchen Liu, and Wang Miao. 2019. “Evaluation of Causal Effects and Local Structure Learning of Causal Networks.” Annual Review of Statistics and Its Application 6 (1): 103–24. https://doi.org/10.1146/annurev-statistics-030718-105312.
Gu, Jiaying, Fei Fu, and Qing Zhou. 2014. “Adaptive Penalized Estimation of Directed Acyclic Graphs From Categorical Data.” March 10, 2014. http://arxiv.org/abs/1403.2310.
Hansen, Niels, and Alexander Sokol. 2014. “Causal Interpretation of Stochastic Differential Equations.” Electronic Journal of Probability 19. https://doi.org/10.1214/EJP.v19-2891.
Hernán, Miguel, and Jamie Robins. 2019a. Causal Inference Vol 3.
———. 2019b. Causal Inference Vol 2.
———. 2019c. Causal Inference Vol 1.
Hinton, Geoffrey E., Simon Osindero, and Kejie Bao. 2005. “Learning Causally Linked Markov Random Fields.” In Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics, 128–35. Citeseer. http://www.cs.toronto.edu/~osindero/PUBLICATIONS/HintonOsinderoBao05_CLMRF.pdf.
Hoyer, Patrik O., Dominik Janzing, Joris M Mooij, Jonas Peters, and Bernhard Schölkopf. 2009. “Nonlinear Causal Discovery with Additive Noise Models.” In Advances in Neural Information Processing Systems 21, edited by D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, 689–96. Curran Associates, Inc. http://papers.nips.cc/paper/3548-nonlinear-causal-discovery-with-additive-noise-models.pdf.
Huang, Yuxiao, and Samantha Kleinberg. 2015. “Fast and Accurate Causal Inference from Time Series Data.” In, 6. http://www.skleinberg.org/papers/huang_flairs15.pdf.
Janzing, Dominik, Joris Mooij, Kun Zhang, Jan Lemeire, Jakob Zscheischler, Povilas Daniušis, Bastian Steudel, and Bernhard Schölkopf. 2012. “Information-Geometric Approach to Inferring Causal Directions.” Artificial Intelligence 182-183 (May): 1–31. https://doi.org/10.1016/j.artint.2012.01.002.
Janzing, Dominik, and Bernhard Schölkopf. 2010. “Causal Inference Using the Algorithmic Markov Condition.” IEEE Transactions on Information Theory 56 (10): 5168–94. https://doi.org/10.1109/TIT.2010.2060095.
Johansson, Fredrik D., Uri Shalit, and David Sontag. 2018. “Learning Representations for Counterfactual Inference.” June 6, 2018. http://arxiv.org/abs/1605.03661.
Johansson, Fredrik, Uri Shalit, and David Sontag. 2016. “Learning Representations for Counterfactual Inference.” In International Conference on Machine Learning, 3020–29. PMLR. http://proceedings.mlr.press/v48/johansson16.html.
Jordan, Michael I., and Yair Weiss. 2002a. “Graphical Models: Probabilistic Inference.” The Handbook of Brain Theory and Neural Networks, 490–96. http://www.cs.iastate.edu/ honavar/jordan2.pdf.
———. 2002b. “Probabilistic Inference in Graphical Models.” Handbook of Neural Networks and Brain Theory. http://mlg.eng.cam.ac.uk/zoubin/course03/hbtnn2e-I.pdf.
Jordan, Michael Irwin. 1999. Learning in Graphical Models. Cambridge, Mass.: MIT Press.
Kalainathan, Diviyan, Olivier Goudet, and Ritik Dutta. 2020. “Causal Discovery Toolbox: Uncovering Causal Relationships in Python.” Journal of Machine Learning Research 21 (37): 1–5. http://jmlr.org/papers/v21/19-187.html.
Kalisch, Markus, and Peter Bühlmann. 2007. “Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm.” Journal of Machine Learning Research 8 (May): 613–36. http://jmlr.org/papers/v8/kalisch07a.html.
Kallus, Nathan. 2020. “Generalized Optimal Matching Methods for Causal Inference.” Journal of Machine Learning Research 21 (62): 1–54. http://jmlr.org/papers/v21/19-120.html.
Kennedy, Edward H. 2015. “Semiparametric Theory and Empirical Processes in Causal Inference.” 2015. http://arxiv.org/abs/1510.04740.
Kilbertus, Niki, Mateo Rojas Carulla, Giambattista Parascandolo, Moritz Hardt, Dominik Janzing, and Bernhard Schölkopf. 2017. “Avoiding Discrimination Through Causal Reasoning.” In Advances in Neural Information Processing Systems 30, edited by I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, 656–66. Curran Associates, Inc. http://papers.nips.cc/paper/6668-avoiding-discrimination-through-causal-reasoning.pdf.
Kim, Jin H., and Judea Pearl. 1983. “A Computational Model for Causal and Diagnostic Reasoning in Inference Systems.” In IJCAI, 83:190–93. Citeseer. http://ijcai.org/Past.
Kleinberg, Samantha. 2012. Causality, Probability, and Time. 1 edition. Cambridge: Cambridge University Press.
Kocaoglu, Murat, Christopher Snyder, Alexandros G. Dimakis, and Sriram Vishwanath. 2017. CausalGAN: Learning Causal Implicit Generative Models with Adversarial Training.” September 14, 2017. http://arxiv.org/abs/1709.02023.
Kohler, Ulrich, Frauke Kreuter, and Elizabeth A. Stuart. 2019. “Nonprobability Sampling and Causal Analysis.” Annual Review of Statistics and Its Application 6 (1): 149–72. https://doi.org/10.1146/annurev-statistics-030718-104951.
Koller, Daphne, and Nir Friedman. 2009. Probabilistic Graphical Models : Principles and Techniques. Cambridge, MA: MIT Press.
Lauritzen, S. L., and D. J. Spiegelhalter. 1988. “Local Computations with Probabilities on Graphical Structures and Their Application to Expert Systems.” Journal of the Royal Statistical Society. Series B (Methodological) 50 (2): 157–224. http://intersci.ss.uci.edu/wiki/pdf/Lauritzen1988.pdf.
Lauritzen, Steffen L. 1996. Graphical Models. Clarendon Press.
———. 2000. “Causal Inference from Graphical Models.” In Complex Stochastic Systems, 63–107. CRC Press. https://books.google.ch/books?hl=en&lr=&id=gCENL6qflA8C&oi=fnd&pg=PA63&ots=vgUI_QIs0y&sig=4WEKa7ToKKqHC1fsSt5prFZSL4Q.
Lee, Sanghack, and Elias Bareinboim. n.d. “Causal Effect Identifiability Under Partial-Observability,” 10.
Locatello, Francesco, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, and Olivier Bachem. 2019. “Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations.” June 18, 2019. http://arxiv.org/abs/1811.12359.
Lopez-Paz, David, Robert Nishihara, Soumith Chintala, Bernhard Schölkopf, and Léon Bottou. 2016. “Discovering Causal Signals in Images.” May 26, 2016. http://arxiv.org/abs/1605.08179.
Louizos, Christos, Uri Shalit, Joris M Mooij, David Sontag, Richard Zemel, and Max Welling. 2017. “Causal Effect Inference with Deep Latent-Variable Models.” In Advances in Neural Information Processing Systems 30, edited by I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, 6446–56. Curran Associates, Inc. http://papers.nips.cc/paper/7223-causal-effect-inference-with-deep-latent-variable-models.pdf.
Maathuis, Marloes H., and Diego Colombo. 2013. “A Generalized Backdoor Criterion.” 2013. http://arxiv.org/abs/1307.5636.
Maathuis, Marloes H., Diego Colombo, Markus Kalisch, and Peter Bühlmann. 2010. “Predicting Causal Effects in Large-Scale Systems from Observational Data.” Nature Methods 7 (4): 247–48. https://doi.org/10.1038/nmeth0410-247.
Maathuis, Marloes H., Markus Kalisch, and Peter Bühlmann. 2009. “Estimating High-Dimensional Intervention Effects from Observational Data.” The Annals of Statistics 37 (December): 3133–64. https://doi.org/10.1214/09-AOS685.
Malinsky, Daniel, Ilya Shpitser, and Thomas Richardson. 2019. “A Potential Outcomes Calculus for Identifying Conditional Path-Specific Effects.” March 8, 2019. http://arxiv.org/abs/1903.03662.
Marbach, Daniel, Robert J. Prill, Thomas Schaffter, Claudio Mattiussi, Dario Floreano, and Gustavo Stolovitzky. 2010. “Revealing Strengths and Weaknesses of Methods for Gene Network Inference.” Proceedings of the National Academy of Sciences 107 (14): 6286–91. https://doi.org/10.1073/pnas.0913357107.
Meinshausen, Nicolai. 2018. “Causality from a Distributional Robustness Point of View.” In 2018 IEEE Data Science Workshop (DSW), 6–10. https://doi.org/10.1109/DSW.2018.8439889.
Messerli, Franz H. 2012. “Chocolate Consumption, Cognitive Function, and Nobel Laureates.” New England Journal of Medicine 367 (16): 1562–64. https://doi.org/10.1056/NEJMon1211064.
Mihalkova, Lilyana, and Raymond J. Mooney. 2007. “Bottom-up Learning of Markov Logic Network Structure.” In Proceedings of the 24th International Conference on Machine Learning, 625–32. ACM. http://dl.acm.org/citation.cfm?id=1273575.
Mogensen, Søren Wengel, Daniel Malinsky, and Niels Richard Hansen. 2018. “Causal Learning for Partially Observed Stochastic Dynamical Systems.” In Uai2018, 17. http://auai.org/uai2018/proceedings/papers/142.pdf.
Montanari, Andrea. 2011. “Lecture Notes for Stat 375 Inference in Graphical Models.” http://www.stanford.edu/~montanar/TEACHING/Stat375/handouts/notes_stat375_1.pdf.
Mooij, Joris M., Jonas Peters, Dominik Janzing, Jakob Zscheischler, and Bernhard Schölkopf. 2016. “Distinguishing Cause from Effect Using Observational Data: Methods and Benchmarks.” Journal of Machine Learning Research 17 (32): 1–102. http://jmlr.org/papers/v17/14-518.html.
Morgan, Stephen L., and Christopher Winship. 2015. Counterfactuals and Causal Inference. Cambridge University Press.
Murphy, Kevin P. 2012. Machine Learning: A Probabilistic Perspective. 1 edition. Adaptive Computation and Machine Learning Series. Cambridge, MA: MIT Press.
Neapolitan, Richard E. 2003. Learning Bayesian Networks. Vol. 38. Prentice Hal, Paperback. https://books.secure-services.me/Gentoomen.
Newsom, author., Jason. 2009. “Estimation of the Causal Effects of Time-Varying Exposures.” In Longitudinal Data Analysis, 1st edition, 553:599. Routledge,.
Ng, Ignavier, Zhuangyan Fang, Shengyu Zhu, Zhitang Chen, and Jun Wang. 2020. “Masked Gradient-Based Causal Structure Learning.” February 17, 2020. http://arxiv.org/abs/1910.08527.
Ng, Ignavier, Shengyu Zhu, Zhitang Chen, and Zhuangyan Fang. 2019. “A Graph Autoencoder Approach to Causal Structure Learning.” In Advances In Neural Information Processing Systems. http://arxiv.org/abs/1911.07420.
Noel, Hans, and Brendan Nyhan. 2011. “The ‘Unfriending’ Problem: The Consequences of Homophily in Friendship Retention for Causal Estimates of Social Influence.” Social Networks 33 (3): 211–18. https://doi.org/10.1016/j.socnet.2011.05.003.
Pearl, Judea. 1982. “Reverend Bayes on Inference Engines: A Distributed Hierarchical Approach.” In In Proceedings of the National Conference on Artificial Intelligence, 133–36. http://www.aaai.org/Papers/AAAI/1982/AAAI82-032.pdf.
———. 1986. “Fusion, Propagation, and Structuring in Belief Networks.” Artificial Intelligence 29 (3): 241–88. https://doi.org/10.1016/0004-3702(86)90072-X.
———. 1998. “Graphical Models for Probabilistic and Causal Reasoning.” In Quantified Representation of Uncertainty and Imprecision, edited by Philippe Smets, 367–89. Handbook of Defeasible Reasoning and Uncertainty Management Systems. Dordrecht: Springer Netherlands. https://doi.org/10.1007/978-94-017-1735-9_12.
———. 2008. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Rev. 2. print., 12. [Dr.]. The Morgan Kaufmann Series in Representation and Reasoning. San Francisco, Calif: Kaufmann.
———. 2009a. “Causal Inference in Statistics: An Overview.” Statistics Surveys 3: 96–146. https://doi.org/10.1214/09-SS057.
———. 2009b. Causality: Models, Reasoning and Inference. Cambridge University Press.
———. 2010. “3. The Foundations of Causal Inference.” Sociological Methodology 40 (1): 75–149. https://doi.org/10.1111/j.1467-9531.2010.01228.x.
———. 2012. “The Do-Calculus Revisited Judea Pearl Keynote Lecture, August 17, 2012 UAI-2012 Conference, Catalina, CA.” Edited by Nando de Freitas and Kevin Murphy, 8.
Pearl, Judea, and Elias Bareinboim. 2014. “External Validity: From Do-Calculus to Transportability Across Populations.” Statistical Science 29 (4): 579–95. https://doi.org/10.1214/14-STS486.
Pearl, Judea, Madelyn Glymour, and Nicholas P. Jewell. 2016. Causal Inference in Statistics: A Primer. Wiley.
Peters, Jonas. 2015. “Causality Lecture Notes.” http://web.math.ku.dk/~peters/jonas_files/scriptChapter1-4.pdf.
Peters, Jonas, Peter Bühlmann, and Nicolai Meinshausen. 2015. “Causal Inference Using Invariant Prediction: Identification and Confidence Intervals.” January 6, 2015. http://arxiv.org/abs/1501.01332.
Peters, Jonas, Dominik Janzing, and Bernhard Schölkopf. 2017. Elements of Causal Inference: Foundations and Learning Algorithms. Adaptive Computation and Machine Learning Series. Cambridge, Massachuestts: The MIT Press. https://www.dropbox.com/s/dl/gkmsow492w3oolt/11283.pdf.
Peters, Jonas, Joris M. Mooij, Dominik Janzing, and Bernhard Schölkopf. 2014. “Causal Discovery with Continuous Additive Noise Models.” The Journal of Machine Learning Research 15 (1): 2009–53.
Raginsky, M. 2011. “Directed Information and Pearl’s Causal Calculus.” In 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton), 958–65. https://doi.org/10.1109/Allerton.2011.6120270.
Rakesh, Vineeth, Ruocheng Guo, Raha Moraffah, Nitin Agarwal, and Huan Liu. 2018. “Linked Causal Variational Autoencoder for Inferring Paired Spillover Effects.” In Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 1679–82. CIKM ’18. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/3269206.3269267.
Rehkopf, David H., M. Maria Glymour, and Theresa L. Osypuk. 2016. “The Consistency Assumption for Causal Inference in Social Epidemiology: When a Rose Is Not a Rose.” Current Epidemiology Reports 3 (1): 63–71. https://doi.org/10.1007/s40471-016-0069-5.
Richardson, Thomas, and Peter Spirtes. 2002. “Ancestral Graph Markov Models.” Annals of Statistics 30 (4): 962–1030. https://doi.org/10.1214/aos/1031689015.
Robins, James M. 1997. “Causal Inference from Complex Longitudinal Data.” In Latent Variable Modeling and Applications to Causality, edited by Maia Berkane, 69–117. Lecture Notes in Statistics. New York, NY: Springer. https://doi.org/10.1007/978-1-4612-1842-5_4.
Rohrer, Julia M. 2018. “Thinking Clearly About Correlations and Causation: Graphical Causal Models for Observational Data:” Advances in Methods and Practices in Psychological Science, January. https://doi.org/10.1177/2515245917745629.
Rotnitzky, Andrea, and Ezequiel Smucler. 2020. “Efficient Adjustment Sets for Population Average Causal Treatment Effect Estimation in Graphical Models.” Journal of Machine Learning Research 21 (188): 1–86. http://jmlr.org/papers/v21/19-1026.html.
Rubenstein, Paul K., Stephan Bongers, Bernhard Schölkopf, and Joris M. Mooij. 2018. “From Deterministic ODEs to Dynamic Structural Causal Models.” July 9, 2018. http://arxiv.org/abs/1608.08028.
Rubenstein, Paul K., Sebastian Weichwald, Stephan Bongers, Joris M. Mooij, Dominik Janzing, Moritz Grosse-Wentrup, and Bernhard Schölkopf. 2017. “Causal Consistency of Structural Equation Models.” July 4, 2017. http://arxiv.org/abs/1707.00819.
Rubin, Donald B, and Richard P Waterman. 2006. “Estimating the Causal Effects of Marketing Interventions Using Propensity Score Methodology.” Statistical Science 21 (2): 206–22. https://doi.org/10.1214/088342306000000259.
Sauer, Brian, and Tyler J. VanderWeele. 2013. Use of Directed Acyclic Graphs. Agency for Healthcare Research and Quality (US). https://www.ncbi.nlm.nih.gov/books/NBK126189/.
Schölkopf, Bernhard. 2019. “Causality for Machine Learning.” December 23, 2019. http://arxiv.org/abs/1911.10500.
Schölkopf, Bernhard, Bernhard, Dominik Janzing, Jonas Peters, Eleni Sgouritsa, Kun Zhang, and Joris Mooij. 2012. “On Causal and Anticausal Learning.” In ICML 2012. http://arxiv.org/abs/1206.6471.
Schölkopf, Bernhard, David W. Hogg, Dun Wang, Daniel Foreman-Mackey, Dominik Janzing, Carl-Johann Simon-Gabriel, and Jonas Peters. 2015. “Removing Systematic Errors for Exoplanet Search via Latent Causes.” May 12, 2015. http://arxiv.org/abs/1505.03036.
Schölkopf, Bernhard, Krikamol Muandet, Kenji Fukumizu, and Jonas Peters. 2015. “Computing Functions of Random Variables via Reproducing Kernel Hilbert Space Representations.” January 27, 2015. http://arxiv.org/abs/1501.06794.
Schulam, Peter, and Suchi Saria. 2017. “Reliable Decision Support Using Counterfactual Models.” In Proceedings of the 31st International Conference on Neural Information Processing Systems, 1696–706. NIPS’17. Red Hook, NY, USA: Curran Associates Inc. http://papers.nips.cc/paper/6767-reliable-decision-support-using-counterfactual-models.pdf.
Shalizi, Cosma Rohilla, and Edward McFowland III. 2016. “Controlling for Latent Homophily in Social Networks Through Inferring Latent Locations.” July 22, 2016. http://arxiv.org/abs/1607.06565.
Shalizi, Cosma Rohilla, and Andrew C. Thomas. 2011. “Homophily and Contagion Are Generically Confounded in Observational Social Network Studies.” Sociological Methods & Research 40 (2): 211–39. https://doi.org/10.1177/0049124111404820.
Shpitser, Ilya, and Judea Pearl. 2008. “Complete Identification Methods for the Causal Hierarchy.” The Journal of Machine Learning Research 9: 1941–79.
Shpitser, Ilya, and Eric Tchetgen Tchetgen. 2014. “Causal Inference with a Graphical Hierarchy of Interventions.” November 8, 2014. http://arxiv.org/abs/1411.2127.
Shrier, Ian, and Robert W. Platt. 2008. “Reducing Bias Through Directed Acyclic Graphs.” BMC Medical Research Methodology 8 (1): 70. https://doi.org/10.1186/1471-2288-8-70.
Smith, Bonnie, Elizabeth L. Ogburn, Matt McGue, Saonli Basu, and Daniel O. Scharfstein. 2020. “Causal Effects in Twin Studies: The Role of Interference.” July 8, 2020. http://arxiv.org/abs/2007.04511.
Smith, David A., and Jason Eisner. 2008. “Dependency Parsing by Belief Propagation.” In Proceedings of the Conference on Empirical Methods in Natural Language Processing, 145–56. Association for Computational Linguistics. http://dl.acm.org/citation.cfm?id=1613737.
Spirtes, Peter, Clark Glymour, and Richard Scheines. 2001. Causation, Prediction, and Search. Second Edition. Adaptive Computation and Machine Learning. The MIT Press. https://www.cs.cmu.edu/afs/cs.cmu.edu/project/learn-43/lib/photoz/.g/scottd/fullbook.pdf.
Subbaswamy, Adarsh, Peter Schulam, and Suchi Saria. 2019. “Preventing Failures Due to Dataset Shift: Learning Predictive Models That Transport.” In The 22nd International Conference on Artificial Intelligence and Statistics, 3118–27. PMLR. http://proceedings.mlr.press/v89/subbaswamy19a.html.
Textor, Johannes, Alexander Idelberger, and Maciej Liśkiewicz. 2015. “Learning from Pairwise Marginal Independencies.” August 2, 2015. http://arxiv.org/abs/1508.00280.
Textor, Johannes, and Maciej Liśkiewicz. 2011. “Adjustment Criteria in Causal Diagrams: An Algorithmic Perspective.” In Proceedings of the Twenty-Seventh Conference on Uncertainty in Artificial Intelligence, 681–88. UAI’11. Arlington, Virginia, USA: AUAI Press. http://arxiv.org/abs/1202.3764.
Tschantz, Michael Carl, Shayak Sen, and Anupam Datta. 2019. “Differential Privacy as a Causal Property.” July 6, 2019. http://arxiv.org/abs/1710.05899.
Tu, Ruibo, Cheng Zhang, Paul Ackermann, Hedvig Kjellström, and Kun Zhang. 2018. “Causal Discovery in the Presence of Missing Data.” July 11, 2018. http://arxiv.org/abs/1807.04010.
Vansteelandt, Stijn, Maarten Bekaert, and Gerda Claeskens. 2012. “On Model Selection and Model Misspecification in Causal Inference.” Statistical Methods in Medical Research 21 (1): 7–30. https://doi.org/10.1177/0962280210387717.
Visweswaran, Shyam, and Gregory F. Cooper. 2014. “Counting Markov Blanket Structures.” July 9, 2014. http://arxiv.org/abs/1407.2483.
Walker, Jeffrey A. 2014. “The Effect of Unmeasured Confounders on the Ability to Estimate a True Performance or Selection Gradient (and Other Partial Regression Coefficients).” Evolution 68 (7): 2128–36. https://doi.org/10.1111/evo.12406.
Wang, Dun, David W. Hogg, Daniel Foreman-Mackey, and Bernhard Schölkopf. 2017. “A Pixel-Level Model for Event Discovery in Time-Domain Imaging.” October 9, 2017. http://arxiv.org/abs/1710.02428.
Weichwald, Sebastian, and Jonas Peters. 2020. “Causality in Cognitive Neuroscience: Concepts, Challenges, and Distributional Robustness.” July 3, 2020. http://arxiv.org/abs/2002.06060.
Wong, Jeffrey C. 2020. “Computational Causal Inference.” July 21, 2020. http://arxiv.org/abs/2007.10979.
Wright, Sewall. 1934. “The Method of Path Coefficients.” The Annals of Mathematical Statistics 5 (3): 161–215. https://doi.org/10.1214/aoms/1177732676.
Yadav, Pranjul, Lisiane Prunelli, Alexander Hoff, Michael Steinbach, Bonnie Westra, Vipin Kumar, and Gyorgy Simon. 2016. “Causal Inference in Observational Data.” November 14, 2016. http://arxiv.org/abs/1611.04660.
Yang, Mengyue, Furui Liu, Zhitang Chen, Xinwei Shen, Jianye Hao, and Jun Wang. 2020. CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models.” July 1, 2020. http://arxiv.org/abs/2004.08697.
Yedidia, J. S., W. T. Freeman, and Y. Weiss. 2003. “Understanding Belief Propagation and Its Generalizations.” In Exploring Artificial Intelligence in the New Millennium, edited by G. Lakemeyer and B. Nebel, 239–36. Morgan Kaufmann Publishers. http://www.merl.com/publications/TR2001-22.
Zander, Benito van der, and Maciej Liśkiewicz. 2016. “Separators and Adjustment Sets in Markov Equivalent DAGs.” In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 3315–21. AAAI’16. Phoenix, Arizona: AAAI Press. https://www.tcs.uni-luebeck.de/downloads/papers/2016/full-version.pdf.
Zander, Benito van der, Maciej Liśkiewicz, and Johannes Textor. 2014. “Constructing Separators and Adjustment Sets in Ancestral Graphs.” In Proceedings of the UAI 2014 Conference on Causal Inference: Learning and Prediction - Volume 1274, 11–24. CI’14. Aachen, DEU: CEUR-WS.org. https://staff.fnwi.uva.nl/j.m.mooij/articles/uai2014ci_proceedings.pdf#page=17.
Zander, Benito van der, Johannes Textor, and Maciej Liskiewicz. 2015. “Efficiently Finding Conditional Instruments for Causal Inference.” In Proceedings of the 24th International Conference on Artificial Intelligence, 3243–49. IJCAI’15. Buenos Aires, Argentina: AAAI Press. https://www.ijcai.org/Proceedings/15/Papers/457.pdf.
Zhang, Kun, Jonas Peters, Dominik Janzing, and Bernhard Schölkopf. 2012. “Kernel-Based Conditional Independence Test and Application in Causal Discovery.” February 14, 2012. http://arxiv.org/abs/1202.3775.