Multi agent causality

Game theory and decision theory for lots of interacting agents

2025-03-09 — 2026-01-13

Wherein Causal DAGs Are Extended to Include Agents and Decisions via a Mechanized Multi‑Agent Influence Diagram, and Iterated Games Are Employed to Exemplify Commitment Races Relevant to AI Safety.

adaptive

agents

AI safety

causality

cooperation

economics

evolution

extended self

game theory

graphical models

incentive mechanisms

learning

mind

networks

social graph

utility

wonk

Notes on decision theory and causality in which agents make decisions, in the context of iterated games in multi-agent systems, with applications to AI safety.

Extending causal DAGs to include agents and decisions.

One of the extensions to classical game theory — making explicit who knows what, who decides what, and how decisions affect outcomes.

0.1 Multi-agent graphs

There’s a long line of work attempting this (Heckerman and Shachter 1994; Dawid 2002; Koller and Milch 2003). I’m working from Hammond et al. (2023) and MacDermott, Everitt, and Belardinelli (2023), which introduce the One Ring that unifies them all in the form of something called a Mechanized Multi-Agent Influence Diagram, a.k.a. a MMAID.

cf Liu et al. (2024).

There’s also a library for computing with various interesting causal influence diagrams, causalincentives/pycid (Fox et al. 2021).

Library for graphical models of decision making, based on pgmpy and networkx

1 Commitment races

See commitment for a discussion of the commitment problem in multi-agent systems.

2 References

Benford. 2010. “What Does Newcomb’s Paradox Teach Us?”

Brunet, and Doolittle. 2015. “Multilevel Selection Theory and the Evolutionary Functions of Transposable Elements.” Genome Biology and Evolution.

Cai, Daskalakis, and Weinberg. 2013. “Understanding Incentives: Mechanism Design Becomes Algorithm Design.” arXiv:1305.4002 [Cs].

Dawid. 2002. “Influence Diagrams for Causal Modelling and Inference.” International Statistical Review.

Fernández-Loría, and Provost. 2021. “Causal Decision Making and Causal Effect Estimation Are Not the Same… and Why It Matters.”

Foerster, Farquhar, Afouras, et al. 2018. “Counterfactual Multi-Agent Policy Gradients.” Proceedings of the AAAI Conference on Artificial Intelligence.

Fox, Everitt, Carey, et al. 2021. “PyCID: A Python Library for Causal Influence Diagrams.” In.

Fox, MacDermott, Hammond, et al. 2023. “On Imperfect Recall in Multi-Agent Influence Diagrams.” Electronic Proceedings in Theoretical Computer Science.

Hammond, Chan, Clifton, et al. 2025. “Multi-Agent Risks from Advanced AI.”

Hammond, Fox, Everitt, et al. 2023. “Reasoning about Causality in Games.” Artificial Intelligence.

Harley. 1981. “Learning the Evolutionarily Stable Strategy.” Journal of Theoretical Biology.

Heckerman, and Shachter. 1994. “A Decision-Based View of Causality.” In Proceedings of the Tenth International Conference on Uncertainty in Artificial Intelligence. UAI’94.

Hetzer, and Sornette. 2013. “An Evolutionary Model of Cooperation, Fairness and Altruistic Punishment in Public Good Games.” PLoS ONE.

Hoel. 2017. “When the Map Is Better Than the Territory.” Entropy.

Kalai, and Lehrer. 1993. “Rational Learning Leads to Nash Equilibrium.” Econometrica.

Koller, and Milch. 2003. “Multi-Agent Influence Diagrams for Representing and Solving Games.” Games and Economic Behavior, First World Congress of the Game Theory Society,.

Liu, Wang, Li, et al. 2024. “Attaining Human Desirable Outcomes in Human-AI Interaction via Structural Causal Games.”

Loewith, and Street. 2025. “Mutual Prediction in Human–AI Coevolution.” Antikythera Digital Journal.

MacDermott, Everitt, and Belardinelli. 2023. “Characterising Decision Theories with Mechanised Causal Graphs.”

MacDermott, Fox, Belardinelli, et al. 2024. “Measuring Goal-Directedness.”

Meulemans, Kobayashi, Oswald, et al. 2024. “Multi-Agent Cooperation Through Learning-Aware Policy Gradients.” In.

Meulemans, Nasser, Wołczyk, et al. 2025. “Embedded Universal Predictive Intelligence: A Coherent Framework for Multi-Agent Learning.”

Mu, Guo, Chen, et al. 2024. “Multi-Agent, Human-Agent and Beyond: A Survey on Cooperation in Social Dilemmas.”

Nowak. 2006. “Five Rules for the Evolution of Cooperation.” Science.

Sanders, Galla, and Shapiro. 2011. “Effects of Noise on Convergent Game Learning Dynamics.” arXiv:1109.4853.

Walters, Kaufmann, Sefas, et al. 2025. “Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study.”

Wolpert, and Benford. 2013. “The Lesson of Newcomb’s Paradox.” Synthese.