Multi agent causality
Game theory and decision theory for lots of interacting agents
2025-03-09 — 2026-01-13
Wherein causal DAGs are extended to include agents and decisions via a Mechanized Multi‑Agent Influence Diagram, and iterated games are employed to exemplify commitment races relevant to AI safety.
Notes on decision theory and causality in which agents make decisions, in the context of iterated games in multi-agent systems, with applications to AI safety.
Extending causal DAGs to include agents and decisions.
0.1 Multi-agent graphs
There’s a long line of work attempting this (Heckerman and Shachter 1994; Dawid 2002; Koller and Milch 2003). I’m working from Hammond et al. (2023) and MacDermott, Everitt, and Belardinelli (2023), which introduce the One Ring that unifies them all in the form of something called a Mechanized Multi-Agent Influence Diagram, a.k.a. a MMAID.
cf Liu et al. (2024).
There’s also a library for computing with various interesting causal influence diagrams, causalincentives/pycid (Fox et al. 2021).
Library for graphical models of decision making, based on pgmpy and networkx
1 Commitment races
See commitment for a discussion of the commitment problem in multi-agent systems.
