Multi-agent causality
Game theory and decision theory for lots of interacting agents
2025-03-09 — 2025-03-09
Wherein agents’ decisions are modeled via a mechanised multi-agent influence diagram (MMAID) extending causal DAGs to iterated games, and the problem of commitment races is examined for AI safety.
Notes on decision theory and causality when the agents themselves make decisions, particularly in iterated games in multi-agent systems, with applications to AI safety.
Extending causal DAGs to include agents and decisions.
0.1 Multi-agent graphs
There is a long series of works attempting this (Heckerman and Shachter 1994; Dawid 2002; Koller and Milch 2003). I am working from Hammond et al. (2023) and MacDermott, Everitt, and Belardinelli (2023), which introduce the One Ring that unifies them all in the form of something called a Mechanised Multi-Agent Influence Diagram, a.k.a. an MMAID.
Cf. Liu et al. (2024).
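To make the idea of "a causal DAG plus agents and decisions" concrete, here is a minimal sketch of such a graph as a data structure. It is my own illustration, not the formalism or code of the cited papers: the class `Diagram`, the node kinds, the `mech(...)` naming and the agent names `alice` and `bob` are all hypothetical. I assume the usual split into chance, decision and utility nodes, with decision and utility nodes owned by an agent, and I assume that "mechanising" means giving each node an extra parent that stands for its mechanism. Edges between mechanism nodes, which would encode how one agent's policy responds to another's, are deliberately left out.

```python
from dataclasses import dataclass, field
from graphlib import CycleError, TopologicalSorter
from typing import Dict, Optional, Set

# Node kinds assumed for this sketch; "mechanism" is only used after mechanising.
KINDS = {"chance", "decision", "utility", "mechanism"}


@dataclass(frozen=True)
class Node:
    name: str
    kind: str
    agent: Optional[str] = None  # owner of a decision or utility node


@dataclass
class Diagram:
    nodes: Dict[str, Node] = field(default_factory=dict)
    parents: Dict[str, Set[str]] = field(default_factory=dict)

    def add_node(self, name, kind, agent=None):
        if kind not in KINDS:
            raise ValueError(f"unknown node kind: {kind}")
        self.nodes[name] = Node(name, kind, agent)
        self.parents.setdefault(name, set())

    def add_edge(self, parent, child):
        if parent not in self.nodes or child not in self.nodes:
            raise KeyError("both endpoints must be added first")
        self.parents[child].add(parent)

    def is_acyclic(self):
        # TopologicalSorter takes a mapping from node to its predecessors.
        try:
            list(TopologicalSorter(self.parents).static_order())
        except CycleError:
            return False
        return True

    def decisions_of(self, agent):
        return sorted(
            n.name for n in self.nodes.values()
            if n.kind == "decision" and n.agent == agent
        )

    def mechanised(self):
        # Copy the graph, then attach one hypothetical mechanism parent per node.
        out = Diagram()
        for node in self.nodes.values():
            out.add_node(node.name, node.kind, node.agent)
        for child, pars in self.parents.items():
            for parent in pars:
                out.add_edge(parent, child)
        for name, node in self.nodes.items():
            mech = f"mech({name})"
            out.add_node(mech, "mechanism", node.agent)
            out.add_edge(mech, name)
        return out


# A toy two-agent game: alice observes X and moves, bob observes alice's move.
game = Diagram()
game.add_node("X", "chance")
game.add_node("D1", "decision", "alice")
game.add_node("D2", "decision", "bob")
game.add_node("U1", "utility", "alice")
game.add_node("U2", "utility", "bob")
for parent, child in [("X", "D1"), ("D1", "D2"), ("D1", "U1"),
                      ("D2", "U1"), ("D1", "U2"), ("D2", "U2")]:
    game.add_edge(parent, child)

mech_game = game.mechanised()
```

Storing the graph as a child-to-parents mapping is a convenience: it matches what `graphlib.TopologicalSorter` expects, so the acyclicity check is a one-liner.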
1 Commitment races
See commitment for a discussion of the commitment problem in the context of multi-agent systems.