Probably actually reading/writing
2020-03-05 — 2026-06-29
Wherein an Author’s Active Research Across Bounded Agency, Collective Truth-Finding, and Cooperative AI Is Catalogued, With Notebooks on Ethics, Coalitional Fairness, and Societal Resilience Left Partially Unpublished.
Stuff I’m currently reading or otherwise working on. If you’re looking at this and you’re not me, maybe you should reconsider your hobbies. This is very inside-baseball, and also inside the sausage factory. You might regret seeing how the baseball sausage is made out of the minced meat of mixed metaphors.
1 Currently writing
Not all of it is publicly visible yet but each of these corresponds to an open notebook in which I am currently writing about things, which is the same thing for me as thinking about things.
Many of these areas are somewhat frustrating at the moment. I’ve moved in to a new area and tried to bring my unique perspective to it, to discover that the area is more mined-out that I would like.
1.1 Run-ups at bounded agency
Something is just not right with the default framing of agency in AI and economics. This is obvious to many people. Nonetheless, I don’t think we have nailed “doing it right” yet.
The fact that many careers have been burned on this pyre already should lead us to believe that it is difficult to get right, and that we should be careful about what we are doing.
- Agency under bounded compute and information: Itemising existing ways of capping rationality to see if anything looked helpful. It looks like a quagmire of formalisms.
- Economics of compute and cognition — compute as a substitutable factor of production. Microeconomic lens on information processing in actually-existing society. I wonder if the economic ideas of substitutable factors of production can be applied to information processing in society, and is that the best we are going to get for valuing human effort?
- Bitter lessons in predictive optimization; Optimization works annoyingly well. What is outside its purview? Asking this question right is subtle, because more things than we thought clearly submit themselves to the tyranny of stochastic gradient descent.
- Making good intertemporal decisions; For whom am I even deciding? Selves across time seem to me to be very raggedly handled in the literature.
- Open-ended intelligence; How do we formalize intelligence in the setting of “things that persist and explore”? I am not sure if this should be a normative framing, or a descriptive one, or both. But it seems unsatisfactorily resolved in either case.
- Homunculi; One thing that is special about intelligence in beings I care about is that we are all self- and other-modelling intelligences. Surely the literature on this obvious observation is good? It seems weirdly thin.
- Renormalization for ML— Do the physicists have anything useful to add to my street-fightin’ variational Bayes?
- Nearly sufficient statistics and information bottlenecks— OK maybe this is not bonkers, even though I hate that it always ends up being informational.
- Platonic and convergent representations is more fruitful than I had supposed. Q: will learners of some very general type converge upon useful shared abstractions? What would that mean?
1.2 Collective truth-finding and its mechanism design
Extracting reliable belief and decision from heterogeneous, strategic, or boundedly-rational populations, and the institutional / algorithmic machinery for doing so.
- Wisdom from the madness of crowds — formalisms for extracting truth from biased, strategic crowds. I think there is some small nice post to be written here that uses mechanized graphs to make beliefs into first-class citizens, but I am not sure if there is a good enough outcome from doing so to justify the time.
- AI alignment to collective values — RLHF / ERM as implicit voting, differentiable social choice. If the algorithms already include some collective
- Civic tech and AI-mediated governance — bridging-based aggregation, deliberative platforms. Many excellent ideas here. Goal: persuade people to try them.
- Utopian governance — sortition, futarchy, liquid democracy, mechanism composition. What is the great good in governance?
- Delegated agent governance — AI pilled version of the previous two. If the world going well depends on humans collectively making it go well, do we get mechanisms to do this from AI itself?
Bayesian epistemics— proper scoring rules, peer prediction, Bayesian truth serum. This is a lit review for use in other things but I don’t think I need to add anything now; it seems straightforward enough to just read the papers and use the results.Truth/effectiveness heat pumps; Hand wringing about truth-seeking versus change-seeking, and how to jointly solve for both. On pause.Groupthink and the wisdom of crowds— diversity dividends, information cascades, collusion, elicitation. Quiescent for now.
1.3 Ecology of mind and agency
OK, if we want to understand mind we can take inspiration from biological systems which clearly do not optimize but instead explore and investigate and compete and evolve and generally interact with one another in a broadly ecological way.
Some ideas toward that:
- Steps to an ecology of mind; Misc notes on how minds might arise in ecological balance rather than solipsistic optimisation. Exploratory, no unifying themes yet.
- Intrinsic motivation; What motivations for prediction or action can we find that 1) do not cash out in an ERM loss function, and 2) are still tractable to analyze in some useful way?
Boundaries and blankets: “Where are the self/world boundaries?” This whole line of research is remarkably cursed for something that looks totally intuitive, and all of us solve every day.Utility as a local approximation of fitness: I was trying to understand rational agent utility in evolutionary perspective, but I think this is unhelpfully far from where utility “works” in practice.- What use is utility?; If we want to solve for a utilitarian outcome, what gets us the best approximation to a good outcome in practice?
- Human collective agency— Intuitively, ot me at least, human well-being inheres in individuals and also collectively. Is there any way of formalising this that is not just fatuous hand-waving?
- Multi-scale agency Maybe we can solve the question of human collective agency by instead generalising from humans to arbitrary multi-scale systems, and then specialising back to humans. Hmm.
- Cooperation through uncertainty. Small simple idea: Many virtuous cooperative patterns seem to arise from from uncertainties. Is ignorance bliss? Is excessive data-processing going to squeeze out cooperative patterns by eliminating the uncertainty that makes them possible? I have concern this question will re-invent some old Robin Hanson bit.
- Information Empowerment— I have a profound suspicion about all these information al empowerment measures because they seem to have doubly-cursed estimation theory. Firstly, the informational quantities they use are have no good estimators in general, and secondly if if they did, it would assume that the agents in question were acting in a closed world. It looks like empowerment solves for optionality as an end but I am not convinced that it is the right way to do so.
1.4 Cooperative AI
Building machines that work with us and each other.
- Coalition games— Self-tutoring on game theory for coalition formation.
- Algorithmic collective action— AI meets coalition formation. How do we get machines to cooperate with us and each other?
- Value/reward learning— If value is numinous and not directly observable, can we just learn it anyway and that is fine.
- Delegated agent governance— See above. Can AI+humans do better at creating a just society in a short term tactical fashion?
- Bargaining with successors— What is the game theory of a one-off power transition to a successor species? How does that go for us?
- Intergenerational game theory The human version of the previous question. How do we make decisions that are good for our successors, and how do we make sure they are good for us too? This is weirdly un-unified in the literature. We both over- and under-cooperate with future humans.
- Learning with theory of mind; Good or bad? How does it work?
- Making good intertemporal decisions— OK, how about I specialize just on me? How do I bargain with my past/future selves?
- Opponent shaping needs a rewrite to be less about Advantage Alignment.
- Cooperation through uncertainty (see above)
- Open-source game theory
1.5 Ethics
I’m sceptical of utilitarianism as an ultimate moral framework, for all that it is useful at the margins. Nonetheless, I need to make huge decisions under uncertainty. What to do?
Steps to an ecology of mind
Optionality as an end
Moral orbits.
Where does utilitarianism even come from?
- Where do preferences come from
- On what we want
- The moral wetware
1.6 Coalitional fairness
If I want to be able to operate in a world full of other agents, some human and some artificial, I want to know how we are able to coordinate and what conditions might make that multi-agent coordination feasible.
- Shapley values
- Am I getting fucked?
When does the GDP measure importanceDecided it’s not that interesting- Coalition games
- Algorithmic collective action
- Economics of growth
- Bargaining with successors
- Intergenerational game theory
- Making good intertemporal decisions
1.7 AI Safety and society
How does being human interact with the risk of AI? How coupled are the dangers with what it even means to be human? How path-dependent on virtuous societal outcomes is the risk of AI? How path-dependent on AI is the risk of societal collapse?
1.8 Economics of innovation, progress, automation
The nexus where growth theory, the economics of cognition, and the political economy of AI meet.
Economics of growth
When does the GDP measure importance
1.9 Sciences of the Artificial
I just realized that the eerie resemblance of ML research to physical sciences is deceptive. We have tools for understanding this better.
1.10 Agentic AI in practice
How I actually extract value from the machine.
- AI agents, applied
- Code agents and assistants
- Vibecoding Apple apps
- Maths and proof models, applied
- Generative AI workflows and hacks 2026
- AI search
- PDF ingestion
- Running LLMs locally on a Mac
- Discretizing and quantizing neural nets
- Fine tuning foundation models
- Fine-tuning danbot
- Editing images with AI
- Generative art with language+diffusion models
- Front-end clients for AI image models
1.11 DIY tech and community sovereignty
1.13 Complexity theory revival
Santa Fe Institute detritus
1.14 Human superorganisms
Multi-scale agency, collective intelligence, and the ecology of mind.
- Returns to scale in technological society
- Moral orbits.
- Revisit Probability collectives, CoINs etc
- Movement design
- Returns on hierarchy
- Effective collectivism
- Institutional alignment
- Beliefs and rituals of tribes
- Institutions for angels
- Where to deploy taboo
- Shower thought: The Great Society will never feel great; it’ll merely be better than the alternatives
- Player versus game
1.15 Foundation models and their world models
- Platonic and convergent representations is more fruitful than I had supposed. Q: will learners of some very general type converge upon useful shared abstractions? What would that mean?
- Causal/Bayesian inference in foundation models
- Differentiable learning of automata
- Maths and proof models, applied
- Graphical model / ML decoder ring
- Causal abstraction
- History-based RL generalizes RL from POMDPs to even less tractable settings.
- Neural net reasoning and symbolic mathematics
1.16 Bayes-meets-neural-nets
Not very active in this rn, but I have a niggling suspicion that this new-wave de-Finetti stuff is actually useful.
Bayes about logical statements
Quantum probability
1.17 Other ProbML stuff
1.18 Classification and society series
Something about the experience of existing in a society with things classified into “natural” kinds that are clearly contingent and useful for the systems in which we are part is deeply unintuitive for humans, even actively destructive of empathy. I want to tease apart what is happening there.
- Adversarial categorization
- Performative prediction
- Constructivist rationalism
- Affirming the consequent and evaporative tribalism
- Classifications are not very informative
- AUC and collateral damage
- Bias and base rates
- Decision theory and prejudice
- Is academic literary studies actually distinct from the security discipline of studying side-channel attacks?
1.19 Evolving a just society
a.k.a. Shouting at each other on the internet.
- Iterative game theory of communication styles
- Invasive arguments
- A social divide I’ve seen a lot recently: people who value cheap signalling highly versus those who view it negatively.
- Modern politics seems excellent at reducing the vast spectrum of policy options to two mediocre choices, then arguing about which is worse. What is this tendency called as a phenomenon in itself?
- Goodhart coordination
- Emancipating my tribe: the cruelty of collectivism (and why I love it anyway)
- Collective care
- Social calendaring
- Psychological resilience
- Nationalism
- The Activist and decoupling games, and game-changing
Lived evidence deductions and/orad hominem for discussing genetic arguments.- Structural problems are hard — let’s do training programs
- Diffusion of responsibility — is this distinct from messenger shooting?
- Coalition games
All We Need Is Hate- Speech standards
Pluralism✅
1.20 Human learner series
I’ve demoted this category because I am mostly pursuing other lines of attach at the moment.
Is “is” “ought”?
On what we want
Correlation construction
-
- Burkean conservatism is about identifying when moral training data is out-of-distribution.
- Something about universal grammar and its learnable local approximations versus universal ethics and their learnable local approximations. Morality by template; the computational difficulty of moral identification. Leading by example of necessity.
Akrasia in stochastic processes: what time-integrated happiness should we optimize?
Comfort traps✅ Good enough for nowMyths✅ a few notes are enough
1.21 When is computation “statistical”?
I mean this in the sense that, as in statistical mechanics, we know some bulk statistics of a population of solutions even when we can’t do the calculations for everything (like: air pressure doesn’t require simulating every molecule). It seems that machine learning sometimes behaves like this in a certain sense. I’m not sure of the scope of this idea — maybe I’m reinventing computational mechanics, or algorithmic information theory — so let’s use some examples to flesh it out:
- Trading equities. We can’t know every trade, but we can price options well under no-arbitrage assumptions, even though traders’ calculations can be far more complex than ours. No-arbitrage assumptions aren’t strictly true, but the returns from extra complexity to find arbitrage opportunities seem to diminish with compute, so in the wash it’s pretty similar.
- Statistical mechanics of statistics
- Scaling laws: we can’t know the exact computations an LLM will do, but we can predict its performance remarkably well given a data-parameter-train-compute budget.
- Algorithmic statistics and pseudorandomness study the statistical behaviours of some classes of algorithms, where they become near-indistinguishable from randomness in technical senses.
- Computational mechanics
- Neural net reasoning and symbolic mathematics
- Renormalization for ML
- Coarse graining
- …
1.22 How to do house stuff
Renovation etc.
1.23 Learning in context
- Interaction effects are what we want
- Interpolation is what we want
- Optimal conditioning is what we want
- Correlation construction is easier than cauzation learning
1.24 Economic dematerialization
via
- Enclosing the intellectual commons
- Creative economy jobs
Not currently active.
1.25 Misc
- Haunting and exchangeability. Connection to interpolation, individuation, legibility and nonparametrics.
- X is Yer than Z
- Is residual prediction different from adversarial prediction?
1.26 Meta
- Aunty Val, the digest persona
- Danbot
