Open ended intelligence

Need is all you need

2022-11-27 — 2025-09-25

Wherein two paradigms are surveyed, optimizing agents and replicating persisters are contrasted, intrinsic drives such as curiosity and empowerment are proposed as bridges, and open‑ended generators like POET are described.

adaptive
agents
energy
evolution
extended self
game theory
gene
incentive mechanisms
learning
mind
networks
probability
statistics
statmech
utility
wonk

1 Two paradigms of adaptation

This is a placeholder to talk about entities that try to ‘do well’ by simply continuing to exist. How is the loss function of an optimizer related to the notional fitness function of an evolutionary entity? “Entities that optimize for goals, above all,” versus “entities that replicate and persist, above all.”

These are two different paradigms for adaptive entities: optimizing (what our algorithms usually aim for) and persisting (what evolution produces).

Rather than being born with a single overriding goal encoded in a loss function that ranks states as better or worse, we evolutionary entities are messier. We have a deep drive to survive and also a desire to succeed while alive, where succeeding seems to be a somewhat adjustable criterion and might include being “happy”, “good”, “successful”, “loved”, “powerful”, “wise”, “free”, “just”, “beautiful”, “funny”, “interesting”, “creative”, “kind”, “rich” or “righteous”. Or whatever.

Optimized and evolved entities are both present in the world. Usually we think of surviving as the domain of life, and optimizing as the domain of machines, although the line is fuzzy thanks to genetic programming and self-optimizing nerds. Maybe that’s why machines seem so utterly alien to us. As an evolutionary replicator myself, I tend to fear optimizers, and I wonder how my interests can actually align with theirs.

There are newer non-optimizing paradigms for AI (Lehman and Stanley 2011; Ringstrom 2022), and I wonder whether they can do anything useful.

Cf. Arcas et al. (2024), which suggests that self-replication sometimes emerges naturally in machines.

2 Intrinsic motivation models

One way to bridge the gap between pure optimizers and pure persisters is through intrinsic motivation. Instead of waiting for a sparse external signal (reward, loss, profit), an agent generates its own signals: curiosity, play, empowerment, surprise, novelty.

  • Curiosity encourages agents to seek states that reduce uncertainty or maximize prediction error.
  • Play is a form of practice with no explicit external reward that nonetheless builds flexible repertoires of behaviour.
  • Empowerment (as discussed earlier) motivates an agent to maintain future optionality by staying in states with many possible futures.
  • Novelty search abandons external performance measures altogether, rewarding only the discovery of novel behaviour (Lehman and Stanley 2011).
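To make the first of these signals concrete, here is a minimal numpy sketch of a prediction-error curiosity bonus: a toy linear forward model is fitted online, and the intrinsic reward is simply how badly it predicts the next state, so repeated transitions become boring while novel ones stay rewarding. This illustrates the general idea rather than any published algorithm; the dimensions, learning rate and the name CuriosityBonus are invented for the example.

```python
import numpy as np

class CuriosityBonus:
    """Intrinsic reward = prediction error of a toy linear forward model."""

    def __init__(self, state_dim, action_dim, lr=0.05, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.normal(scale=0.1, size=(state_dim, state_dim + action_dim))
        self.lr = lr

    def __call__(self, state, action, next_state):
        x = np.concatenate([state, action])
        err = next_state - self.W @ x          # surprise: what the model got wrong
        self.W += self.lr * np.outer(err, x)   # the model improves, so the bonus decays
        return float(err @ err)

rng = np.random.default_rng(1)
bonus = CuriosityBonus(state_dim=4, action_dim=2)

s, a = rng.normal(size=4), rng.normal(size=2)
s_next = 0.5 * s + 0.1                         # fixed, learnable toy dynamics
for _ in range(200):                           # see the same transition repeatedly
    b = bonus(s, a, s_next)

print("bonus after repetition:", round(b, 4))
print("bonus on a novel transition:",
      round(bonus(rng.normal(size=4), a, s_next), 4))
```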

Recent work explores how such motivations can scale: intrinsically motivated deep RL (Du et al. 2023), empowerment approximations (Lidayan et al. 2025), models of complex curiosity (Ramírez-Ruiz et al. 2024), and even formal theories of curiosity (Schmidhuber 2010). These paradigms don’t abolish optimization, but they re-anchor it in something closer to evolutionary persistence — maintaining flexibility, exploration, and continued existence.
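Empowerment in particular has a crisp definition (Klyubin et al. 2005): the channel capacity between an agent’s next n actions and the state they lead to, i.e. the maximum over action distributions of the mutual information I(Aⁿ; S′). In a deterministic world that capacity collapses to the log of how many distinct states are reachable within the horizon, which the sketch below computes by brute force for a tiny gridworld. The gridworld and the empowerment function are toy assumptions for illustration, not code from any of the papers cited above.

```python
import itertools
import math

# Deterministic 5x5 gridworld: the agent can move or stay, and walls clip moves.
SIZE = 5
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1), (0, 0)]   # up, down, left, right, stay

def step(state, action):
    r = min(max(state[0] + action[0], 0), SIZE - 1)
    c = min(max(state[1] + action[1], 0), SIZE - 1)
    return (r, c)

def empowerment(state, horizon=2):
    """n-step empowerment in a deterministic world: log2 of the number of
    distinct states reachable by some action sequence of length `horizon`."""
    reachable = set()
    for seq in itertools.product(ACTIONS, repeat=horizon):
        s = state
        for a in seq:
            s = step(s, a)
        reachable.add(s)
    return math.log2(len(reachable))

# The centre keeps more futures open than a corner, so it is more empowered.
print("corner:", round(empowerment((0, 0)), 2))
print("centre:", round(empowerment((2, 2)), 2))
```

An empowerment-seeking agent drifts toward the centre simply because that is where its futures stay open, which is the optionality reading of persistence.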

3 Open-endedness

This brings us to a more grandiose question: what does it mean for a system to be “open-ended”?

Jeff Clune posed a version of this question at EXAIT:

Could we devise an open-ended exploratory algorithm that is worth running for a billion years?

This isn’t about solving a single benchmark or reaching a single target loss. It’s about building processes that never finish and that continually create novelty, complexity, and surprise.

Researchers have started sketching pathways:

  • POET and Enhanced POET co-evolve environments and the agents that solve them, producing an ever-growing curriculum of challenges and solutions (Wang et al. 2019, 2020).
  • AI-generating algorithms propose meta-learning the architectures, learning algorithms and training environments themselves (Clune 2020).
  • OMNI-EPIC keeps generating new environments, programmed in code, guided by models of human notions of interestingness (Faldor et al. 2024).
  • Go-Explore-style strategies (“first return, then explore”) archive promising states and return to them before exploring further (Ecoffet et al. 2021).
  • Quality-diversity and novelty search reward behavioural diversity rather than progress on a single objective (Cully et al. 2015; Lehman and Stanley 2011).
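To give a flavour of the POET-style loop in particular, here is a deliberately cartoonish sketch: environments are bare difficulty scalars and agents are skill scalars, whereas real POET co-evolves parameterized terrains and neural-network controllers trained with evolution strategies. The helper functions (evaluate, optimize, mutate) and the minimal-criterion thresholds are stand-ins invented for this toy.

```python
import random

random.seed(0)

# A cartoon of a POET-style outer loop (Wang et al. 2019). Environments are
# difficulty scalars, agents are skill scalars; all helpers are toy stand-ins.

def evaluate(agent, env):
    return agent - env                           # score: skill minus difficulty

def optimize(agent, env, steps=10):
    """Hill-climb an agent inside its paired environment."""
    for _ in range(steps):
        candidate = agent + random.gauss(0, 0.1)
        if evaluate(candidate, env) > evaluate(agent, env):
            agent = candidate
    return agent

def mutate(env):
    return env + abs(random.gauss(0, 0.3))       # child environments get harder

population = [(0.0, 0.0)]                        # (environment, agent) pairs
for generation in range(20):
    # 1. Locally optimize every agent in its own environment.
    population = [(env, optimize(agent, env)) for env, agent in population]

    # 2. Spawn new environments, keeping only those that pass a minimal
    #    criterion: neither already solved nor hopelessly hard.
    new_pairs = []
    for env, agent in population:
        child_env = mutate(env)
        if -1.0 < evaluate(agent, child_env) < 0.5:
            # 3. Transfer: seed the new environment with the best current agent.
            best = max(a for _, a in population)
            new_pairs.append((child_env, best))

    population = (population + new_pairs)[-10:]  # cap the active population

print("hardest environment reached:", round(max(env for env, _ in population), 2))
```

Even this toy shows the structural move that distinguishes these systems from ordinary optimizers: there is no fixed objective, because the environments that define the objective keep being generated.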

These approaches move away from a single-point optimization worldview toward something more like evolution: messy, self-propagating, self-diversifying, and driven by the twin imperatives of persistence and exploration.

Is that what life is building?

4 References

Abramsky, Banzhaf, Caves, et al. 2025. “Open Questions about Time and Self-Reference in Living Systems.”
Arcas, Alakuijala, Evans, et al. 2024. “Computational Life: How Well-Formed, Self-Replicating Programs Emerge from Simple Interaction.”
Berrueta, Pinosky, and Murphey. 2024. “Maximum Diffusion Reinforcement Learning.” Nature Machine Intelligence.
Clune. 2020. “AI-GAs: AI-Generating Algorithms, an Alternate Paradigm for Producing General Artificial Intelligence.”
Cully, Clune, Tarapore, et al. 2015. “Robots that can adapt like animals.” Nature.
Du, Kosoy, Dayan, et al. 2023. “What Can AI Learn from Human Exploration? Intrinsically-Motivated Humans and Agents in Open-World Exploration.” In.
Ecoffet, Huizinga, Lehman, et al. 2021. “First Return, Then Explore.” Nature.
Eysenbach, Gupta, Ibarz, et al. 2018. “Diversity Is All You Need: Learning Skills Without a Reward Function.” In.
Faldor, Zhang, Cully, et al. 2024. “OMNI-EPIC: Open-Endedness via Models of Human Notions of Interestingness with Environments Programmed in Code.” In.
Franzmeyer, Malinowski, and Henriques. 2021. “Learning Altruistic Behaviours in Reinforcement Learning Without External Rewards.” In.
Hafner, Ortega, Ba, et al. 2022. “Action and Perception as Divergence Minimization.”
Jaques, Lazaridou, Hughes, et al. 2019. “Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning.” In Proceedings of the 36th International Conference on Machine Learning.
Klyubin, Alexander S., Polani, and Nehaniv. 2005. “All Else Being Equal Be Empowered.” In Advances in Artificial Life.
Klyubin, A.S., Polani, and Nehaniv. 2005. “Empowerment: A Universal Agent-Centric Measure of Control.” In 2005 IEEE Congress on Evolutionary Computation.
Lehman, and Stanley. 2011. “Abandoning Objectives: Evolution Through the Search for Novelty Alone.” Evolutionary Computation.
———. 2013. “Evolvability Is Inevitable: Increasing Evolvability Without the Pressure to Adapt.” PLoS ONE.
Levin. 2024. “Artificial Intelligences: A Bridge Toward Diverse Intelligence and Humanity’s Future.” Advanced Intelligent Systems.
Lidayan, Du, Kosoy, et al. 2025. “Intrinsically-Motivated Humans and Agents in Open-World Exploration.”
Ramírez-Ruiz, Grytskyy, Mastrogiuseppe, et al. 2024. “Complex Behavior from Intrinsic Motivation to Occupy Future Action-State Path Space.” Nature Communications.
Ringstrom. 2022. “Reward Is Not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long Learning.”
Schmidhuber. 2010. “Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990–2010).” IEEE Transactions on Autonomous Mental Development.
Tarsney. 2025. “Will Artificial Agents Pursue Power by Default?”
Taylor, and Dorin. 2020. Rise of the Self-Replicators: Early Visions of Machines, AI and Robots That Can Reproduce and Evolve.
Turner, Smith, Shah, et al. 2021. “Optimal Policies Tend To Seek Power.” In Advances in Neural Information Processing Systems.
Wang, Lehman, Clune, et al. 2019. “POET: Open-Ended Coevolution of Environments and Their Optimized Solutions.” In Proceedings of the Genetic and Evolutionary Computation Conference. GECCO ’19.
Wang, Lehman, Rawal, et al. 2020. “Enhanced POET: Open-Ended Reinforcement Learning Through Unbounded Invention of Learning Challenges and Their Solutions.”