Garbled highlights from ICLR 2025

2025-02-22 — 2025-05-06

neural nets

South East Asia

statistics

Suspiciously similar content

I was at ICLR in Singapore this year to present MacKinlay (2025). I hoped to also present MacKinlay et al. (2025), but I missed the rebuttal notification due to a technical email fail, for which I made a grovelling apology to the reviewers. That will appear in a subsequent conference.

1 tl;dr

Don’t have time to read the whole proceedings? Here are my necessarily idiosyncratic highlights. They relate to my current research projects; maybe you can guess what those are from the following list.

Millions of diffusion papers, especially for physical systems. I am obsessed with the physics ones. Stay tuned for more.
- Physics-Informed Diffusion Models code at jhbastek/PhysicsInformedDiffusionModels: Implementation of Physics-Informed Diffusion Models; (Bastek, Sun, and Kochmann 2024). Yo Dawg, I heard you liked diffusions, so I diffused your diffusions. This trick is super cool and very simple.
- Latent-EnSF: A Latent Ensemble Score Filter for High-Dimensional Data Assimilation with Sparse Observation Data (Si and Chen 2024). I am annoyed this works! It turns out that latent estimation in Ensemble Score filters is messy, and there is a way you can amortise the difficulties away. It is not the only way, though. Stay tuned.
Multi-agent alignment.
- Advantage Alignment Algorithms Oral Poster. Code at jduquevan/advantage-alignment (Duque et al. 2024).
- Meulemans et al. (2024) is related to the previous one, and in fact recommended to me by Juan, one of the authors of the previous paper.
- Neural Interactive Proofs / SamAdamDay/neural-interactive-proofs: Experiments for the Neural Interactive Proofs paper (Hammond and Adam-Day 2025).
- Assistance games (Laidlaw et al. 2025), a.k.a. Cooperative inverse reinforcement learning (code).
Causal / graphical learning.

I am generally sceptical of learning graphical models because the popular approaches seem to have tough limitations (e.g. credible only over observables, or over i.i.d. data), but this year there were some generalisations in interesting and IMO useful directions.
- Efficient and Trustworthy Causal Discovery with Latent Variables and Complex Relations XiuchuanLi/ICLR2025-ETCD (Li and Liu 2024).
- Unifying Causal Representation Learning with the Invariance Principle (Yao et al. 2024) (boring example, but tidy method).
- Identifiable Exchangeable Mechanisms for Causal Structure and Representation Learning.
- Differentiable Causal Discovery for Latent Hierarchical Causal Models (Prashant et al. 2024) (I have not actually digested this one yet, but it is interesting because the authors are just down the road).
Connection to causal abstraction.
Singular learning theory, as explained in Lau et al. (2024). I think I’m starting to get its application to developmental interpretability.

Figure 2: With registration less than 1% of the price of ICLR registrations, the Pet Expo was a better deal.

2 Vibes

The gradual closing-down of science continues. Frontier labs are still present but seem to present fewer papers, as more of the LLM research becomes proprietary commercial advantage. The hiring frenzy is not any longer so frenzied. Generally there was a strong showing for research that was not frontier model, but maybe adjacent to it (applications to domain science, convergence bounds that companies don’t care about, etc). I could attempt to quantify this by plotting authors’ origins (which would be not too hard via OpenReview), but I don’t have time to do that right now.

Generally, I’m concerned that the business model for open research is not working out so well. The tech firms are not longer playing a reciprocal open game with the research community, perhaps because they feel like we are no longer in an iterated game but rather a one-shot game, or one end-game or another or another.

3 Finding papers

There are two paper search engines, each of which sucks differently:

Dissatisfied with both, I built a quick semantic search which is better than either of them, mostly while I was waiting in line at the local pharmacy for my flu vaccination. You will find it here if you want to install it on your machine: danmackinlay/openreview_finder. It takes about 10 minutes to download and index the ICLR 2025 papers.

It’s a few hundred megabytes of data, so I haven’t deployed it on the open internet; that is left as an exercise for the student.

4 Finding people

There is a conference app called Whova which combines the conference schedule with messaging. As you might expect from a bespoke app with a captive but limited audience, it is not amazing at either of these things.¹

The messaging works but is not capable enough for a conference with 8000 attendees. It is not amazing at prioritising or surfacing important content, so I will, for the duration of the conference, activate social media apps as a side channel.

There is a Bluesky starter pack of ICLR attendees which might be helpful for finding people to follow. I am at @danmackinlay on bluesky and @dan_mackinlay on Xitter.

5 Workshops

Observation: The workshops felt a little thin this year. Previously I have had an amazing time at workshops, smashing out bold ideas with other researchers. This year we had less momentum. I’m not throwing shade on any specific workshop organisers, to be clear. I think we were all haunted by a blight despite much vigour and enthusiasm from the organisers. I have a vague sense that this might be because the large tech firms are no longer prioritising sharing open research, and so we lack their well-funded momentum. I will be sad if this is true.

Alternatively, it might be that I am getting jaded and can no longer feel the collective effervescence.

5.1 Day 1, 27th April

ICLR 2025 Workshop on Machine Learning Multiscale Processes

Given low-level theory and computationally-expensive simulation code, how can we model complex systems on a useful time scale?

Aee also AI4X

This was to me a surprise breakout workshop. Some pretty good papers; see OpenReview
Workshop on Weight Space Learning
The recent surge in the number of publicly available neural network models — exceeding a million on platforms like Hugging Face — calls for a shift in how we perceive neural network weights. This workshop aims to establish neural network weights as a new data modality, offering immense potential across various fields.

We plan to address key dimensions of weight space learning:
- Weight Space as a Modality: Characterisation of the weight space, symmetries, scaling laws, and model zoo datasets.
- Model analysis: Inferring model properties, such as test performance or generalisation error, by the inspection of their weights.
- Model synthesis: Generating model weights for specific datasets and tasks, and improving tasks like pruning, merging, and robustness through weight manipulation.
This is a great idea. I have not really had time to digest it all though. TBC
XAI4Science: From Understanding Model Behaviour to Discovering New Scientific Knowledge

Machine learning (ML) models are impressive when they work but they can also show unreliable, untrustworthy, and harmful behaviour. Yet, such models are widely adopted and deployed, even though we do not understand why they work so well and fail miserably at times. Such rapid dissemination encourages irresponsible use, for example, to spread misinformation or create deep fakes, while hindering the efforts to use them to solve pressing societal problems and advance human knowledge.

Ideally, we want models to help us improve our understanding of the world and, at the very least, we want them to aid human knowledge and help us to further enrich it. Our goal in this workshop is to take a step in this direction by bringing together researchers working on understanding model behaviour and using it to discover new human knowledge. The workshop will include theoretical topics on understanding model behaviour, namely interpretability and explainability (XAI), but also three distinct scientific application areas: weather and climate, healthcare, and material science (ML4Science).

Generally a nice idea, but it felt like the tech firms were rather absent from this workshop given their engagement in the area. More worrying signs of the tech firms pulling back from open research?
BiAlign ICLR 2025
This workshop focuses on bidirectional Human↔︎AI alignment, a paradigm shift in how we approach the challenge of human-AI alignment, which emphasises the dynamic, complex, and evolving alignment process between humans and AI systems. This is grounded on the “bidirectional human-AI alignment” framework (see Definition and ReadingList) derived from a systematic survey of over 400 interdisciplinary alignment papers in Machine Learning (ML), Human Computer Interaction (HCI), Natural Language Processing (NLP), and more domains. Particularly, it involves two directions to maximise its benefits for human society.
- Aligning AI with Humans (AI-centred perspective): focuses on integrating human specifications into training, steering, customising, and monitoring AI systems;
- Aligning Humans with AI (Human-centred perspective): aims to preserve human agency and empower humans to critically evaluate, explain, and collaborate with AI systems.
I appreciated the goals of this workshop, but the bits I saw didn’t help me so much. However, I did learn about assistance games, which was interesting indeed.

5.2 Day 2, 28th April

Tackling Climate Change with Machine Learning | Climate Change AI

Building on our past workshops on this topic, this edition focuses on two aspects related to data-centric approaches to ML for climate action. Data-centric machine learning is not only a timely topic within the ICLR community, as analysing and engineering (pre)training datasets becomes increasingly important, but holds specific challenges and opportunities in climate-related areas, for example in smart grid management or water engineering.

This was IMO a surprise breakout workshop. I was not expecting much from it, but the workshops were generally IMO starved for content this year, and the climate one was popping with ideas and energy.
FPI @ ICLR2025

The Frontiers in Probabilistic Inference: Sampling meets Learning (FPI) workshop at ICLR 2025 focuses on modern approaches to probabilistic inference to address the challenging and under-explored area of sampling from an unnormalised distribution. Sampling spans a wide range of difficult and timely problems from molecular dynamics simulation, and Bayesian posterior inference/inverse problems to sampling from generative models weighted by target density (e.g. finetuning, inference-time alignment). We hope to provide an inclusive and collaborative environment to discuss emerging ML methods for learning samplers and their applications to real-world problems. We aim to facilitate discussions around identifying some key challenges of learning-based approaches, compared to classical sampling approaches, along with techniques to overcome them.

This was amazing IMO, but somewhat eclipsed by AABI.
Workshop on Spurious Correlation and Shortcut Learning

Reliance on spurious correlations due to simplicity bias is a well-known pitfall of deep learning models. This issue stems from the statistical nature of deep learning algorithms and their inductive biases at all stages, including data preprocessing, architectures, and optimisation. Therefore, spurious correlations and shortcut learning are fundamental and common practical problems across all branches of AI. The foundational nature and widespread occurrence of reliance on spurious correlations and shortcut learning make it an important research topic and a gateway to understanding how deep models learn patterns and the underlying mechanisms responsible for their effectiveness and generalisation. This workshop aims to address two aspects of this phenomenon: its foundations and potential solutions.

5.3 Day 3, 29th April

Not technically the same conference, but with a massive overlap: Advances in Approximate Bayesian Inference

In recent years, advances in probabilistic machine learning and approximate inference methods have enabled the development of state-of-the-art methods across many domains, such as in generative modelling, AI-driven scientific discovery, and AI safety, among many others.

This year’s symposium will have an expanded scope and will be focused on the development, analysis, and application of probabilistic machine learning methods broadly construed. In line with this expanded focus, we particularly welcome submissions that explore connections between probabilistic machine learning and other fields such as deep learning, natural language processing, active learning, reinforcement learning, compression, AI safety, AI for scientific discovery, and causal inference. We also encourage contributions that advance the foundations of probabilistic machine learning and machine learning theory.

This was amazing and highly recommended. I learned about q-exponential processes, Jesse Hoogland and Zach Furman got me one step closer to understanding the sampling problem in singular learning theory, as evidenced in Lau et al. (2024). Emtiyaz Khan corrected some of my misapprehensions about the Bayes Learning Rule. I did not quite get to the bottom of my MMD problems despite Pierre Alquier’s noble efforts, although the fault here is mine. Christian A. Naesseth explained Bartosh, Vetrov, and Naesseth (2025) to me, which was super cool.

Thanks to the AABI organisers for a great workshop!

6 Side events

AI Safety Networking Social was great.

7 Interesting papers

Infuriatingly, the conference smartphone app and website both offer incompatible means of marking your preferred sessions. So I will introduce a third method: hyperlinks.

7.1 Blog posts

Firstly, don’t sleep on the ICLR Blogposts, which demonstrate some next level science-communication. My picks, in alphabetical order:

7.2 Physics I’m interested in

7.3 Multi-agent systems + Safety

Advantage Alignment Algorithms Oral Poster. Code at jduquevan/advantage-alignment (Duque et al. 2024)

Artificially intelligent agents are increasingly being integrated into human decision-making: from large language model (LLM) assistants to autonomous vehicles. These systems often optimize their individual objective, leading to conflicts, particularly in general-sum games where naive reinforcement learning agents empirically converge to Pareto-suboptimal Nash equilibria. To address this issue, opponent shaping has emerged as a paradigm for finding socially beneficial equilibria in general-sum games. In this work, we introduce Advantage Alignment, a family of algorithms derived from first principles that perform opponent shaping efficiently and intuitively. We achieve this by aligning the advantages of interacting agents, increasing the probability of mutually beneficial actions when their interaction has been positive. We prove that existing opponent shaping methods implicitly perform Advantage Alignment. Compared to these methods, Advantage Alignment simplifies the mathematical formulation of opponent shaping, reduces the computational burden and extends to continuous action domains. We demonstrate the effectiveness of our algorithms across a range of social dilemmas, achieving state-of-the-art cooperation and robustness against exploitation.
Meulemans et al. (2024) is related to the previous one, and in fact recommended to me by Juan, one of the authors of the previous paper.

Promising recent work has shown that in certain tasks cooperation can be established between “learning-aware” agents who model the learning dynamics of each other. Here, we present the first unbiased, higher-derivative-free policy gradient algorithm for learning-aware reinforcement learning, which takes into account that other agents are themselves learning through trial and error based on multiple noisy trials. We then leverage efficient sequence models to condition behaviour on long observation histories that contain traces of the learning dynamics of other agents. Training long-context policies with our algorithm leads to cooperative behaviour and high returns on standard social dilemmas, including a challenging environment where temporally-extended action coordination is required.
Neural Interactive Proofs / SamAdamDay/neural-interactive-proofs: Experiments for the Neural Interactive Proofs paper (Hammond and Adam-Day 2025)

In our new paper, we study how a trusted, weak model can learn to interact with one or more stronger but untrusted models in order to solve tasks beyond the weak model’s capabilities. We introduce several new interaction protocols and evaluate them both theoretically and empirically alongside a range of earlier proposals. To facilitate further research on creating and evaluating different protocols for scalable oversight, we also provide a comprehensive open-source codebase.
Moral Alignment for LLM Agents (Tennant, Hailes, and Musolesi 2024)

In this work, instead of relying on human feedback, we introduce the design of reward functions that explicitly and transparently encode core human values for Reinforcement Learning-based fine-tuning of foundation agent models. Specifically, we use intrinsic rewards for the moral alignment of LLM agents.

We evaluate our approach using the traditional philosophical frameworks of Deontological Ethics and Utilitarianism, quantifying moral rewards for agents in terms of actions and consequences on the Iterated Prisoner’s Dilemma (IPD) environment.

This paper was fun but actually I think their previous paper (Tennant, Hailes, and Musolesi 2023) was more elegant.
Inverse Attention Agents for Multi-Agent Systems (Long et al. 2024)

Unlike classical ToM research, which focuses on attributing mental states such as beliefs and desires, our model shifts to the crucial yet less-emphasized component of attention. Our methodology adopts a mentalistic approach, explicitly modelling the internal attentional state of agents using an attention recognition neural network that can be trained end-to-end in combination with components of [Multi-agent reinforcement Learning]. Contrary to traditional ToM modelling approaches that rely heavily on Bayesian inference to handle mental state transitions,[…] our method maintains the ontology of these states while focusing on the direct modelling of agents’ attentional mechanisms.
Do as We Do, Not as You Think: the Conformity of Large Language Models (Weng, Chen, and Wang 2024)
Strong Preferences Affect the Robustness of Preference Models and Value Alignment (Xu and Kankanhalli 2024)
Competing Large Language Models in Multi-Agent Gaming Environments (Huang et al. 2024)
Towards Empowerment Gain through Causal Structure Learning in Model-Based Reinforcement Learning ECL site (Cao et al. 2024) (straddling causal and multi-agent)

In Model-Based Reinforcement Learning (MBRL), incorporating causal structures into dynamics models provides agents with a structured understanding of the environments, enabling efficient decision. Empowerment as an intrinsic motivation enhances the ability of agents to actively control their environments by maximizing the mutual information between future states and actions. We posit that empowerment coupled with causal understanding can improve controllability, while enhanced empowerment gain can further facilitate causal reasoning in MBRL. To improve learning efficiency and controllability, we propose a novel framework, Empowerment through Causal Learning (ECL), where an agent with the awareness of causal dynamics models achieves empowerment-driven exploration and optimizes its causal structure for task learning. Specifically, ECL operates by first training a causal dynamics model of the environment based on collected data. We then maximize empowerment under the causal structure for exploration, simultaneously using data gathered through exploration to update causal dynamics model to be more controllable than dense dynamics model without causal structure.

If this takes off, the machines will come for us all. I am faintly relieved that it doesn’t work that well yet.

7.4 Causal learning

7.5 Useful diffusions

7.6 Serendipitous weirdness

Superdiff Skreta et al. (2024) https://github.com/necludov/super-diffusion
Rusch and Rus (2024)

We propose Linear Oscillatory State-Space models (LinOSS) for efficiently learning on long sequences. Inspired by cortical dynamics of biological neural networks, we base our proposed LinOSS model on a system of forced harmonic oscillators. A stable discretization, integrated over time using fast associative parallel scans, yields the proposed state-space model. We prove that LinOSS produces stable dynamics only requiring a nonnegative diagonal state matrix. This is in stark contrast to many previous state-space models relying heavily on restrictive parameterizations. Moreover, we rigorously show that LinOSS is universal, i.e., it can approximate any continuous and causal operator mapping between time-varying functions, to desired accuracy.…

8 References

Bartosh, Vetrov, and Naesseth. 2025. “SDE Matching: Scalable and Simulation-Free Training of Latent Stochastic Differential Equations.”

Bastek, Sun, and Kochmann. 2024. “Physics-Informed Diffusion Models.” In.

Cao, Feng, Fang, et al. 2024. “Towards Empowerment Gain Through Causal Structure Learning in Model-Based Reinforcement Learning.” In.

Chen, Richter, Berner, et al. 2024. “Sequential Controlled Langevin Diffusions.” In.

D’Angelo, Croce, and Flammarion. 2024. “Selective Induction Heads: How Transformers Select Causal Structures in Context.” In.

Dhir, Ashman, Requeima, et al. 2024. “A Meta-Learning Approach to Bayesian Causal Discovery.” In.

Dörfler, Zander, Bläser, et al. 2024. “From Probability to Counterfactuals: The Increasing Complexity of Satisfiability in Pearl’s Causal Hierarchy.” In.

Duong, Gupta, and Nguyen. 2024. “Causal Discovery via Bayesian Optimization.” In.

Duque, Aghajohari, Cooijmans, et al. 2024. “Advantage Alignment Algorithms.” In.

Furman, and Lau. 2024. “Estimating the Local Learning Coefficient at Scale.”

Hammond, and Adam-Day. 2025. “Neural Interactive Proofs.” In.

Huang, Li, Lam, et al. 2024. “Competing Large Language Models in Multi-Agent Gaming Environments.” In.

Kiciman, Ness, Sharma, et al. 2024. “Causal Reasoning and Large Language Models: Opening a New Frontier for Causality.” Transactions on Machine Learning Research.

Kim, Kim, Moon, et al. 2024. “Discrete Diffusion Schrödinger Bridge Matching for Graph Transformation.” In.

Kobialka, Sommer, Kwon, et al. 2025. “Approximate Posteriors in Neural Networks: A Sampling Perspective.” In.

Laidlaw, Bronstein, Guo, et al. 2025. “AssistanceZero: Scalably Solving Assistance Games.” In Workshop on Bidirectional Human↔︎AI Alignment.

Lau, Furman, Wang, et al. 2024. “The Local Learning Coefficient: A Singularity-Aware Complexity Measure.”

Li, and Liu. 2024. “Efficient and Trustworthy Causal Discovery with Latent Variables and Complex Relations.” In.

Long, Li, Zhao, et al. 2024. “Inverse Attention Agents for Multi-Agent Systems.” In.

MacKinlay. 2025. “The Ensemble Kalman Update Is an Empirical Matheron Update.”

MacKinlay, Tsuchida, Pagendam, et al. 2025. “Gaussian Ensemble Belief Propagation for Efficient Inference in High-Dimensional Systems.” In Proceedings of the International Conference on Learning Representations (ICLR).

Marks, Rager, Michaud, et al. 2024. “Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models.” In.

Meulemans, Kobayashi, Oswald, et al. 2024. “Multi-Agent Cooperation Through Learning-Aware Policy Gradients.” In.

Ormaniec, Sussex, Lorch, et al. 2024. “Standardizing Structural Causal Models.” In.

Prashant, Ng, Zhang, et al. 2024. “Differentiable Causal Discovery for Latent Hierarchical Causal Models.” In.

Rector-Brooks, Hasan, Peng, et al. 2024. “Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction.” In.

Reizinger, Guo, Huszár, et al. 2024. “Identifiable Exchangeable Mechanisms for Causal Structure and Representation Learning.” In.

Ren, Chen, Rotskoff, et al. 2024. “How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework.” In.

Rusch, and Rus. 2024. “Oscillatory State-Space Models.” In.

Sclocchi, Favero, Levi, et al. 2024. “Probing the Latent Hierarchical Structure of Data via Diffusion Models.” In.

Shariatian, Simsekli, and Durmus. 2024. “Denoising Levy Probabilistic Models.” In.

Si, and Chen. 2024. “Latent-EnSF: A Latent Ensemble Score Filter for High-Dimensional Data Assimilation with Sparse Observation Data.” In.

Skreta, Atanackovic, Bose, et al. 2024. “The Superposition of Diffusion Models Using the Itô Density Estimator.” In.

Sommer, Robnik, Nozadze, et al. 2025. “Microcanonical Langevin Ensembles: Advancing the Sampling of Bayesian Neural Networks.”

Tennant, Hailes, and Musolesi. 2023. “Modeling Moral Choices in Social Dilemmas with Multi-Agent Reinforcement Learning.” In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence.

———. 2024. “Moral Alignment for LLM Agents.” In.

Valencia, Pfaff, and Thuerey. 2024. “Learning Distributions of Complex Fluid Simulations with Diffusion Graph Networks.” In.

Vashishtha, Reddy, Kumar, et al. 2024. “Causal Order: The Key to Leveraging Imperfect Experts in Causal Inference.” In.

Wang, Lu, Cao, et al. 2024. “Neural Causal Graph for Interpretable and Intervenable Classification.” In.

Weng, Chen, and Wang. 2024. “Do as We Do, Not as You Think: The Conformity of Large Language Models.” In.

Willig, Tobiasch, Busch, et al. 2024. “Systems with Switching Causal Relations: A Meta-Causal Perspective.” In.

Wu, Yulun, McConnell, and Iriondo. 2024. “Counterfactual Generative Modeling with Variational Causal Inference.” In.

Wu, Anpeng, Qiu, Chen, et al. 2024. “Causal Graph Transformer for Treatment Effect Estimation Under Unknown Interference.” In.

Xu, and Kankanhalli. 2024. “Strong Preferences Affect the Robustness of Preference Models and Value Alignment.” In.

Yao, Rancati, Cadei, et al. 2024. “Unifying Causal Representation Learning with the Invariance Principle.” In.

Zhang, Zikun, Chen, and Gu. 2024. “Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis.” In.

Zhang, Yanbo, Hartl, Hazan, et al. 2024. “Diffusion Models Are Evolutionary Algorithms.” In.

Zheng, He, Chen, et al. 2024. “Diffusion Bridge Implicit Models.” In.

Footnotes

This sounds like a criticism, but TBH I’m not sure how one could do better. Should the conference be an overlay on WeChat or Xitter or Slack or something?↩︎