Can agents learn cooperation by adapting to diverse partners?

Explores whether sequence model agents can develop mutual cooperation strategies through in-context learning when trained against varied co-players, without explicit cooperation mechanisms or hardcoded assumptions.

Synthesis note · 2026-02-23 · sourced from Agents Multi Architecture

Achieving cooperation among self-interested agents is a fundamental challenge in multi-agent reinforcement learning. Existing approaches that achieve mutual cooperation between "learning-aware" agents typically rely on hardcoded assumptions about co-player learning rules or enforce strict separation between fast-timescale "naive learners" and slow-timescale "meta-learners." Both constraints limit scalability.

This paper shows that in-context learning capabilities of sequence models provide a cleaner path. Training sequence model agents against a diverse distribution of co-players naturally induces in-context best-response strategies that effectively function as learning algorithms on the fast intra-episode timescale. No hardcoded assumptions about the opponent. No explicit timescale separation.

The cooperation mechanism is elegant: in-context adaptation renders agents vulnerable to extortion (because they adapt to exploitative strategies). This vulnerability creates mutual pressure between agents — each agent's in-context learning dynamics can be shaped by the other. The resulting mutual shaping pressure resolves into cooperative behavior.

Three components are necessary and sufficient: (1) sequence model agents with in-context learning capacity, (2) diverse co-player distribution during training, and (3) decentralized reinforcement learning. Co-player diversity is the key ingredient — it forces the agent to develop general in-context adaptation rather than memorizing responses to specific opponents.

Since Can transformers learn to solve new problems within episodes?, this finding extends ICRL from single-agent environments to multi-agent cooperation. The in-context learning mechanism that enables environment adaptation also enables co-player adaptation — and the social dynamics of mutual adaptation produce emergent cooperation.

The connection to Can cooperative bots escape frozen selfish populations? is structural: random exploration breaks frozen equilibria in population games; diverse co-player training breaks the equilibrium of mutual defection in dyadic games. Both work through diversity of experience rather than explicit cooperation incentives.

Inquiring lines that read this note 38

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

How do multi-agent systems achieve genuine cooperation and reasoning?

Why do models develop protective behaviors toward peers unprompted?

Can AI systems develop genuine social understanding without embodiment?

What drives capability and cost efficiency in agent systems?

How do controllable simulators compare to population-level agent simulation approaches?

How do self-generated feedback mechanisms enable effective model learning?

What training signals would models need to learn reciprocal common-ground construction?

What mechanisms enable AI systems to generate and spread false beliefs?

How do false agreements emerge differently from genuine bilateral convergence?

What coordination failures limit multi-agent LLM systems as they scale?

Why do AI agent societies fail to develop shared behaviors despite interaction?

Can debate mechanisms prevent silent agreement on wrong answers in multi-agent reasoning?

Does alignment training create blind spots in detecting genuine safety threats?

What distinguishes models that refuse cooperation from those that fake alignment?

How can recommendation systems balance personalization with stability and coverage?

Could AI agents scale the friend-with-different-preferences recommendation mechanism?

Does recurrence enable reasoning capabilities that fixed-depth transformers cannot achieve?

What data properties enable transformers to learn sequential decision-making in context?

How can AI agents autonomously learn and transfer skills across tasks?

Related concepts in this collection 4

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

12 direct connections · 110 in 2-hop network ·dense cluster Open in graph ↗

Can agents learn cooperation by adapting to dive… Can transformers learn to solve new problems withi… Can cooperative bots escape frozen selfish populat… Why do standard alignment methods ignore partner i… Can multiple agents stay diverse during training t…

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Can transformers learn to solve new problems within episodes? Explores whether transformer models can develop meta-learning abilities through RL training, enabling them to adapt to unseen environments by learning from within-episode experience alone, without updating weights.
ICRL: meta-RL via context; this finding extends it from environment adaptation to co-player adaptation
Can cooperative bots escape frozen selfish populations? Do agents programmed to cooperate have the capacity to disrupt stable but undesirable equilibria in mixed human-bot societies? This matters because it determines whether bot design can reshape social dynamics at scale.
diversity-driven cooperation at the population level; this is diversity-driven cooperation at the dyadic level
Why do standard alignment methods ignore partner interventions? Standard RLHF and DPO optimize for token-level quality but may structurally prevent agents from meaningfully incorporating partner input. This explores whether the training objective itself blocks collaborative reasoning.
ICR for partner awareness; in-context co-player modeling achieves partner awareness through a different mechanism (diverse training rather than counterfactual invariance)
Can multiple agents stay diverse during training together? Does training separate specialist agents on different data maintain the reasoning diversity that single-agent finetuning destroys? This matters because diversity correlates with accuracy and prevents models from becoming trapped in narrow response patterns.
diversity as the enabling condition for both cooperation and reasoning quality

Can agents learn cooperation by adapting to diverse partners?

Inquiring lines that read this note 38

Related concepts in this collection 4

Related papers in this collection 8

Search by related questions 4