Evolutionary Methods

Can evaluators improve alongside the agents they score?

Most self-improving systems rely on a fixed benchmark or verifier. But what if the evaluator itself learned and adapted as the agent improved? This note explores whether co-evolving the evaluator with the agent unlocks tasks that resist static scoring.

Do self-improving agents really split into two distinct loops?

This note asks whether modern self-improving agents can be understood through a clean abstraction that separates fast scaffold updates from slow model-weight updates, and whether that framework actually explains the field's recent progress.

How can agent self-evolution be made safe and auditable?

As agents begin updating their own prompts and tools, how can we track these changes, measure their effects, and safely reverse problematic updates? This matters because untracked evolution leads to unmaintainable systems and makes regressions impossible to diagnose.

Why do LLM agents ignore condensed experience summaries?

LLM agents faithfully learn from raw experience but systematically disregard condensed summaries of the same experience. This study investigates whether the problem lies in how the summaries are made, in how models process them, or in the models simply not needing them.

Why do fixed benchmarks fail as agents get stronger?

Static evaluation criteria become vulnerable to gaming as optimizers improve. Does this fundamental problem require dynamic objectives rather than better static metrics?