SYNTHESIS NOTE

What makes agent-authored code worth persisting and sharing?

Agent-created artifacts like patches, tests, and skill libraries outlive single tasks, but we lack guidance on what should persist, how to maintain consistency across agents, and when persistence is worth the engineering effort.

Synthesis note · 2026-05-28 · sourced from Agent Harness

Among the three elements of agentic code — model capability, harness infrastructure, and agent-initiated artifacts — the survey flags the third as the one that "remains relatively underexplored." Agent-initiated code artifacts are the interactive objects an agent creates, executes, observes, revises, persists, and shares during a task: patches and tests authored over a live repository, interface commands synthesized against DOM trees, hypothesis-testing pipelines composed on the fly, executable policies and skill libraries revised in response to environment feedback. These appear across coding assistance, GUI/OS automation, scientific discovery, and embodied control — yet they sit outside the well-mapped territory of predefined infrastructure.

The open questions cluster around persistence and sharing. When an agent writes code that outlives the current step, what should persist and what should be discarded? When multiple agents share artifacts, how is consistent state maintained, and how is a useful artifact promoted from one-off scratch work to durable, reviewable infrastructure? The survey's listed open challenges (evaluation beyond final task success, verification under incomplete feedback, regression-free harness improvement, consistent shared state across agents, and human oversight for safety-critical actions) converge on exactly this layer. The counterpoint is that some agent-authored code is genuinely disposable, and over-engineering its lifecycle wastes effort. The question still matters, because the artifacts an agent creates may be where the next gains in autonomy and coordination live, and they are precisely what current harness engineering least understands.
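To make the lifecycle question concrete, here is a minimal sketch of one way an artifact could move from scratch work to durable infrastructure while several agents revise it. Everything in it is an assumption made for illustration: the survey does not prescribe this design, and the names (`Artifact`, `ArtifactStore`, `Stage`, `StaleWrite`), the reuse threshold, and the promotion rules are hypothetical. It combines a version check on writes (one simple approach to consistent shared state) with promotion gated on reuse, passing tests, and human approval.

```python
from dataclasses import dataclass
from enum import Enum


class Stage(Enum):
    SCRATCH = "scratch"      # one-off work, safe to discard after the task
    CANDIDATE = "candidate"  # reused and tested, waiting for review
    DURABLE = "durable"      # reviewed and kept as shared infrastructure


@dataclass
class Artifact:
    name: str
    body: str
    stage: Stage = Stage.SCRATCH
    version: int = 0
    reuse_count: int = 0
    tests_passed: bool = False


class StaleWrite(Exception):
    """Raised when an agent revises an artifact it has an outdated view of."""


class ArtifactStore:
    """Hypothetical in-memory store shared by several agents."""

    def __init__(self) -> None:
        self._items = {}

    def put(self, artifact: Artifact) -> None:
        self._items[artifact.name] = artifact

    def get(self, name: str) -> Artifact:
        return self._items[name]

    def revise(self, name: str, body: str, expected_version: int) -> Artifact:
        # Optimistic concurrency: reject the write if another agent got there first.
        art = self._items[name]
        if art.version != expected_version:
            raise StaleWrite(f"{name}: expected v{expected_version}, found v{art.version}")
        art.body = body
        art.version += 1
        art.tests_passed = False  # a revision invalidates earlier test results
        return art

    def promote(self, name: str, human_approved: bool = False, min_reuse: int = 2) -> Stage:
        # Assumed policy: move at most one stage per call, never skip review.
        art = self._items[name]
        if art.stage is Stage.SCRATCH:
            if art.reuse_count >= min_reuse and art.tests_passed:
                art.stage = Stage.CANDIDATE
        elif art.stage is Stage.CANDIDATE:
            if art.tests_passed and human_approved:
                art.stage = Stage.DURABLE
        return art.stage
```

Under this assumed policy, an artifact that is never reused stays at `SCRATCH` and can be discarded cheaply, which leaves room for the counterpoint that some agent-authored code is genuinely disposable.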

Inquiring lines that read this note 11

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

How do standardized protocols improve coordination in multi-agent systems?

How do standardized artifacts improve coordination between writing agents?

How should systems govern persistent agent-generated code in shared infrastructure?

Does externalizing cognitive work and state improve agent reliability?

What makes skills worth externalizing into a persistent harness?

What coordination failures limit multi-agent LLM systems as they scale?

What breaks when multiple agents share and revise the same artifacts?

Related concepts in this collection 2

Can agents learn reusable sub-task routines from past experience?
Do web agents fail at long-horizon tasks because they cannot extract and reuse workflows shared across similar problems? This explores whether sub-task abstraction enables skill accumulation rather than task-by-task problem solving.
Relation to this note: a concrete case of persistent agent-authored artifacts (reusable routines) compounding over time.

Can agents adapt without pausing service to users?
Can deployed LLM agents continuously improve their capabilities while serving users without interruption? This explores whether fast behavioral updates and slow policy learning can coexist across different timescales.
Relation to this note: addresses how agent-created skills should persist and be promoted, the lifecycle this note raises.
