SYNTHESIS NOTE
Model Architecture and Internals Agentic Systems and Tool Use Reasoning, Retrieval, and Evaluation

Can iterative revision cycles match how humans actually write?

Does framing research writing as a diffusion process—where drafts are refined through retrieval-augmented cycles—better capture human cognition than linear pipelines and reduce information loss?

Synthesis note · 2026-05-03 · sourced from Diffusion LLM

Existing deep research agents combine test-time scaling techniques (CoT, best-of-n, MCTS, debate, self-refinement) without a deliberate cognitive design. Most public agents employ a linear or parallelized pipeline of planning → searching → generation, which loses global context and misses critical dependencies. Cognitive studies of human writing (Flower and Hayes, 1981) show that people do not write linearly — they establish a high-level plan, draft based on the plan, and then engage in multiple revision cycles that interleave further information gathering with rewriting.

TTD-DR observes a structural similarity between this human pattern and retrieval-augmented diffusion sampling: a noisy initial draft is iteratively denoised toward higher-quality outputs, with each denoising step informed by retrieved external information. The framework operationalizes this as report-level diffusion — a preliminary draft serves as an updatable skeleton that evolves through iterative refinement, with each step augmented by targeted retrieval. The draft is a global anchor that maintains coherence across iterations, addressing the information-loss problem of linear pipelines.

Two mechanisms make the analogy useful in practice. Denoising with retrieval drives report-level evolution: the draft and research plan jointly steer the next retrieval, and retrieved content drives the next denoising step. Self-evolution operates at the component level: each unit agent (plan generator, question generator, answer searcher, report generator) undergoes its own optimization, mitigating per-component information loss across long agentic trajectories. The interplay is essential — without component-level self-evolution, the draft-level diffusion lacks high-quality context to refine on.

The conceptual yield is that diffusion is not just a generation technique but a process model for cognitively-inspired iterative work. Any task that humans approach as draft-and-revise rather than write-once-correctly — research reports, design documents, complex prose — is a candidate for the same draft-centric, retrieval-augmented diffusion treatment. The draft becomes the persistent state that the agentic system refines, rather than a final output produced by a feed-forward pipeline — analogous to how Why does vanilla RAG produce shallow and redundant results? argues iterative loops are required for depth.

Inquiring lines that use this note as a source 5

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 6

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
17 direct connections · 160 in 2-hop network ·dense cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

research report writing maps onto diffusion sampling — drafts are noisy outputs and revision cycles are denoising steps augmented by retrieval