INQUIRING LINE

Inquiring lines›How should we train models for cap…›What systematic failures and vulne…›Does fine-tuning modify underlying…›this inquiring line

If you can't collect any examples from a new domain, describing it in plain English might be all an AI needs to adapt.

What access constraints allow description-based adaptation but block conventional techniques?

This explores the specific gap where you can't reach the target — no labeled examples, no access to the model's weights, no sample collection — yet a plain-language description of the new domain is still enough to adapt the system. The constraint that makes description-based methods shine is *zero-target-access*: you know what the new domain looks like, but you can't get documents or queries from it to fine-tune on the normal way. The clearest case is retrieval adaptation, where a brief textual description of a domain is used to synthesize training data, beating baselines precisely in the scenarios where conventional fine-tuning has nothing to chew on Can you adapt retrieval models without accessing target data?. Description substitutes for the missing collection.

There's a useful map for why this works: specialization techniques live on a ladder of access — black-box (you only get to send prompts), grey-box (you see some internals like token probabilities), and white-box (you can change weights) Does model access level determine which specialization techniques work?. Conventional adaptation lives at the white-box end and gets blocked the moment you lose weight access or training data. A domain description is a black-box move — it activates and steers knowledge the model already has rather than injecting new parameters. That's also the ceiling: black-box methods can only surface existing capability, so when the new knowledge genuinely isn't in the model, description alone won't conjure it.

The same access logic shows up in agents that adapt without ever touching their weights. Reflexion stores verbal self-critiques in episodic memory and improves across attempts on nothing but a binary success/failure signal Can agents learn from failure without updating their weights?, and AgentFly formalizes the whole learning loop as memory operations, reaching strong benchmark scores with the underlying model frozen Can agents learn continuously from experience without updating weights?. When you can't do gradient updates, text becomes the adaptation channel. Fast-Slow Training makes this explicit by deliberately routing task-specific lessons into optimized prompts instead of weights — and finds that doing so dodges the catastrophic forgetting that conventional fine-tuning incurs Can splitting adaptation into two channels reduce forgetting?.

The quiet catch worth knowing: description-based and context-based adaptation assume the model will actually *use* what you tell it. But models often ignore in-context information when their training priors are strong, generating outputs that contradict the very description you supplied Why do language models ignore information in their context?. So the access constraint cuts both ways — text gets you in the door when data and weights are off-limits, but a textual instruction only steers a model that's willing to be steered. Description-based adaptation is best understood not as a weaker substitute for fine-tuning, but as the technique matched to a different constraint: you have words, not data, and you can prompt, not retrain.

Sources 6 notes

Can you adapt retrieval models without accessing target data?

Research demonstrates that a brief textual domain description suffices to generate synthetic training data for retrieval fine-tuning, outperforming baselines in zero-target-access scenarios and enabling adaptation where conventional methods are blocked.

Does model access level determine which specialization techniques work?

Three tiers of access—black-box, grey-box, and white-box—create a hierarchy of specialization power. Black-box techniques can only activate existing knowledge; white-box methods can inject new knowledge but risk over-specialization.

Can agents learn from failure without updating their weights?

Reflexion demonstrates that unambiguous environmental feedback (success/failure) enables agents to write useful self-diagnoses and improve across episodes without parameter updates. The binary signal prevents rationalization, and keeping reflections uncompressed preserves their usability.

Can agents learn continuously from experience without updating weights?

AgentFly formalizes agent learning as a Memory-augmented MDP with three memory modules (case, subtask, tool) that enable credit assignment and policy improvement entirely through memory operations. The approach achieved 87.88% on GAIA validation without modifying LLM parameters.

Can splitting adaptation into two channels reduce forgetting?

Fast-Slow Training routes task-specific lessons into optimized prompts while keeping parameter updates minimal, reaching equivalent performance 1.4–3x faster with substantially less catastrophic forgetting and plasticity loss, demonstrating that forgetting is a misallocation problem rather than an inherent cost.

Show all 6 sources

Why do language models ignore information in their context?

Research demonstrates that LMs generate outputs inconsistent with their context because parametric knowledge from training dominates over in-context information. Textual prompting alone cannot override strong priors; causal intervention in representations is required.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs2.60 match · arxiv ↗
Useful Memories Become Faulty When Continuously Updated by LLMs1.77 match · arxiv ↗
The AI Hippocampus: How Far are We From Human Memory?1.67 match · arxiv ↗
Dense Retrieval Adaptation using Target Domain Description0.90 match · arxiv ↗
ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory0.90 match · arxiv ↗
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning0.89 match · arxiv ↗
Continual Learning Bench: Evaluating Frontier AI Systems in Real-World Stateful Environments0.88 match · arxiv ↗
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization0.88 match · arxiv ↗

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst updating a synthesis on description-based adaptation in LLMs. The precise question: *which access constraints allow plain-language domain descriptions to steer model behavior, and why do they fail?*

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026. The library identified:
- Description-based adaptation succeeds in zero-target-access regimes: you can steer retrieval and task behavior with textual domain summaries when you cannot collect labeled examples or modify weights (2023–2024).
- Black-box methods (prompting, memory, in-context learning) activate existing model knowledge without gradient updates; white-box methods (fine-tuning) are blocked when data or weights are inaccessible (2023).
- Verbal reflection stored as episodic memory (Reflexion, AgentFly) enables continual agent adaptation without weight updates, routing task-specific learning into text rather than parameters (2024).
- Models often ignore in-context instructions when training priors are strong, contradicting supplied descriptions — a fundamental ceiling on description-based steering (2024).
- Fast-Slow Training deliberately splits adaptation: textual prompts handle task-specifics, frozen weights hold general knowledge, avoiding catastrophic forgetting seen in conventional fine-tuning (2026).

Anchor papers (verify; mind their dates):
- arXiv:2307.02740 (2023-07): Dense Retrieval Adaptation using Target Domain Description
- arXiv:2310.10134 (2023-10): CLIN: Continually Learning Language Agent for Rapid Task Adaptation
- arXiv:2605.12484 (2026-05): Learning, Fast and Slow: Towards LLMs That Adapt Continually
- arXiv:2605.12978 (2026-05): Useful Memories Become Faulty When Continuously Updated by LLMs

Your task:
(1) RE-TEST EACH CONSTRAINT. For every finding above, judge whether newer models (Claude 3.5, GPT-4o, open-source 70B+), in-context learning length, routing mechanisms (mixture-of-experts, adaptive routing), or better memory architectures have since relaxed the in-context-instruction-ignoring problem or the zero-target-access ceiling. Separate the durable question (description-based steering likely still asymptotic to model priors) from the perishable limitation (perhaps multi-stage prompting or chain-of-thought now overcomes prior override).
(2) Surface the strongest work from late 2024–2026 that contradicts or supersedes the "description alone cannot inject genuinely novel knowledge" claim — or confirm it still holds.
(3) Propose 2 research questions that assume the regime may have shifted: (a) Can adaptive routing (e.g., dynamic LoRA insertion without retraining) combine black-box description-steering with parameter-light white-box tuning? (b) Does multi-hop retrieval of synthetic in-domain examples, derived from descriptions, overcome the "contradiction with training priors" failure mode?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

If you can't collect any examples from a new domain, describing it in plain English might be all an AI needs to adapt.

Related lines of inquiry

Sources 6 notes

Papers this line draws on 8