SYNTHESIS NOTE

Why do paraphrased definitions work better than expert ones?

When instructing LLMs to classify argument schemes, should we use formal Walton definitions or LLM-generated paraphrases? This explores which source better enables reliable scheme recognition and why.

Synthesis note · 2026-05-18 · sourced from Argumentation

When the task is to tell an LLM what an argument scheme is so it can recognize one, two strategies are available: paste in the formal Walton definition (the normative source) or generate a description with another LLM (operational paraphrase). Intuition says the formal definition wins — it is the source of truth, written by domain experts. The evaluation shows the opposite. LLM-generated descriptions yield better classification performance than formal definitions.

The mechanism is worth taking seriously because it inverts a common assumption in prompt engineering. Formal definitions are written for readers who already share a technical vocabulary. They presuppose the reader can decode terms like "presumptive inference," "warrant," and "defeasible conclusion." An LLM-generated description rewrites the scheme in the model's native distribution: less precise, more redundant, anchored to examples and paraphrases the model has seen during training. The model understands its own paraphrase better than it understands the original.

This is operationalization-beats-definition as a prompting principle. The same lesson appears in instruction-tuned datasets where rewriting expert instructions in conversational style outperforms preserving the original. The model is not "dumb" for failing on the formal definition; it is reading the definition through a distribution shaped by web text, where formal logical vocabulary is rare. Paraphrasing into the training distribution is the cheap fix.

The deeper implication is that normative sources and operational prompts are different artifacts. A normative source aims for unambiguous truth; an operational prompt aims for reliable behavior. The two optimize different objectives and produce different texts. For task instructions, optimize for the second.

Inquiring lines that read this note 6

This note is a source for these research framings, grouped by the broader line of inquiry each explores. Scan the bold lines of inquiry; follow any specific question forward.

How faithfully do LLMs reflect their actual reasoning in outputs and explanations?

Why do reasoning models fail at systematic problem-solving and search?

Can prompting strategies overcome LLM biases without model fine-tuning?

Can LLM-generated descriptions of schemes outperform formal dictionary definitions for prompting?

Do language models perform faithful symbolic reasoning independent of semantic grounding?

Why do LLM descriptions of argument schemes work better than formal definitions for classification?

Related concepts in this collection 2

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

13 direct connections · 112 in 2-hop network ·medium cluster Open in graph ↗

Why do paraphrased definitions work better than … Can large language models classify argument scheme… Can structured argument prompts make LLM reasoning…

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Can large language models classify argument schemes reliably? Explores whether LLMs can recognize Walton's 60+ argument schemes—abstract patterns of reasoning rather than surface features—and what conditions enable accurate classification.
same paper, the size-and-format dependency that motivates description-based prompting
Can structured argument prompts make LLM reasoning more rigorous? Does requiring language models to explicitly check warrants, backing, and rebuttals—rather than reasoning freely—improve reasoning quality and catch failures that standard step-by-step prompting misses?
another case where operationalizing argument theory into prompt structure beats handing models the theory directly

Why do paraphrased definitions work better than expert ones?

Inquiring lines that read this note 6

Related concepts in this collection 2

Related papers in this collection 8

Search by related questions 4