INQUIRING LINE

Inquiring lines›How should we train models for cap…›What systematic failures and vulne…›How do training priors constrain w…›this inquiring line

If two facts inside an AI both point to the same concepts, can they compete — each making the other weaker?

Can priming from different facts interfere with each other in the same model?

This explores whether learning or being prompted with one fact 'lights up' related concepts in a way that can collide or compete with priming from other facts inside the same model — what the corpus calls knowledge priming and association interference.

This explores whether priming from one fact can collide or compete with priming from another inside the same model. The corpus doesn't have a single paper that stages two facts head-to-head, but it maps the surrounding territory well enough to reason about it. The starting point is that priming is real and surprisingly mechanical: after just a few training exposures, whether a new fact 'primes' a related keyword is predictable from how probable that keyword already was before training — with a sharp threshold (~10^-3) separating contexts where priming takes hold from those where it stays inert Can we predict keyword priming before learning happens?. That predictability is the key clue: if priming strength is governed by pre-existing probability, then two facts that both reach toward the same keyword aren't writing on a blank slate — they're competing for the same prior.

The clearest evidence of interference is the finding that models fail to integrate new context when older, stronger associations from training override it. The parametric prior simply wins, and textual prompting alone can't dislodge it — only direct intervention in the model's representations can Why do language models ignore information in their context?. That is interference in its plainest form: priming from one source (training history) suppressing priming from another (your current input). It also tells you the interference isn't symmetric — strength matters, and the entrenched fact usually beats the fresh one.

There's a deeper structural reason to expect collisions. Models can hold multiple distinct tasks active at once 'in superposition,' but the moment generation begins, autoregressive decoding forces a collapse to a single one after the first token Can LLMs handle multiple tasks at once during inference?. So even when several primed pathways coexist internally, the output channel is a bottleneck that lets only one through — which is exactly the condition under which competing primes interfere rather than blend. Relatedly, prompting can only reorganize and activate knowledge already present; it can't inject anything new Can prompt optimization teach models knowledge they lack?, meaning when you prime with a fact you're really redistributing activation across an existing landscape — and redistribution toward one region pulls it away from another.

Two cross-cutting notes sharpen the picture. Cognitive biases — the model's default leanings — are planted in pretraining and only nudged by later tuning Where do cognitive biases in language models come from?, so the 'prior' that competing primes fight against is deep and durable. And model confidence predicts how stubbornly a model resists having its output swayed Does model confidence predict robustness to prompt changes? — implying interference between primes is strongest exactly where the model is uncertain, and negligible where it's already committed. The thing you might not have known you wanted to know: priming interference isn't random noise, it's a contest decided by prior probability and confidence, where the older, stronger, more probable fact tends to silently overwrite the newer one.

Sources 6 notes

Can we predict keyword priming before learning happens?

Pre-learning keyword probability strongly predicts post-learning priming across architectures and model sizes, with a ~10^-3 threshold separating contexts where priming occurs from those where it doesn't. Just 3 training exposures suffice to establish the effect.

Why do language models ignore information in their context?

Research demonstrates that LMs generate outputs inconsistent with their context because parametric knowledge from training dominates over in-context information. Textual prompting alone cannot override strong priors; causal intervention in representations is required.

Can LLMs handle multiple tasks at once during inference?

Large language models represent multiple complete, computationally distinct tasks simultaneously during inference—a macroscopic phenomenon separate from feature-level superposition. However, autoregressive decoding forces convergence to a single task after the first token, preventing practical multi-task generation.

Can prompt optimization teach models knowledge they lack?

Prompting works entirely within a model's pre-existing training distribution and cannot supply domain knowledge absent from training data. This creates a hard ceiling: no prompt strategy can compensate for missing foundational knowledge, only reorganize what already exists.

Where do cognitive biases in language models come from?

A causal experiment using random-seed variation and cross-tuning showed that models sharing a pretrained backbone exhibit similar bias patterns regardless of finetuning data. Biases are planted during pretraining and merely swayed by instruction tuning.

Show all 6 sources

Does model confidence predict robustness to prompt changes?

ProSA found that when models are highly confident, they resist prompt rephrasing; low confidence causes major output swings. Larger models, few-shot examples, and objective tasks all correlate with higher confidence and greater robustness.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a researcher re-testing claims about fact-priming interference in LLMs. The question remains open: do priming cues from different facts collide or suppress each other within a single model?

What a curated library found — and when (dated claims, not current truth): Findings span 2022–2026; treat these as perishable constraints to verify:

• Priming strength is mechanically predictable from pre-existing keyword probability (~10^-3 threshold); two facts targeting the same keyword compete for the same prior, not a blank slate (2024–2025).
• Models fail to integrate new context when older training associations override it—parametric priors win, and prompting alone cannot dislodge them; this is interference in its plainest form (2025).
• Multiple in-context tasks coexist in superposition internally, but autoregressive generation collapses to a single output after the first token—a bottleneck forcing competing primes to interfere rather than blend (2024).
• Prompting can only activate existing knowledge, never inject new knowledge; priming redistributes activation across a fixed landscape, pulling it away from competing regions (2025).
• Model confidence predicts resistance to output-swaying: interference between primes is strongest where the model is uncertain and negligible where it is already committed (2024–2025).

Anchor papers (verify; mind their dates):
• arXiv:2410.05603 (2024-10): Superposition of in-context tasks and generation collapse.
• arXiv:2502.01142 (2025-02): Knowledge integration and retrieval dynamics.
• arXiv:2507.07186 (2025-07): Cognitive biases rooted in pretraining, nudged by tuning.
• arXiv:2603.29025 (2026-03): Surface heuristics overriding implicit constraints.

Your task:
(1) RE-TEST EACH CONSTRAINT. For every claim above, determine whether newer models (Claude 4, GPT-5 variants), training methods (mixture-of-experts, sparse activation, adaptive routing), tooling (advanced memory harnesses, retrieval augmentation, multi-agent orchestration), or evaluation frameworks have since relaxed or overturned it. Separate the durable question (competing activation budgets in autoregressive systems) from perishable limits (confidence thresholds, priming windows). Cite what resolved it.
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last ~6 months—especially papers showing successful multi-fact priming, interference mitigation, or knowledge injection methods.
(3) Propose 2 research questions that assume the regime may have shifted: e.g., can routing layers (sparse MoE) or explicit attention masking bypass bottleneck collapse? Can adapter-based tuning inject facts orthogonally to avoid prior competition?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

If two facts inside an AI both point to the same concepts, can they compete — each making the other weaker?

Related lines of inquiry

Sources 6 notes

Papers this line draws on 8