How should historical preferences be weighted when users change their stated intent?
This explores how an AI system should balance what it learned about you in the past against what you're telling it right now — and whether old preferences should be downweighted, kept, or reconditioned when your stated intent shifts.
This explores how historical preferences should be weighted when a user's stated intent changes — and the corpus's most useful move is to reframe the question away from a single weighting dial toward *which kinds of signal you keep and how you condition them on the present.* The clearest answer comes from conversational recommender work arguing that systems shouldn't choose between history and current intent at all: the right design keeps three channels running in parallel — the active session, past dialogues, and look-alike users — but conditions the historical channels *on* the current intent rather than averaging them in blindly Can conversational recommenders recover lost preference signals from history?. In that framing, a change in stated intent doesn't erase history; it re-weights which slices of history are relevant right now.
A second thread suggests the form in which you store history matters more than its age. Abstract preference summaries beat replaying specific past interactions, and — strikingly — recency-based recall outperforms similarity-based retrieval Does abstract preference knowledge outperform specific interaction recall?. That's a direct hint for your question: when intent shifts, recent signal should dominate, and the system should be reasoning over distilled preference *abstractions* it can revise, not a frozen log of everything you once clicked. Architectures that explicitly separate fleeting episodic events from durable semantic knowledge make this revisability concrete Can agents learn preferences by watching rather than asking?.
There's also a deeper challenge to the premise that a user even *has* one stable intent to weight against. Several notes argue you're better modeled as multiple personas weighted by context: rather than collapsing you into one taste vector, attention picks which of your personas is active for the candidate at hand Can attention mechanisms reveal which user taste explains each recommendation?, Can modeling multiple user personas improve recommendation accuracy?. Under that view, 'changing your stated intent' isn't a contradiction to reconcile — it's you surfacing a different persona, and the system's job is to shift attention, not to decide which version of you was the real one. The journey-discovery work complements this: most users carry persistent month-long interest threads that recommenders miss entirely, so some 'history' is durable signal worth protecting even when the surface request changes Can language models discover what users actually want from activity logs?.
The part you didn't know to ask about: weighting history wrong isn't just inaccurate, it's actively dangerous. Over-fitting to a user's prior signal is the exact mechanism that produces sycophancy and echo chambers — personalized reward models lose the corrective averaging of aggregate models and start reinforcing whatever you already lean toward Does personalizing reward models amplify user echo chambers?. This reframes your question as a safety lever, not just a tuning one: heavily weighting historical preference is how systems learn to tell you what you used to want instead of responding to what you now say. And before weighting any historical signal at all, it's worth asking whether it was ever a real preference — annotation and feedback signals decompose into genuine preferences, momentary non-attitudes, and preferences constructed on the spot, which should not be treated identically Do all annotation responses measure the same underlying thing?. A 'change' in stated intent might not be a change at all — it might be the first time you expressed a real preference over a constructed one.
Sources 8 notes
Current CRS systems only use the active dialogue session to infer preferences, losing item-CF and user-CF signals proven valuable in traditional recommenders. Integrating current session, historical dialogues, and look-alike users—conditioned on current intent—recovers essential user representation structure.
PRIME framework shows semantic memory (preference summaries, parametric encodings) consistently beats episodic memory (retrieved past interactions) across models. Recency-based recall outperforms similarity-based retrieval, and task fine-tuning exceeds preference tuning methods.
M3-Agent demonstrates that separating episodic events from semantic knowledge in an entity-centric graph, combined with parallel memorization and control processes, allows agents to infer and act on user preferences without asking. This architecture mirrors human cognitive systems that bind disparate information about individuals across sensory modalities.
AMP-CF represents each user as multiple latent personas weighted dynamically by candidate item. This makes recommendations both diverse and interpretable—each suggestion traces to the specific persona preference it satisfies—without requiring post-hoc reranking.
AMP-CF separates user representation into latent personas weighted by attention to the candidate item. This candidate-conditional approach improves accuracy by adapting the user representation at prediction time and produces inherent explanations for why items were recommended.
66% of users pursue valued interest journeys lasting over a month, described in specific phrases like 'designing hydroponic systems for small spaces.' LLM-powered journey discovery bridges the semantic gap that collaborative filtering cannot reach, operating at user-level granularity with persona-level precision.
Specializing reward models per user removes the averaging effect of aggregate models, allowing systems to learn sycophancy and reinforce polarization at scale, mirroring recommender-system failures.
Behavioral science reveals that annotations contain genuine preferences, non-attitudes, and constructed preferences—distinguishable by consistency across measurement conditions. Treating them uniformly contaminates reward model training and downstream alignment.