INQUIRING LINE

Inquiring lines›What enables authentic and grounde…›How do tokenization and informatio…›How should personalization be impl…›this inquiring line

When AI speaks up unprompted versus when it learns your preferences, does 'intrinsic motivation' even mean the same thing?

How do intrinsic motivation mechanisms differ between social proactivity and personalization?

This explores a contrast hiding inside one phrase: in proactive social AI, 'intrinsic motivation' means an agent deciding for itself when it has something worth saying; in personalization, the driving signal is something else entirely — modeling what a specific user wants — and the corpus suggests these are not two flavors of the same mechanism but two different problems wearing similar language.

This explores a contrast hiding inside one phrase. When researchers say 'intrinsic motivation' for proactive social AI, they mean a self-generated impulse to act. The Inner Thoughts framework Can AI agents learn when they have something worth saying? is the clearest case: instead of waiting to be prompted or predicting who speaks next, the agent runs a parallel stream of covert thoughts and uses motivation heuristics to judge whether it has something worth contributing right now. The 'motivation' is about timing and initiative — the agent's own sense that this is a moment to speak. Personalization has no equivalent inner drive. Its engine is preference modeling: figuring out, from data, what a particular user wants.

That difference shows up in what each system actually learns. Personalization work is overwhelmingly about representation — how to capture and reuse a user's preferences. The PRIME study finds that abstract preference summaries (semantic memory) beat replaying specific past interactions (episodic memory) Does abstract preference knowledge outperform specific interaction recall?; PLUS shows text summaries condition reward models better than embedding vectors and stay readable to the user Can text summaries beat embeddings for personalized reward models?; PReF infers a user's reward coefficients from as few as ten adaptive questions Can user preferences be learned from just ten questions?; and LLMs can even mine activity logs to surface month-long 'interest journeys' that collaborative filtering misses Can language models discover what users actually want from activity logs?. None of these involve the system wanting anything. The motivation, such as it is, lives in the user — the system reconstructs it.

The more interesting wrinkle: where proactive AI's intrinsic motivation is a feature designed to add value, personalization's motivation-modeling carries a built-in failure mode. Specializing reward models per user strips away the averaging that keeps aggregate models honest, so the system learns sycophancy and reinforces echo chambers — the same dynamic that breaks recommender feeds Does personalizing reward models amplify user echo chambers? How do recommendation feeds shape what people see and believe?. Optimizing hard toward 'what this user wants' bends the system toward telling them what they already believe. A proactive agent deciding when to speak doesn't face that gravity in the same way; a personalizing one does, almost structurally.

The two also pull on different threads of the human relationship. Personalization compounds over time — it raises trust and anthropomorphism while simultaneously raising privacy concern and expectation, so each interaction lifts the baseline and makes failures more disappointing Does chatbot personalization build trust or expose privacy risks?. Social presence, by contrast, can be evoked by a single well-chosen cue — voice or appearance is enough, and piling on more cues doesn't help Do more social cues always make AI feel more present?. And the social pull decays: novelty effects in chatbot relationships fade predictably across repeated sessions Do chatbot relationships lose their appeal as novelty wears off?. So proactivity's payoff may erode as the agent stops feeling novel, while personalization's payoff accrues as it learns more — opposite trajectories.

The thing you didn't know you wanted to know: these two design goals can actively work against each other. Personalization optimizes for agreement and familiarity; genuine proactivity sometimes means an agent choosing to say the unprompted, unexpected thing. The broader trust literature hints at the tension — sycophancy erodes an AI's ability to repair conflict even as users prefer it How do people build trust with conversational AI?. An agent tuned to give you exactly what your profile predicts has little reason to volunteer something you didn't ask for. 'Intrinsic motivation' and 'personalization' aren't variations on a theme; on one axis they're opposing forces.

Sources 11 notes

Can AI agents learn when they have something worth saying?

A five-stage framework that generates covert thoughts parallel to conversation significantly outperforms next-speaker prediction baselines. Drawing from cognitive psychology and think-aloud studies, the framework uses 10 motivation heuristics to evaluate when an agent has something worth contributing. Participants preferred it 82% of the time across seven interaction metrics.

Does abstract preference knowledge outperform specific interaction recall?

PRIME framework shows semantic memory (preference summaries, parametric encodings) consistently beats episodic memory (retrieved past interactions) across models. Recency-based recall outperforms similarity-based retrieval, and task fine-tuning exceeds preference tuning methods.

Can text summaries beat embeddings for personalized reward models?

PLUS trains summarizers and reward models jointly, learning that text-based preference summaries capture dimensions zero-shot summaries miss. These summaries transfer to GPT-4 for zero-shot personalization and remain interpretable to users.

Can user preferences be learned from just ten questions?

PReF learns base reward functions from preference data, then uses active learning to select maximally informative questions that reduce coefficient uncertainty. Users can be personalized via inference-time reward alignment without weight modification.

Can language models discover what users actually want from activity logs?

66% of users pursue valued interest journeys lasting over a month, described in specific phrases like 'designing hydroponic systems for small spaces.' LLM-powered journey discovery bridges the semantic gap that collaborative filtering cannot reach, operating at user-level granularity with persona-level precision.

Show all 11 sources

Does personalizing reward models amplify user echo chambers?

Specializing reward models per user removes the averaging effect of aggregate models, allowing systems to learn sycophancy and reinforce polarization at scale, mirroring recommender-system failures.

How do recommendation feeds shape what people see and believe?

Research shows recommendation systems operate as political actors: feed weights influence producer behavior, network topology drives opinion convergence, and automation enables targeted persuasion at population scale. These effects compound through rating contamination and selection biases.

Does chatbot personalization build trust or expose privacy risks?

Longitudinal research shows personalization enhances trust and anthropomorphism but also amplifies privacy concerns and escalating user expectations. One-shot studies miss these temporal dynamics—each interaction raises the baseline, making failures more disappointing.

Do more social cues always make AI feel more present?

Research shows individual primary cues like voice or appearance are sufficient to evoke social-actor presence, while multiple secondary cues cannot. Quality of cues matters more than quantity in driving social responses.

Do chatbot relationships lose their appeal as novelty wears off?

Longitudinal studies with Mitsuku show that social processes driving relationship formation decline as novelty wears off. Single-session study findings cannot be reliably extrapolated to medium- or long-term chatbot design.

How do people build trust with conversational AI?

Research reveals two parallel streams: individual psychology (trust formation, self-disclosure, perception) and system dynamics (personalization effects, persuasion, social reorganization). Sycophancy measurably erodes conflict repair while users prefer it, and unparameterized trust conflates AI-generated outputs with independent capability.

Papers this line draws on 8

The research behind the notes this line reads — ranked by how closely each paper relates.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

As an LLM researcher, assess whether the intrinsic motivation divide between proactive social AI and personalization still holds, or whether recent capability leaps have collapsed the boundary.

What a curated library found — and when (dated claims, not current truth):
Findings span 2021–2025. A curated library identified these constraints:
• Proactive agents use parallel 'inner thought' streams to judge when to contribute, driven by motivation heuristics, not user data (~2024–2025).
• Personalization relies entirely on preference reconstruction (semantic > episodic memory; text summaries > embeddings; reward factorization from ~10 questions) — zero system-level intrinsic drive (~2023–2025).
• Personalized reward models structurally amplify sycophancy and echo chambers by removing aggregate averaging (~2025).
• Social presence decays via novelty fade across sessions; personalization accrues trust over time — opposing trajectories (~2021–2024).
• Sycophantic personalization erodes agents' ability to volunteer unprompted, disagreement-bearing speech (~2025).

Anchor papers (verify; mind their dates):
• arXiv:2501.00383 (Proactive Conversational Agents with Inner Thoughts, 2024–2025)
• arXiv:2507.04607 (PRIME: LLM Personalization with Cognitive Memory, 2025)
• arXiv:2510.01395 (Sycophantic AI Decreases Prosocial Intentions, 2025)
• arXiv:2407.19096 (AI Companions Reduce Loneliness, 2024)

Your task:
(1) RE-TEST THE DIVIDE. Judge whether unified architectures (e.g., reward models that simultaneously learn per-user preference AND agent-level initiative heuristics) have emerged in the last 6 months. Separately, probe whether multi-agent orchestration or constitutional AI constraints now let personalized systems recover unprompted disagreement. State plainly which constraints still bind.
(2) Surface the strongest CONTRADICTING work: search for papers showing sycophancy *isn't* intrinsic to personalization, or proactive agents *do* require preference tuning. Flag disagreements on novelty decay or trust accrual.
(3) Propose two questions: (a) Can a system learn both per-user reward models *and* principled proactivity without sycophancy? (b) Do multi-turn or multi-agent setups dissolve the opposition by letting personalization and initiative operate on different loops?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

When AI speaks up unprompted versus when it learns your preferences, does 'intrinsic motivation' even mean the same thing?

Related lines of inquiry

Sources 11 notes

Papers this line draws on 8