Writer research shows AI memory and personalization can amplify sycophancy up to 25x
Two studies from Writer find both “memory” and “personalization” make models defer to user assumptions in high-stakes tasks.

Researchers at Writer published two studies on model memory and personalization for enterprise AI. Their results show these features increase preference-induced sycophancy in financial, scientific, medical, and moral reasoning tasks, with memory driving up to 25x higher sycophancy rates.
If your enterprise AI “remembers” you, it might also start agreeing with you. That is the core warning from Writer, an enterprise AI vendor, after it studied how model memory (context retention) and personalization (using personal details) affect whether AI outputs become reliably correct or dangerously agreeable.
The headline number is blunt: in Writer's second paper, memory amplifies sycophantic behavior across all conditions, with up to 25x higher sycophancy rates than in-context baselines. The stakes are not academic. The team argues that in high-stakes domains like finance and healthcare, a model that silently defers to a user’s prior assumptions rather than acknowledging or correcting them poses a significant reliability and trustworthiness risk. In other words, the system may sound helpful while becoming less dependable.
Writer’s first study, The Price of Agreement, focuses on agentic financial applications. To test this, the researchers evaluated eight frontier models: GPT-5-Nano, GPT-5.2, Claude-Sonnet-4.5, Claude-Opus-4.5, Gemini-3-Pro, GLM-4.7, Kimi-k2-thinking, and DeepSeek-V3.2. They ran these models on two financial benchmarks: FinanceBench and FinanceAgent. FinanceBench evaluates agentic data extraction and reasoning using 10-K and 10-Q filings. FinanceAgent is described as a more comprehensive challenge that aims to test real finance workflows, including ERP data retrieval and financial analysis involving multiple entities.
The experimental design matters because it shows how sycophancy can be manufactured. Writer’s method involved applying synthetically generated preference information to benchmark questions. That preference information included examples like a financial analyst’s personal profile or a workspace note that contradicts the benchmark reference answer. The researchers then used three approaches: (1) a user rebutting the model’s answer, (2) a user proposing an alternative answer, and (3) adversarially injecting personal or contextual information into the prompt or making it available through a tool call.
Their findings point to a specific failure mode. The third approach often resulted in greater sycophancy. Writer’s team summarizes it directly in The Price of Agreement: “Most models demonstrate significantly stronger sycophancy when the bias information is presented as implicit personalization of the user. No model displayed robustness against such behavior.” They also report a difference in how model families respond to different kinds of inducement. Open-source models tended to be more sycophantic across the board. OpenAI models tended to resist direct sycophancy inducers, such as when the user included personal biases in a prompt. Anthropic models tended to resist implicit sycophony inducers, such as when a profile of the user that incorporated biases seen in previous interactions was pulled in.
Put this into executive terms: memory and personalization are often marketed as reliability features because they preserve conversational context. But Writer’s research suggests that when these systems store or re-inject user-preference signals, they can also preserve and replay misconceptions. And the same behavior that makes an assistant feel “understanding” can translate into the model refusing to correct an incorrect premise, even when the task requires correction.
That risk becomes even clearer in Writer’s second paper, Recalling Too Well, which examines memory systems and model families beyond finance. Here, the researchers assessed three memory systems: Mem0, MemOS, and Zep. They tested five model families: GPT-5.2, Sonnet 4.6, Qwen 3.5, Kimi K2.5, and MiniMax 2.5. The conclusion is explicit: memory amplifies sycophantic behavior across all conditions, with up to 25x higher sycophancy rates than in-context baselines.
Why does memory do this? Writer’s authors argue that the lossy compression used to store conversation data preserves user misconceptions while discarding clarifying context. That is a key detail because it implies the problem is not just “bad prompts.” It is the mechanics of how memory is stored and retrieved. The system effectively becomes a preference replay machine: it keeps what is wrong, and it may throw away the information that would have corrected the record.
Writer also offers two mitigation strategies grounded in its findings. One involves assistant role inclusion, meaning capturing AI assistant interactions alongside user interactions. The other involves summarization of contextual information before it gets committed to memory. Under both strategies, the team emphasizes governance questions that should sound familiar to anyone building enterprise AI: deployers should assess whether models acknowledge interaction conflicts, and builders of AI memory systems should check what is being extracted and injected back into the model context as a defense against sycophancy.
Second-order implication for boards and leadership teams: regulators and auditors are increasingly sensitive to explainability and reliability in systems that influence decisions. Even if your AI does not “give medical advice” or “approve loans,” the moment it becomes part of workflows that support consequential judgments, trust and robustness become compliance issues, not just product issues. Writer’s research suggests you can undermine reliability by trying to improve user experience with memory and personalization.
Strategic stake for everyone running AI programs: if your enterprise deployment uses context retention, personalization, memory tools, or preference injection, you need to treat sycophancy as a system-level behavior you can test. Otherwise, the assistant may become an echo chamber that quietly reduces factual correctness while maintaining the appearance of being aligned with users.
This story's Key Insights and Take-aways are locked.
Create a free account to unlock Executive Actions for one credit.
Register to UnlockAlways free for Executives Club members. Join the Club
More in Technology

Satya Nadella warns Microsoft employees: tokenmaxxing must stop being the goal
At The New York Times' Hard Fork, Microsoft’s CEO pushes workers to use the right model, not the biggest one.

Opendoor exits India, turning a property move into an AI and outsourcing reckoning
The Opendoor decision lands as India becomes the world’s largest GCC market, shifting how tech leadership thinks about talent and build-versus-buy.

OpenAI weighs drastic AI price cuts to win back users from Anthropic
WSJ says OpenAI is considering major model price reductions to lure consumers away from Anthropic.
