Reading Between the Prompts: How Stereotypes Shape LLM’s Implicit Personalization

V. Neplenbroek; Arianna Bisazza; R. Fernández

doi:https://doi.org/10.18653/v1/2025.emnlp-main.1029

Reading Between the Prompts: How Stereotypes Shape LLM’s Implicit Personalization

Authors	V. Neplenbroek Arianna Bisazza R. Fernández
Publication date	2025
Host editors	C. Christodoulopoulos T. Chakraborty C. Rose V. Peng
Book title	The 2025 Conference on Empirical Methods in Natural Language Processing : Proceedings of the Conference
Book subtitle	EMNLP 2025 : November 4-9, 2025
ISBN (electronic)	9798891763326
Event	30th Conference on Empirical Methods in Natural Language Processing, EMNLP 2025
Pages (from-to)	20367-20400
Publisher	Kerrville, TX: Association for Computational Linguistics
Organisations	Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract	Generative Large Language Models (LLMs) infer user’s demographic information from subtle cues in the conversation — a phenomenon called implicit personalization. Prior work has shown that such inferences can lead to lower quality responses for users assumed to be from minority groups, even when no demographic information is explicitly provided. In this work, we systematically explore how LLMs respond to stereotypical cues using controlled synthetic conversations, by analyzing the models’ latent user representations through both model internals and generated answers to targeted user questions. Our findings reveal that LLMs do infer demographic attributes based on these stereotypical signals, which for a number of groups even persists when the user explicitly identifies with a different demographic group. Finally, we show that this form of stereotype-driven implicit personalization can be effectively mitigated by intervening on the model’s internal representations using a trained linear probe to steer them toward the explicitly stated identity. Our results highlight the need for greater transparency and control in how LLMs represent user identity.
Document type	Conference contribution
Note	With checklist
Language	English
Published at	https://doi.org/10.18653/v1/2025.emnlp-main.1029 (Final published version)
Other links	https://github.com/Veranep/implicit-personalization-stereotypes
Downloads	2025.emnlp-main.1029 (Final published version)
Supplementary materials	2025.emnlp-main.1029.checklist
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Reading Between the Prompts: How Stereotypes Shape LLM’s Implicit Personalization