University of Amsterdam at the CLEF 2025 Eloquent Track

Bruno N. Sotic; Jaap Kamps

University of Amsterdam at the CLEF 2025 Eloquent Track: Evaluating the Influence of Stylistic Prompt Variations on Semantic Interpretation

Authors	Bruno N. Sotic; Jaap Kamps
Publication date	2025
Host editors	G. Faggioli; N. Ferro; P. Rosso; D. Spina
Book title	Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2025)
Book subtitle	Madrid, Spain, 9-12 September 2025
Series	CEUR Workshop Proceedings
Event	26th Working Notes of the Conference and Labs of the Evaluation Forum, CLEF 2025
Pages (from-to)	1435-1442
Number of pages	8
Publisher	Aachen: CEUR-WS
Organisations	Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract	This paper reports on the University of Amsterdam’s participation in the CLEF 2025 Eloquent Track’s Robustness and Consistency Task. Our overall goal is to evaluate the influence of stylistic prompt variations on semantic interpretation. Our specific focus is to investigate how variations in prompt tone, structure, and persona affect the consistency and robustness of responses generated by large language models (LLMs). We approach this through two complementary methods. First, we use a model-as-judge setup to quantify semantic consistency: each stylistic variant prompt is compared to its original base prompt using GPT-4.1 to rate the similarity of the generated responses on a 0–5 scale. Second, we conduct an inductive qualitative analysis on a selected prompt to closely examine how different stylistic framings influence content shifts in model outputs. Our results suggest that prompt reformulations can lead to variations in output, informational content, and tone.
Document type	Conference contribution
Language	English
Published at	https://ceur-ws.org/Vol-4038/paper_115.pdf (Final published version)
Other links	https://ceur-ws.org/Vol-4038/; https://www.scopus.com/pages/publications/105019057746
Downloads	paper_115 (Final published version)

