University of Amsterdam at the CLEF 2025 Eloquent Track Evaluating the Influence of Stylistic Prompt Variations on Semantic Interpretation

Open Access
Authors
Publication date 2025
Host editors
  • G. Faggioli
  • N. Ferro
  • P. Rosso
  • D. Spina
Book title Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2025)
Book subtitle Madrid, Spain, 9-12 September 2025
Series CEUR Workshop Proceedings
Event 26th Working Notes of the Conference and Labs of the Evaluation Forum, CLEF 2025
Pages (from-to) 1435-1442
Number of pages 8
Publisher Aachen: CEUR-WS
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract

This paper reports on the University of Amsterdam’s participation in the CLEF 2025 Eloquent Track’s Robustness and Consistency Task. Our overall goal is to evaluate the influence of stylistic prompt variations on semantic interpretation. Our specific focus is to investigate how variations in prompt tone, structure, and persona affect the consistency and robustness of responses generated by large language models (LLMs). We approach this through two complementary methods. First, we use a model-as-judge setup to quantify semantic consistency: each stylistic variant prompt is compared to its original base prompt using GPT-4.1 to rate the similarity of the generated responses on a 0–5 scale. Second, we conduct an inductive qualitative analysis on a selected prompt to closely examine how different stylistic framings influence content shifts in model outputs. Our results suggest that prompt reformulations can lead to variations in output, informational content, and tone.

Document type Conference contribution
Language English
Published at https://ceur-ws.org/Vol-4038/paper_115.pdf
Other links https://ceur-ws.org/Vol-4038/ https://www.scopus.com/pages/publications/105019057746
Downloads
paper_115 (Final published version)
Permalink to this page
Back