University of Amsterdam at the CLEF 2025 Eloquent Track Evaluating the Influence of Stylistic Prompt Variations on Semantic Interpretation
| Authors | |
|---|---|
| Publication date | 2025 |
| Host editors |
|
| Book title | Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2025) |
| Book subtitle | Madrid, Spain, 9-12 September 2025 |
| Series | CEUR Workshop Proceedings |
| Event | 26th Working Notes of the Conference and Labs of the Evaluation Forum, CLEF 2025 |
| Pages (from-to) | 1435-1442 |
| Number of pages | 8 |
| Publisher | Aachen: CEUR-WS |
| Organisations |
|
| Abstract |
This paper reports on the University of Amsterdam’s participation in the CLEF 2025 Eloquent Track’s Robustness and Consistency Task. Our overall goal is to evaluate the influence of stylistic prompt variations on semantic interpretation. Our specific focus is to investigate how variations in prompt tone, structure, and persona affect the consistency and robustness of responses generated by large language models (LLMs). We approach this through two complementary methods. First, we use a model-as-judge setup to quantify semantic consistency: each stylistic variant prompt is compared to its original base prompt using GPT-4.1 to rate the similarity of the generated responses on a 0–5 scale. Second, we conduct an inductive qualitative analysis on a selected prompt to closely examine how different stylistic framings influence content shifts in model outputs. Our results suggest that prompt reformulations can lead to variations in output, informational content, and tone. |
| Document type | Conference contribution |
| Language | English |
| Published at | https://ceur-ws.org/Vol-4038/paper_115.pdf |
| Other links | https://ceur-ws.org/Vol-4038/ https://www.scopus.com/pages/publications/105019057746 |
| Downloads |
paper_115
(Final published version)
|
| Permalink to this page | |
