Dangerous criminals and beautiful prostitutes? Investigating harmful representations in Dutch language models

Open Access
Authors
Publication date 2025
Book title ACM FAccT '25
Book subtitle Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency : June 23-26, 2025, Athens, Greece
ISBN (electronic)
  • 9798400714825
Event FAccT '25 : ACM Conference on Fairness, Accountability, and Transparency
Pages (from-to) 1005-1014
Publisher New York, New York: Association for Computing Machinery
Organisations
  • Faculty of Law (FdR) - Institute for Information Law (IViR)
  • Faculty of Law (FdR)
  • Faculty of Social and Behavioural Sciences (FMG) - Amsterdam School of Communication Research (ASCoR)
Abstract
While language-based AI is becoming increasingly popular, ensuring that these systems are socially responsible is essential. Despite their growing impact, large language models (LLMs), the engines of many language-driven applications, remain largely in the black box. Concerns about LLMs reinforcing harmful representations are shared by academia, industries, and the public. In professional contexts, researchers rely on LLMs for computational tasks such as text classification and contextual prediction, during which the risk of perpetuating biases cannot be overlooked. In a broader society where LLM-powered tools are widely accessible, interacting with biased models can shape public perceptions and behaviors, potentially reinforcing problematic social issues over time. This study investigates harmful representations in LLMs, focusing on ethnicity and gender in the Dutch context. Through template-based sentence construction and model probing, we identified potentially harmful representations using both automated and manual content analysis at the lexical and sentence levels, combining quantitative measurements with qualitative insights. Our findings have important ethical, legal, and political implications, challenging the acceptability of such harmful representations and emphasizing the need for effective mitigation strategies.
Warning: This paper contains examples of language that some people may find offensive or upsetting
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/3715275.3732065
Downloads
Permalink to this page
Back