Multilinguality and multiculturalism Towards effective and inclusive neural language models

Open Access
Authors
Supervisors
Cosupervisors
Award date 22-01-2025
ISBN
  • 9789464736526
Series ILLC Dissertation series , DS-2024-14
Number of pages 230
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
  • Faculty of Science (FNWI)
Abstract
Language models require vast amounts of data during training. This limits their use to languages for which such data requirements can be met. To extend access to language technology to more linguistic communities, researchers have developed multilingual language models (MLMs) that are trained on data from multiple languages. The idea is that languages can support each other as they share common patterns, making the models useful across more languages.
However, this approach brings new challenges from both a technical and social perspective. When a model is trained on multiple languages, these languages start competing for limited model capacity, which can lead to negative interference and reduce effectiveness. In addition, to deploy MLMs in culturally-diverse communities, their output needs to be sensitive to the sociocultural norms and biases of those communities. This necessitates MLMs to become inherently multicultural as well.
In this thesis, we investigate how to build more effective MLMs that mitigate negative cross-language interference and study the effect that multilingual training has on the social biases and cultural values that they encode.
Document type PhD thesis
Language English
Downloads
Permalink to this page
cover
Back