Evaluating Large Language Models on Lithuanian Grammatical Cases
| Authors |
|
|---|---|
| Publication date | 2026 |
| Host editors |
|
| Book title | The Second Workshop on Language Models for Low-Resource Languages : proceedings of the workshop |
| Book subtitle | LoResLM 2026 : March 29, 2026 |
| ISBN (electronic) |
|
| Event | 2nd Workshop on Language Models for Low-Resource Languages |
| Pages (from-to) | 371-377 |
| Number of pages | 7 |
| Publisher | Kerrville, TX: Association for Computational Linguistics |
| Organisations |
|
| Abstract |
We present a systematic evaluation of large language models (LLMs) on Lithuanian grammatical case marking, a task that has received little prior attention. Lithuanian is a relatively low-resource language, with rich morphology and explicit marking. To enable fine-grained syntactic and morphological assessment, we introduce a novel dataset of 305 minimal sentence pairs contrasting correct and incorrect case usage. Our results show that case marking is challenging for current models, with overall accuracy ranging from 0.662 to 0.852. A monolingual Lithuanian LLM consistently outperforms multilingual counterparts, highlighting the value of language-specific training over model size. Performance varies across cases: genitive and locative forms are generally better handled, while rarer constructions and subtle functional distinctions remain difficult. The dataset and analysis provide a resource for future work, supporting the development of more robust LLMs and targeted evaluation benchmarks for morphologically rich, low-resource languages.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.18653/v1/2026.loreslm-1.32 |
| Downloads |
2026.loreslm-1.32
(Final published version)
|
| Permalink to this page | |
