Automatic Animacy Classification for Latvian Nouns

Open Access
Authors
Publication date 2025
Host editors
  • Sudhansu Bala Das
  • Pruthwik Mishra
  • Alok Singh
  • Shamsuddeen Hassan Muhammad
  • Asif Ekbal
  • Uday Kumur Dasi
Book title Proceedings of the Workshop Beyond English: Natural Language Processing for all Languages in an Era of Large Language Models
Book subtitle associated with The 15th International Conference on Recent Advances in Natural Language Processing RANLP'2025 : GlobalNLP 2025 : 12 September, 2025, Varna, Bulgaria
ISBN (electronic)
  • 9789544521059
Event Workshop Beyond English: Natural Language Processing for all Languages in an Era of Large Language Models
Pages (from-to) 90-97
Number of pages 8
Publisher Shoumen: INCOMA Ltd.
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR)
Abstract
We introduce the first automatic animacy classifier for the Latvian language. Animacy, a linguistic feature indicating whether a noun refers to a living entity, plays an important role in Latvian grammatical structures and syntactic agreement, but remains unexplored in Latvian NLP. We adapt and extend existing methods to develop type-based animacy classifiers that distinguish between human and non-human nouns. Due to the limited utility of Latvian WordNet, the classifier’s training data was derived from the WordNets of Lithuanian, English, and Japanese. These lists were intersected and mapped to Latvian nouns from the Tēzaurs dictionary through automatic translation. The resulting dataset was used to train classifiers with fastText and LVBERT embeddings. Results show good performance from a MLP classifier using the last four layers of LVBERT, with Lithuanian data contributing more than English. This demonstrates a viable method for animacy classification in languages lacking robust lexical resources and shows potential for broader application in morphologically rich, under-resourced languages.
Document type Conference contribution
Language English
Published at https://doi.org/10.26615/978-954-452-105-9-011
Published at https://aclanthology.org/2025.globalnlp-1.11 https://acl-bg.org/proceedings/2025/GlobalNLP%202025/pdf/2025.globalnlp-1.11.pdf
Other links https://acl-bg.org/proceedings/2025/GlobalNLP%202025/index.html
Downloads
2025.globalnlp-1.11 (Final published version)
Permalink to this page
Back