An evaluation of Named Entity Recognition tools for detecting person names in philosophical text

Open Access
Authors
Publication date 2025
Host editors
  • M. Hämäläinen
  • E. Öhman
  • Y. Bizzoni
  • S. Miyagawa
  • K. Alnajjar
Book title The 5th International Conference on Natural Language Processing for Digital Humanities : proceedings of the conference
Book subtitle NLP4DH 2025 : May 3-4, 2025
ISBN (electronic)
  • 9798891762343
Event 5th International Conference on Natural Language Processing for Digital Humanities
Pages (from-to) 418-425
Publisher Stroudsburg, PA: Association for Computational Linguistics
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
For philosophers, mentions of the names of other philosophers and scientists are an important indicator of relevance and influence. However, they don’t always come in neat citations, especially in older works. We evaluate various approaches to named entity recognition for person names in 20th century, English-language philosophical texts. We use part of a digitized corpus of the works of W.V. Quine, manually annotated for person names, to compare the performance of several systems: the rule-based edhiphy, spaCy’s CNN-based system, FLAIR’s BiLSTM-based system, and SpanBERT, ERNIE-v2 and ModernBERT’s transformer-based approaches. We also experiment with enhancing the smaller models with domain-specific embedding vectors. We find that both spaCy and FLAIR outperform transformer-based models, perhaps due to the small dataset sizes involved.
Document type Conference contribution
Language English
Published at https://doi.org/10.18653/v1/2025.nlp4dh-1.36
Downloads
2025.nlp4dh-1.36 (Final published version)
Permalink to this page
Back