An evaluation of Named Entity Recognition tools for detecting person names in philosophical text
| Authors |
|
|---|---|
| Publication date | 2025 |
| Host editors |
|
| Book title | The 5th International Conference on Natural Language Processing for Digital Humanities : proceedings of the conference |
| Book subtitle | NLP4DH 2025 : May 3-4, 2025 |
| ISBN (electronic) |
|
| Event | 5th International Conference on Natural Language Processing for Digital Humanities |
| Pages (from-to) | 418-425 |
| Publisher | Stroudsburg, PA: Association for Computational Linguistics |
| Organisations |
|
| Abstract |
For philosophers, mentions of the names of other philosophers and scientists are an important indicator of relevance and influence. However, they don’t always come in neat citations, especially in older works. We evaluate various approaches to named entity recognition for person names in 20th century, English-language philosophical texts. We use part of a digitized corpus of the works of W.V. Quine, manually annotated for person names, to compare the performance of several systems: the rule-based edhiphy, spaCy’s CNN-based system, FLAIR’s BiLSTM-based system, and SpanBERT, ERNIE-v2 and ModernBERT’s transformer-based approaches. We also experiment with enhancing the smaller models with domain-specific embedding vectors. We find that both spaCy and FLAIR outperform transformer-based models, perhaps due to the small dataset sizes involved.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.18653/v1/2025.nlp4dh-1.36 |
| Downloads |
2025.nlp4dh-1.36
(Final published version)
|
| Permalink to this page | |
