Unsilencing colonial archives via automated entity recognition

Open Access
Authors
Publication date 03-09-2024
Journal Journal of Documentation
Volume | Issue number 80 | 5
Pages (from-to) 1080-1105
Organisations
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam School for Heritage, Memory and Material Culture (AHM)
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
Purpose
This paper aims to expand the scope and mitigate the biases of extant archival indexes.

Design/methodology/approach
The authors use automatic entity recognition on the archives of the Dutch East India Company to extract mentions of underrepresented people.

Findings
The authors release an annotated corpus and baselines for a shared task and show that the proposed goal is feasible.

Originality/value
Colonial archives are increasingly a focus of attention for historians and the public, broadening access to them is a pressing need for archives.
Document type Article
Language English
Related dataset Unsilencing Colonial Archives via Automated Entity Recognition Unsilencing Dutch Colonial Archives
Published at https://doi.org/10.48550/arXiv.2210.02194 https://doi.org/10.1108/JD-02-2022-0038
Downloads
10-1108_JD-02-2022-0038 (Final published version)
Permalink to this page
Back