Examining a hate speech corpus for hate speech detection and popularity prediction

Open Access
Authors
Publication date 2018
Host editors
  • A. Branco
  • N. Calzolari
  • K. Choukri
Book title 4REAL 2018 : Workshop on Replicability and Reproducibility of Research Results in Science and Technology of Language
Book subtitle LREC 2018 Workshop : proceedings
ISBN (electronic)
  • 9791095546214
Event Workshop on Replicability and Reproducibility of Research Results in Science and Technology of Language
Pages (from-to) 16-23
Publisher Paris: European Language Resources Association (ELRA)
Organisations
  • Faculty of Science (FNWI)
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
As research on hate speech becomes more and more relevant every day, most of it is still focused on hate speech detection. By attempting to replicate a hate speech detection experiment performed on an existing Twitter corpus annotated for hate speech, we highlight some issues that arise from doing research in the field of hate speech, which is essentially still in its infancy. We take a critical look at the training corpus in order to understand its biases, while also using it to venture beyond hate speech detection and investigate whether it can be used to shed light on other facets of research, such as popularity of hate tweets.
Document type Conference contribution
Language English
Published at https://arxiv.org/abs/1805.04661 http://lrec-conf.org/workshops/lrec2018/W25/summaries/2_W25.html
Other links http://lrec-conf.org/workshops/lrec2018/W25/index.html
Downloads
Examining a hate speech corpus (Final published version)
Permalink to this page
Back