Author Profiling for Abuse Detection

Open Access
Authors
Publication date 2018
Host editors
  • E.M. Bender
  • L. Derczynski
  • P. Isabelle
Book title The 27th International Conference on Computational Linguistics
Book subtitle COLING 2018 : proceedings of the conference : August 20-26, 2018, Santa Fe, New Mexico, USA
ISBN (electronic)
  • 9781948087506
Event 27th International Conference on Computational Linguistics
Pages (from-to) 1088–1098
Publisher Association for Computational Linguistics
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
The rapid growth of social media in recent years has fed into some highly undesirable phenomena such as proliferation of hateful and offensive language on the Internet. Previous research suggests that such abusive content tends to come from users who share a set of common stereotypes and form communities around them. The current state-of-the-art approaches to abuse detection are oblivious to user and community information and rely entirely on textual (i.e., lexical and semantic) cues. In this paper, we propose a novel approach to this problem that incorporates community-based profiling features of Twitter users. Experimenting with a dataset of 16k tweets, we show that our methods significantly outperform the current state of the art in abuse detection. Further, we conduct a qualitative analysis of model characteristics. We release our code, pre-trained models and all the resources used in the public domain.
Document type Conference contribution
Language English
Published at https://aclweb.org/anthology/C18-1093/
Downloads
C18-1093 (Final published version)
Permalink to this page
Back