Combining multiple signals for semanticizing tweets: University of Amsterdam at #Microposts2015

Open Access
Authors
Publication date 2015
Host editors
  • R. Rowe
  • M. Stankovic
  • A.-S. Dadzie
Book title Proceedings of the 5th Workshop on Making Sense of Microposts
Book subtitle co-located with the 24th International World Wide Web Conference (WWW 2015): Florence, Italy, May 18th, 2015
Series CEUR Workshop Proceedings
Event 5th Workshop on Making Sense of Microposts
Pages (from-to) 59-60
Publisher Aachen: CEUR-WS
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In this paper we present an approach for extracting and linking entities from short and noisy microblog posts. We describe a diverse set of approaches based on the Semanticizer, an open-source entity linking framework developed at the University of Amsterdam, adapted to the task of the #Microposts2015 challenge. We consider alternatives for dealing with ambiguity that can help in the named entity extraction and linking processes. We retrieve entity candidates from multiple sources and process them in a four-step pipeline. Results show that we correctly manage to identify entity mentions (our best run attains an F1 score of 0.809 in terms of the strong mention match metric), but subsequent steps prove to be more challenging for our approach.
Document type Conference contribution
Language English
Published at http://ceur-ws.org/Vol-1395/paper_17.pdf
Other links http://ceur-ws.org/Vol-1395/
Downloads
paper_17-2 (Final published version)