Towards an active learning system for company name disambiguation in microblog streams

Open Access
Authors
Publication date 2013
Host editors
  • P. Forner
  • R. Navigli
  • D. Tufis
  • N. Ferro
Book title Working Notes for CLEF 2013 Conference
Book subtitle Valencia, Spain, September 23-26, 2013
Series CEUR Workshop Proceedings
Event CLEF 2013 Conference
Number of pages 8
Publisher Aachen: CEUR-WS
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract In this paper we describe the collaborative participation of UvA & UNED at RepLab 2013. We propose an active learning approach for the filtering subtask, using features based on the detected semantics in the tweet (using Entity Linking with Wikipedia), as well as tweet-inherent features such as hashtags and usernames. The tweets manually inspected during the active learning process is at most 1% of the test data. While our baseline does not perform well, we can see that active learning does improve the results.
Document type Conference contribution
Language English
Published at http://ceur-ws.org/Vol-1179/CLEF2013wn-RepLab-PeetzEt2013.pdf
Other links http://ceur-ws.org/Vol-1179/
Downloads
CLEF2013wn-RepLab-PeetzEt2013 (Final published version)
Permalink to this page
Back