Towards an active learning system for company name disambiguation in microblog streams
| Authors |
|
|---|---|
| Publication date | 2013 |
| Host editors |
|
| Book title | Working Notes for CLEF 2013 Conference |
| Book subtitle | Valencia, Spain, September 23-26, 2013 |
| Series | CEUR Workshop Proceedings |
| Event | CLEF 2013 Conference |
| Number of pages | 8 |
| Publisher | Aachen: CEUR-WS |
| Organisations |
|
| Abstract | In this paper we describe the collaborative participation of UvA & UNED at RepLab 2013. We propose an active learning approach for the filtering subtask, using features based on the detected semantics in the tweet (using Entity Linking with Wikipedia), as well as tweet-inherent features such as hashtags and usernames. The tweets manually inspected during the active learning process is at most 1% of the test data. While our baseline does not perform well, we can see that active learning does improve the results. |
| Document type | Conference contribution |
| Language | English |
| Published at | http://ceur-ws.org/Vol-1179/CLEF2013wn-RepLab-PeetzEt2013.pdf |
| Other links | http://ceur-ws.org/Vol-1179/ |
| Downloads |
CLEF2013wn-RepLab-PeetzEt2013
(Final published version)
|
| Permalink to this page | |
