Feeding the Second Screen: Semantic Linking based on Subtitles

Authors
Publication date 2013
Host editors
  • J. Ferreira
  • J. Magalhães
  • P. Calado
Book title OAIR 2013: Open Research Areas in Information Retrieval: 10th Conference on the RIAO series
ISBN
  • 9782905450098
Event Open research Areas in Information Retrieval (OAIR 2013)
Pages (from-to) 9-16
Publisher Paris: Centre de Hautes Etudes Internationales D'Informatique Documentaire
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Television is changing. Increasingly, broadcasts are consumed interactively. This allows broadcasters to provide consumers with additional background information that they may bookmark for later consumption. To support this type of functionality, we consider the task of linking a textual streams derived from live broadcasts to Wikipedia. While link generation has received considerable attention in recent years, our task has unique demands that require an approach that needs to (i) be high-precision oriented, (ii) perform in real-time, (iii) work in a streaming setting, and (iv) typically, with a very limited context. We propose a learning to rerank approach that significantly improves over a strong baseline in terms of effectiveness and whose processing time is very short. We extend this approach, leveraging the streaming nature of the textual sources that we link by modeling context as a graph. We show how our graph-based context model further improves effectiveness. For evaluation purposes we create a dataset of segments of television subtitles that we make available to the research community.
Document type Conference contribution
Language English
Published at http://dl.acm.org/citation.cfm?id=2491751
Permalink to this page
Back