Adding semantics to microblog posts

Authors
Publication date 2012
Book title Proceedings of the 5th ACM International Conference on Web Search and Data Mining
ISBN
  • 9781450307475
Event ACM International Conference on Web Search and Data Mining ; 5 (Seattle, Wash.) : 2012.02.08-12
Pages (from-to) 563-572
Publisher New York, NY, USA: ACM
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Microblogs have become an important source of information for the purpose of marketing, intelligence, and reputation management. Streams of microblogs are of great value because of their direct and real-time nature. Determining what an individual microblog post is about, however, can be non-trivial because of creative language usage, the highly contextualized and informal nature of microblog posts, and the limited length of this form of communication. We propose a solution to the problem of determining what a microblog post is about through semantic linking: we add semantics to a post by automatically identifying concepts that are semantically related to it and generating links to the corresponding Wikipedia articles. The identified concepts can subsequently be used for, e.g., social media mining, thereby reducing the need for manual inspection and selection. Using a purpose-built test collection of tweets, we show that recently proposed approaches for semantic linking do not perform well, mainly due to the idiosyncratic nature of microblog posts. We propose a novel method based on machine learning with a set of innovative features and show that it is able to achieve significant improvements over all other methods, especially in terms of precision.
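To make the idea of semantic linking concrete, the following is a minimal sketch of one simple way to map a post to Wikipedia articles: generate word n-grams, look them up in a dictionary of anchor texts, and keep the targets that a given n-gram links to most often. It is an illustration only and not the machine-learning method the abstract proposes. The dictionary `ANCHORS`, its counts, the function names, and the `min_commonness` threshold are all hypothetical.

```python
from typing import Dict, Iterator, List, Tuple

# Hypothetical toy dictionary: anchor text -> {Wikipedia title: link count}.
# A real system would derive such counts from a Wikipedia dump.
ANCHORS: Dict[str, Dict[str, int]] = {
    "seattle": {"Seattle": 95, "Seattle Seahawks": 5},
    "data mining": {"Data mining": 40},
    "mining": {"Mining": 30, "Data mining": 10},
}


def ngrams(tokens: List[str], max_n: int = 3) -> Iterator[str]:
    """Yield every word n-gram of length 1..max_n as a space-joined string."""
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            yield " ".join(tokens[i:i + n])


def link_post(post: str, min_commonness: float = 0.5) -> List[Tuple[str, float]]:
    """Return (Wikipedia URL, score) pairs for concepts matched in the post.

    The score is the fraction of times an n-gram, used as anchor text,
    points at a given article. Targets below min_commonness are dropped.
    """
    tokens = post.lower().split()  # naive whitespace tokenization
    best: Dict[str, float] = {}
    for gram in ngrams(tokens):
        targets = ANCHORS.get(gram)
        if not targets:
            continue
        total = sum(targets.values())
        for title, count in targets.items():
            score = count / total
            if score >= min_commonness and score > best.get(title, 0.0):
                best[title] = score
    # Highest score first; ties broken alphabetically by title.
    ranked = sorted(best.items(), key=lambda kv: (-kv[1], kv[0]))
    return [
        ("https://en.wikipedia.org/wiki/" + title.replace(" ", "_"), score)
        for title, score in ranked
    ]


if __name__ == "__main__":
    for url, score in link_post("Talking data mining in Seattle"):
        print(f"{score:.2f}  {url}")
```

A dictionary lookup like this ignores punctuation, hashtags, and the surrounding context of a post, which is the kind of limitation the abstract attributes to the idiosyncratic nature of microblog text.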
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/2124295.2124364