The impact of semantic document expansion on cluster-based fusion for microblog search

Authors
Publication date 2014
Host editors
  • M. de Rijke
  • T. Kenter
  • A.P. de Vries
  • C.X. Zhai
  • F. de Jong
  • K. Radinsky
  • K. Hofmann
Book title Advances in Information Retrieval
Book subtitle 36th European Conference on IR Research, ECIR 2014, Amsterdam, The Netherlands, April 13-16, 2014: proceedings
ISBN
  • 9783319060279
ISBN (electronic)
  • 9783319060286
Series Lecture Notes in Computer Science
Event 36th European Conference on Information Retrieval (ECIR'14)
Pages (from-to) 493-499
Publisher Cham: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Searching microblog posts, with their limited length and creative language usage, is challenging. We frame the microblog search problem as a data fusion problem. We examine the effectiveness of a recent cluster-based fusion method on the task of retrieving microblog posts. We find that in the optimal setting the contribution of the clustering information is very limited, which we hypothesize to be due to the limited length of microblog posts. To increase the contribution of the clustering information in cluster-based fusion, we integrate semantic document expansion as a preprocessing step. We enrich the content of microblog posts appearing in the lists to be fused with Wikipedia articles, based on which clusters are created. We verify the effectiveness of our combined document expansion plus fusion method by comparing it with microblog search algorithms and other fusion methods.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-319-06028-6_47