Result disambiguation in web people search

Authors
Publication date 2012
Host editors
  • R. Baeza-Yates
  • A.P. de Vries
  • H. Zaragoza
  • B.B. Cambazoglu
  • V. Murdock
  • R. Lempel
  • F. Silvestri
Book title Advances in Information Retrieval
Book subtitle 34th European Conference on IR Research, ECIR 2012, Barcelona, Spain, April 1-5, 2012: proceedings
ISBN
  • 9783642289965
ISBN (electronic)
  • 9783642289972
Series Lecture Notes in Computer Science
Event 34th European Conference on Information Retrieval (ECIR 2012)
Pages (from-to) 146-157
Publisher Heidelberg: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We study the problem of disambiguating the results of a web people search engine: given a query consisting of a person name plus the result pages for this query, find correct referents for all mentions by clustering the pages according to the different people sharing the name. While the problem has been studied extensively, we discover that the increasing availability of results retrieved from social media platforms causes state-of-the-art methods to break down. We analyze the problem and propose a dual strategy where we distinguish between results obtained from social media platforms and those obtained from other sources. In our dual strategy, the two types of documents are disambiguated separately, using different strategies, and their results are then merged. We study several instantiations for the different stages in our proposed strategy and manage to achieve state-of-the-art performance.
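The dual strategy described in the abstract can be pictured roughly as follows. This is only a minimal sketch under stated assumptions, not the authors' method: the host list `SOCIAL_HOSTS`, the Jaccard token overlap, the single-pass clustering, the merge rule, and both thresholds are hypothetical choices made for illustration. The record does not say which instantiations the paper uses.

```python
from urllib.parse import urlparse

# Hypothetical list of social media hosts; not taken from the paper.
SOCIAL_HOSTS = {"facebook.com", "twitter.com", "linkedin.com"}


def is_social(url):
    """True if the URL's host is a listed platform or one of its subdomains."""
    host = (urlparse(url).hostname or "").lower()
    return any(host == h or host.endswith("." + h) for h in SOCIAL_HOSTS)


def jaccard(a, b):
    """Jaccard similarity between two token sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def greedy_cluster(pages, threshold):
    """Single-pass clustering: a page joins the first cluster whose pooled
    tokens are similar enough, otherwise it starts a new cluster."""
    clusters = []  # each cluster: {"urls": [...], "tokens": set(...)}
    for url, text in pages:
        tokens = set(text.lower().split())
        for cluster in clusters:
            if jaccard(tokens, cluster["tokens"]) >= threshold:
                cluster["urls"].append(url)
                cluster["tokens"] |= tokens
                break
        else:
            clusters.append({"urls": [url], "tokens": tokens})
    return clusters


def disambiguate(pages, threshold=0.3, merge_threshold=0.3):
    """Split pages by source type, handle each type separately, then merge.

    pages is a list of (url, text) pairs returned for one person-name query.
    """
    social = [p for p in pages if is_social(p[0])]
    other = [p for p in pages if not is_social(p[0])]

    # Non-social pages: content-based clustering (assumed strategy).
    other_clusters = greedy_cluster(other, threshold)

    # Social pages: each profile is treated as its own candidate cluster
    # (assumed strategy), then attached to the best non-social cluster if
    # it is similar enough; otherwise it stays a separate person.
    unmatched = []
    for url, text in social:
        tokens = set(text.lower().split())
        best = max(other_clusters,
                   key=lambda c: jaccard(tokens, c["tokens"]),
                   default=None)
        if best is not None and jaccard(tokens, best["tokens"]) >= merge_threshold:
            best["urls"].append(url)
        else:
            unmatched.append({"urls": [url], "tokens": tokens})

    return [c["urls"] for c in other_clusters + unmatched]
```

Calling `disambiguate` with a list of `(url, text)` pairs returns one list of URLs per inferred person. Keeping the split, the two clustering steps, and the merge as separate stages mirrors the abstract's point that each stage can be instantiated in several ways.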
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-642-28997-2_13