Incorporating hierarchical domain information to disambiguate very short queries

Authors
Publication date 2019
Book title ICTIR'19
Book subtitle proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval : October 2-5, 2019, Santa Clara, CA, USA
ISBN (electronic)
  • 9781450368810
Event 9th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2019
Pages (from-to) 51-54
Number of pages 4
Publisher New York, NY: The Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Institute for Biodiversity and Ecosystem Dynamics (IBED)
Abstract

Users often express their information needs using incomplete or ambiguous queries of only one or two terms in length, particularly in theWeb environments. The ambiguity of short queries is a recognized problem for information retrieval (IR) systems. In this study, we investigate various approaches for incorporating hierarchical domain information into IR models such that the domain specification resolves the ambiguity. To this end, we develop practical models for constructing evaluation datasets from existing corpora. In terms of effectiveness, we further study the trade-off between a short query and its domain specification information. In doing so, we find that domains with the highest number of relevant documents are not always the best ones to select. We also evaluate the utility of a domain hierarchy and find that incorporating the hierarchical structure of a collection into the retrieval model could have a high impact on short query disambiguation.

Document type Conference contribution
Language English
Published at https://doi.org/10.1145/3341981.3344251
Other links https://www.scopus.com/pages/publications/85074240028
Permalink to this page
Back