Incorporating hierarchical domain information to disambiguate very short queries
| Authors | |
|---|---|
| Publication date | 2019 |
| Book title | ICTIR'19 |
| Book subtitle | proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval : October 2-5, 2019, Santa Clara, CA, USA |
| ISBN (electronic) | |
| Event | 9th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2019 |
| Pages (from-to) | 51-54 |
| Number of pages | 4 |
| Publisher | New York, NY: The Association for Computing Machinery |
| Organisations | |
| Abstract | Users often express their information needs using incomplete or ambiguous queries of only one or two terms in length, particularly in Web environments. The ambiguity of short queries is a recognized problem for information retrieval (IR) systems. In this study, we investigate various approaches for incorporating hierarchical domain information into IR models such that the domain specification resolves the ambiguity. To this end, we develop practical models for constructing evaluation datasets from existing corpora. In terms of effectiveness, we further study the trade-off between a short query and its domain specification information. In doing so, we find that domains with the highest number of relevant documents are not always the best ones to select. We also evaluate the utility of a domain hierarchy and find that incorporating the hierarchical structure of a collection into the retrieval model could have a high impact on short query disambiguation. |
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1145/3341981.3344251 |
| Other links | https://www.scopus.com/pages/publications/85074240028 |
