Probabilistic Multileave for Online Retrieval Evaluation

Authors
  • A. Schuth
  • R.-J. Bruintjes
  • F. Büttner
  • J. van Doorn
Publication date 2015
Book title SIGIR 2015
Book subtitle proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval: August 9-13, 2015, Santiago, Chile
ISBN
  • 9781450336215
Event 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2015
Pages (from-to) 955-958
Publisher New York, NY: Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Online evaluation methods for information retrieval use implicit signals such as clicks from users to infer preferences between rankers. A highly sensitive way of inferring these preferences is through interleaved comparisons. Recently, interleaved comparisons methods that allow for simultaneous evaluation of more than two rankers have been introduced. These so-called multileaving methods are even more sensitive than their interleaving counterparts. Probabilistic interleaving--whose main selling point is the potential for reuse of historical data--has no multileaving counterpart yet. We propose probabilistic multileave and empirically show that it is highly sensitive and unbiased. An important implication of this result is that historical interactions with multileaved comparisons can be reused, allowing for ranker comparisons that need much less user interaction data. Furthermore, we show that our method, as opposed to earlier sensitive multileaving methods, scales well when the number of rankers increases.
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/2766462.2767838
Permalink to this page
Back