On the reusability of open test collections

Open Access
Authors
Publication date 2015
Book title SIGIR 2015
Book subtitle proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval : August 9-13, 2015, Santiago, Chile
ISBN
  • 9781450336215
Event 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2015
Pages (from-to) 827-830
Number of pages 4
Publisher New York: Association for Computing Machinery
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

Creating test collections for modern search tasks is increasingly more challenging due to the growing scale and dynamic nature of content, and need for richer contextualization of the statements of request. To address these issues, the TREC Contextual Suggestion Track explored an open test collection, where participants were allowed to submit any web page as a result for a personalized venue recommendation task. This prompts the question on the reusability of the resulting test collection: How does the open nature affect the pooling process? Can participants reliably evaluate variant runs with the resulting qrels? Can other teams evaluate new runs reliably? In short, does the set of pooled and judged documents effectively produce a post hoc test collection? Our main findings are the following: First, while there is a strongly significant rank correlation, the effect of pooling is notable and results in underestimation of performance, implying the evaluation of non-pooled systems should be done with great care. Second, we extensively analyze impacts of open corpus on the fraction of judged documents, explaining how low recall affects the reusability, and how the personalization and low pooling depth aggravate that problem. Third, we outline a potential solution by deriving a fixed corpus from open web submissions.

Document type Conference contribution
Language English
Published at https://doi.org/10.1145/2766462.2767788
Other links https://www.scopus.com/pages/publications/84953776159
Downloads
p827-hashemi (Final published version)
Permalink to this page
Back