Web directories as topical context

Authors
Publication date 2009
Host editors
  • R. Aly
  • C. Hauff
  • I. den Hamer
  • D. Hiemstra
  • T. Huibers
  • F. de Jong
Book title Proceedings of the 9th Dutch-Belgian Information Retrieval Workshop (DIR 2009): February 2-3, 2009, Enschede, the Netherlands
Event 9th Dutch-Belgian Information Retrieval Workshop (DIR 2009), Enschede, The Netherlands
Pages (from-to) 71-78
Publisher Enschede: University of Twente, Centre for Telematics and Information Technology (CTIT)
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
In this paper we explore whether the Open Directory (or DMOZ) can be used to classify queries into topical categories on different levels and whether we can use this topical context to improve retrieval performance. We have set up a user study to let test persons explicitly classify queries into topical categories. Categories are either chosen freely from DMOZ, or from a list of suggestions created by several automatic topic categorization techniques. The results of this user study show that DMOZ categories are suitable for topic categorization. Either free search or evaluation of a list of suggestions can be used to elicit the topical context. Free search leads to more specific topic categories than the list of suggestions. Different test persons show moderate agreement between their individual judgments, but broadly agree on the initial levels of the chosen categories. When we use the topic categories selected by the free search as topical context, this leads to significant improvements over the baseline retrieval results. The more general topic categories selected from the suggestions list, and top level categories do not lead to significant improvements.
Document type Conference contribution
Published at http://dir2009.cs.utwente.nl/dir2009proceedings.pdf
Permalink to this page
Back