Using links to classify Wikipedia pages

Authors
Publication date 2009
Host editors
  • S. Geva
  • J. Kamps
  • A. Trotman
Book title Advances in Focused Retrieval
Book subtitle 7th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2008, Dagstuhl Castle, Germany, December 15-18, 2008 : revised and selected papers
ISBN
  • 9783642037603
ISBN (electronic)
  • 9783642037610
Series Lecture Notes in Computer Science
Event 7th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2008), Dagstuhl Castle, Germany
Pages (from-to) 432-435
Publisher Berlin: Springer
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract This paper contains a description of experiments for the 2008 INEX XML-mining track. Our goal for the XML-mining track is to explore whether we can use link information to improve classification accuracy. Our approach is to propagate category probabilities over linked pages. We find that using link information leads to marginal improvements over a baseline that uses a Naive Bayes model. For the initially misclassified pages, link information is either not available or contains too much noise.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-642-03761-0_44
Permalink to this page
Back