An effective statistical approach to blog post opinion retrieval

Authors
  • B. He
  • C. Macdonald
  • J. He
  • I. Ounis
Publication date 2008
Host editors
  • J.G. Shanahan
  • S. Amer-Yahia
  • Y. Zhang
  • A. Kołcz
  • A. Chowdury
  • D. Kelly
Book title CIKM 2008: ACM 17th Conference on Information and Knowledge Management: October 26-30, 2008, Napa Valley, California
ISBN
  • 9781595939913
Event ACM 17th Conference on Information and Knowledge Managment (CIKM 2008), Napa Valley, CA, USA
Pages (from-to) 1063-1072
Publisher New York, NY: Association for Computing Machinery (ACM)
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Finding opinionated blog posts is still an open problem in information retrieval, as exemplified by the recent TREC blog tracks. Most of the current solutions involve the use of external resources and manual efforts in identifying subjective features. In this paper, we propose a novel and effective dictionary-based statistical approach, which automatically derives evidence for subjectivity from the blog collection itself, without requiring any manual effort. Our experiments show that the proposed approach is capable of achieving remarkable and statistically significant improvements over robust baselines, including the best TREC baseline run. In addition, with relatively little computational costs, our proposed approach provides an effective performance in retrieving opinionated blog posts, which is as good as a computationally expensive approach using Natural Language Processing techniques.
Document type Conference contribution
Published at http://doi.acm.org/10.1145/1458082.1458223
Permalink to this page
Back