An effective statistical approach to blog post opinion retrieval
| Authors |
|
|---|---|
| Publication date | 2008 |
| Host editors |
|
| Book title | CIKM 2008: ACM 17th Conference on Information and Knowledge Management: October 26-30, 2008, Napa Valley, California |
| ISBN |
|
| Event | ACM 17th Conference on Information and Knowledge Managment (CIKM 2008), Napa Valley, CA, USA |
| Pages (from-to) | 1063-1072 |
| Publisher | New York, NY: Association for Computing Machinery (ACM) |
| Organisations |
|
| Abstract |
Finding opinionated blog posts is still an open problem in information retrieval, as exemplified by the recent TREC blog tracks. Most of the current solutions involve the use of external resources and manual efforts in identifying subjective features. In this paper, we propose a novel and effective dictionary-based statistical approach, which automatically derives evidence for subjectivity from the blog collection itself, without requiring any manual effort. Our experiments show that the proposed approach is capable of achieving remarkable and statistically significant improvements over robust baselines, including the best TREC baseline run. In addition, with relatively little computational costs, our proposed approach provides an effective performance in retrieving opinionated blog posts, which is as good as a computationally expensive approach using Natural Language Processing techniques.
|
| Document type | Conference contribution |
| Published at | http://doi.acm.org/10.1145/1458082.1458223 |
| Permalink to this page | |