Finding key bloggers, one post at a time
| Authors | |
|---|---|
| Publication date | 2008 |
| Journal | Frontiers in Artificial Intelligence and Applications |
| Event | 18th European Conference on Artificial Intelligence (ECAI 2008), Patras, Greece |
| Volume | Issue number | 178 |
| Pages (from-to) | 318-322 |
| Organisations |
|
| Abstract |
User generated content in general, and blogs in particular, form an interesting and relatively little explored domain for mining knowledge. We address the task of blog distillation: to find blogs that are principally devoted to a given topic, as opposed to blogs that merely happen to discuss the topic in passing. Working in the setting of statistical language modeling, we model the task by aggregating a blogger's blog posts to collect evidence of relevance to the topic and persistence of interest in the topic. This approach achieves state-of-the-art performance. On top of this baseline, we extend our model by incorporating a number of blog-specific features, concerning document structure, social structure, and temporal structure. These blog-specific features yield further improvements.
|
| Document type | Article |
| Note | Proceedings title: ECAI 2008: 18th European Conference on Artificial Intelligence, July 21-25, 2008, Patras, Greece: Including Prestigious Applications of Intelligent Systems (PAIS 2008): Proceedings Publisher: IOS Press Place of publication: Amsterdam ISBN: 978-1-58603-891-5 Editors: M. Ghallab, C.D. Spyropoulos, N. Fakotakis, N. Avouris |
| Published at | http://staff.science.uva.nl/~mdr/Publications/Files/ecai2008.pdf |
| Downloads | |
| Permalink to this page | |
