Using temporal bursts for query modeling
| Authors |
|
|---|---|
| Publication date | 2014 |
| Journal | Information Retrieval |
| Volume | Issue number | 17 | 1 |
| Pages (from-to) | 74-108 |
| Organisations |
|
| Abstract |
We present an approach to query modeling that leverages the temporal distribution of documents in an initially retrieved set of documents. In news-related document collections such distributions tend to exhibit bursts. Here, we define a burst to be a time period where unusually many documents are published. In our approach we detect bursts in result lists returned for a query. We then model the term distributions of the bursts using a reduced result list and select its most descriptive terms. Finally, we merge the sets of terms obtained in this manner so as to arrive at a reformulation of the original query. For query sets that consist of both temporal and non-temporal queries, our query modeling approach incorporates an effective selection method of terms. We consistently and significantly improve over various baselines, such as relevance models, on both news collections and a collection of blog posts.
|
| Document type | Article |
| Language | English |
| Published at | https://doi.org/10.1007/s10791-013-9227-2 |
| Permalink to this page | |
