A signal-to-noise approach to score normalization
| Authors | |
|---|---|
| Publication date | 2009 |
| Host editors |
|
| Book title | Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM 2009) |
| ISBN |
|
| Event | 18th ACM Conference on Information and Knowledge Management (CIKM 2009), Hong Kong, China |
| Pages (from-to) | 797-806 |
| Publisher | New York: ACM Press |
| Organisations |
|
| Abstract |
Score normalization is indispensable in distributed retrieval and fusion or meta-search where merging of result-lists is required. Distributional approaches to score normalization with reference to relevance, such as binary mixture models like the normal-exponential, suffer from lack of universality and troublesome parameter estimation especially under sparse relevance. We develop a new approach which tackles both problems by using aggregate score distributions without reference to relevance, and is suitable for uncooperative engines. The method is based on the assumption that scores produced by engines consist of a signal and a noise component which can both be approximated by submitting well-defined sets of artificial queries to each engine. We evaluate in a standard distributed retrieval testbed and show that the signal-to-noise approach yields better results than other distributional methods. As a significant by-product, we investigate query-length distributions.
|
| Document type | Conference contribution |
| Published at | http://doi.acm.org/10.1145/1645953.1646055 |
| Permalink to this page | |
