Reproducing and Extending Causal Insights Into Term Frequency Computation in Neural Rankers
| Authors |
|
|---|---|
| Publication date | 2025 |
| Book title | SIGIR-AP 2025 |
| Book subtitle | Proceedings of the 2025 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region : December 7-10, 2025, Xi'an, China |
| ISBN (electronic) |
|
| Event | 3rd International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, SIGIR-AP 2025 |
| Pages (from-to) | 189-198 |
| Number of pages | 10 |
| Publisher | New York, NY: Association for Computing Machinery |
| Organisations |
|
| Abstract |
Neural ranking models have shown outstanding performance across a
variety of tasks, such as document retrieval, re-ranking, question
answering and conversational retrieval. However, the inner decision
process of these models remains largely unclear, especially as models
increase in size. Most interpretability approaches, such as probing,
focus on correlational insights rather than establishing causal
relationships. The paper 'Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models'
by Chen et al. addresses this gap by introducing a framework for
activation patching - a causal interpretability method - in the
information retrieval domain, offering insights into how neural
retrieval models compute document relevance. The study demonstrates that
neural ranking models not only capture term-frequency information, but
also that these representations can be localized to specific components
of the model, such as individual attention heads or layers. This paper
aims to reproduce the findings by Chen et al. and to further explore the
presence of pre-defined retrieval axioms in neural IR models. We
validate the main claims made by Chen et al., and extend the framework
to include an additional term-frequency axiom, which states that the
impact of increasing query term frequency on document ranking diminishes
as the frequency becomes higher. We successfully identify a group of
attention heads that encode this axiom and analyze their behavior to
give insight into the inner decision-making process of neural ranking
models.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1145/3767695.3769507 |
| Other links | https://www.scopus.com/pages/publications/105026317582 |
| Downloads |
3767695.3769507
(Final published version)
|
| Permalink to this page | |