Evaluating Sequential Recommendations in the Wild A Case Study on Offline Accuracy, Click Rates, and Consumption
| Authors |
|
|---|---|
| Publication date | 2025 |
| Host editors |
|
| Book title | Advances in Information Retrieval |
| Book subtitle | 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6–10, 2025 : proceedings |
| ISBN |
|
| ISBN (electronic) |
|
| Series | Lecture Notes in Computer Science |
| Event | 47th European Conference on Information Retrieval, ECIR 2025 |
| Volume | Issue number | II |
| Pages (from-to) | 72-87 |
| Publisher | Cham: Springer |
| Organisations |
|
| Abstract |
Sequential recommendation problems have received increased research interest in recent years. Our knowledge about the effectiveness of sequential algorithms in practice is however limited. In this paper, we report on the outcomes of an A/B test on a video and movie streaming platform, where we benchmarked a sequential model against a non-sequential, personalized recommendation model, as well as a popularity-based baseline. Contrary to what we had expected from a preceding offline experiment, we observed that the popularity-based and the non-sequential models led to the highest click-through rates. However, in terms of the adoption of the recommendations, the sequential model was the most successful one in terms of viewing times. While our work points out the effectiveness of sequential models in practice, it also reminds us about important open challenges regarding (a) the sometimes limited predictive power of classic offline evaluations and (b) the dangers of optimizing recommendation models for click-through-rates.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1007/978-3-031-88711-6_5 |
| Other links | https://www.scopus.com/pages/publications/105006641545 |
| Downloads |
Evaluating Sequential Recommendations in the Wild
(Final published version)
|
| Permalink to this page | |
