Evaluating Sequential Recommendations in the Wild

A. Klimashevskaia; S. Alvsvåg; C. Trattner; A.D. Starke; A. Tessem; D. Jannach

doi:https://doi.org/10.1007/978-3-031-88711-6_5

Evaluating Sequential Recommendations in the Wild A Case Study on Offline Accuracy, Click Rates, and Consumption

Authors	A. Klimashevskaia S. Alvsvåg C. Trattner A.D. Starke A. Tessem D. Jannach
Publication date	2025
Host editors	C. Hauff C. Macdonald D. Jannach G. Kazai F.M. Nardini F. Pinelli F. Silvestri N. Tonellotto
Book title	Advances in Information Retrieval
Book subtitle	47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6–10, 2025 : proceedings
ISBN	9783031887109
ISBN (electronic)	9783031887116
Series	Lecture Notes in Computer Science
Event	47th European Conference on Information Retrieval, ECIR 2025
Volume \| Issue number	II
Pages (from-to)	72-87
Publisher	Cham: Springer
Organisations	Faculty of Social and Behavioural Sciences (FMG) - Amsterdam School of Communication Research (ASCoR)
Abstract	Sequential recommendation problems have received increased research interest in recent years. Our knowledge about the effectiveness of sequential algorithms in practice is however limited. In this paper, we report on the outcomes of an A/B test on a video and movie streaming platform, where we benchmarked a sequential model against a non-sequential, personalized recommendation model, as well as a popularity-based baseline. Contrary to what we had expected from a preceding offline experiment, we observed that the popularity-based and the non-sequential models led to the highest click-through rates. However, in terms of the adoption of the recommendations, the sequential model was the most successful one in terms of viewing times. While our work points out the effectiveness of sequential models in practice, it also reminds us about important open challenges regarding (a) the sometimes limited predictive power of classic offline evaluations and (b) the dangers of optimizing recommendation models for click-through-rates.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1007/978-3-031-88711-6_5
Other links	https://www.scopus.com/pages/publications/105006641545
Downloads	Evaluating Sequential Recommendations in the Wild (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Evaluating Sequential Recommendations in the Wild A Case Study on Offline Accuracy, Click Rates, and Consumption