Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives

Authors	R. Deffayet T. Thonet J.-M. Renders M. de Rijke
Publication date	12-2022
Journal	SIGIR Forum
Article number	3
Volume \| Issue number	56 \| 2
Number of pages	14
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	In this paper, we argue that the paradigm commonly adopted for offline evaluation of sequential recommender systems is unsuitable for evaluating reinforcement learning-based recommenders. We find that most of the existing offline evaluation practices for reinforcement learning-based recommendation are based on a next-item prediction protocol, and detail three shortcomings of such an evaluation protocol. Notably, it cannot reflect the potential benefits that reinforcement learning (RL) is expected to bring while it hides critical deficiencies of certain offline RL agents. Our suggestions for alternative ways to evaluate RL-based recommender systems aim to shed light on the existing possibilities and inspire future research on reliable evaluation protocols.
Document type	Article
Language	English
Published at	https://doi.org/10.1145/3582900.3582905 (Final published version)
Permalink to this page

Back

UvA-DARE