Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives

Authors
Publication date 12-2022
Journal SIGIR Forum
Article number 3
Volume | Issue number 56 | 2
Number of pages 14
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In this paper, we argue that the paradigm commonly adopted for offline evaluation of sequential recommender systems is unsuitable for evaluating reinforcement learning-based recommenders. We find that most of the existing offline evaluation practices for reinforcement learning-based recommendation are based on a next-item prediction protocol, and detail three shortcomings of such an evaluation protocol. Notably, it cannot reflect the potential benefits that reinforcement learning (RL) is expected to bring while it hides critical deficiencies of certain offline RL agents. Our suggestions for alternative ways to evaluate RL-based recommender systems aim to shed light on the existing possibilities and inspire future research on reliable evaluation protocols.
Document type Article
Language English
Published at https://doi.org/10.1145/3582900.3582905
Permalink to this page
Back