Offline Evaluation for Reinforcement Learning-based Recommendation - A Critical Issue and Some Alternatives
We discussed offline evaluation for RL-based recommender systems and noted that the most common evaluation protocol, i.e., next-item prediction, is unsuited to such approaches.