REVEAL 2020: Bandit and Reinforcement Learning from User Interactions

Joachims, Thorsten; Raimond, Yves; Koch, Olivier; Dimakopoulou, Maria; Vasile, Flavian; Swaminathan, Adith

doi:10.1145/3383313.3411536

Citation Details

REVEAL 2020: Bandit and Reinforcement Learning from User Interactions

The REVEAL workshop1 focuses on framing the recommendation problem as a one of making personalized interventions, e.g. deciding to recommend a particular item to a particular user. Moreover, these interventions sometimes depend on each other, where a stream of interactions occurs between the user and the system, and where each decision to recommend something will have an impact on future steps and long-term rewards. This framing creates a number of challenges we will discuss at the workshop. How can recommender systems be evaluated offline in such a context? How can we learn recommendation policies that are aware of these delayed consequences and outcomes? more »

Award ID(s):: 1901168

PAR ID:: 10309946

Author(s) / Creator(s):: Joachims, Thorsten; Raimond, Yves; Koch, Olivier; Dimakopoulou, Maria; Vasile, Flavian; Swaminathan, Adith

Date Published:: 2020-09-01

Journal Name:: ACM Conference on Recommender Systems

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3383313.3411536

More Like this