Title: Supporting End Users in Defining Reinforcement-Learning Problems for Human-Robot Interactions (Extended Abstract)
Abstract: Reinforcement learning (RL) can help agents learn complex tasks that would be hard to specify using standard imperative programming. However, end users may have trouble personalizing their technology with RL due to a lack of technical expertise. Prior work has explored means of supporting end users after a problem for the RL agent to solve has been defined. Little work, however, has explored how to support end users while defining this problem. We propose a tool that provides structured support for end users defining problems for RL agents. Through this tool, users can (i) directly and indirectly specify the problem as a Markov decision process (MDP); (ii) receive automatic suggestions for MDP changes that would reduce training time and improve accuracy; and (iii) revise the MDP after training the agent to solve it. We believe this work will help reduce barriers to using RL and contribute to the existing literature on designing human-in-the-loop systems.
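As a rough, hypothetical illustration of what "directly specifying the problem as an MDP" might look like (this is not the proposed tool; every state, action, and reward name below is invented), a user-authored MDP for a toy human-robot greeting task could be a small structure of states, actions, transition probabilities, and rewards:

    # Hypothetical sketch only: a directly specified MDP for a toy
    # human-robot greeting task, not the interface from the abstract.
    from dataclasses import dataclass, field

    @dataclass
    class MDP:
        states: list
        actions: list
        transitions: dict = field(default_factory=dict)  # (state, action) -> {next_state: prob}
        rewards: dict = field(default_factory=dict)       # (state, action, next_state) -> reward
        gamma: float = 0.95                               # discount factor

    toy_mdp = MDP(
        states=["person_far", "person_near", "greeted"],
        actions=["wait", "approach", "greet"],
        transitions={
            ("person_far", "wait"): {"person_far": 0.8, "person_near": 0.2},
            ("person_far", "approach"): {"person_near": 1.0},
            ("person_near", "wait"): {"person_near": 0.7, "person_far": 0.3},
            ("person_near", "greet"): {"greeted": 1.0},
        },
        rewards={
            ("person_near", "greet", "greeted"): 1.0,        # desired outcome
            ("person_far", "approach", "person_near"): 0.1,  # small shaping reward
        },
    )

One could imagine the proposed suggestions operating over a structure like this, for example flagging states that are unreachable under the specified transitions before any training is run.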
Award ID(s):
1837120
PAR ID:
10387468
Author(s) / Creator(s):
Date Published:
Journal Name:
The 5th Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM)
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1.
    In this paper we explore what role humans might play in designing tools for reinforcement learning (RL) agents to interact with the world. Recent work has explored RL methods that optimize a robot's morphology while learning to control it, effectively dividing an RL agent's environment into the external world and the agent's interface with that world. Taking a user-centered design (UCD) approach, we explore the potential of a human, instead of an algorithm, redesigning the agent's tool. Using UCD to design for a machine learning agent raises several research questions, including what it means to understand an RL agent's experience, beliefs, tendencies, and goals. After discussing these questions, we present a system we developed to study humans designing a 2D racecar for an RL autonomous driver. We conclude with findings and insights from exploratory pilot studies in which twelve users worked with this system.
  2. Mitrovic, A. & Bosch, N. (Eds.)
    Working collaboratively in groups can positively impact performance and student engagement. Intelligent social agents can provide a source of personalized support for students, and their benefits likely extend to collaborative settings, but it is difficult to determine how these agents should interact with students. Reinforcement learning (RL) offers an opportunity to adapt the interactions between the social agent and the students to better support collaboration and learning. However, using RL in education with social agents typically involves training with real students. In this work, we train an RL agent in a high-quality simulated environment to learn how to improve students' collaboration. Data were collected during a pilot study with dyads of students who worked together to tutor an intelligent teachable robot. We describe the process of building an environment from these data and training a policy, and we examine the policy's impact on different students relative to several baselines.
  3. Pedagogical planners can provide adaptive support to students in narrative-centered learning environments by dynamically scaffolding student learning and tailoring problem scenarios. Reinforcement learning (RL) is frequently used for pedagogical planning in narrative-centered learning environments. However, RL-based pedagogical planning raises significant challenges due to the scarcity of data for training RL policies. Most prior work has relied on limited-size datasets and offline RL techniques for policy learning. Unfortunately, offline RL techniques do not support on-demand exploration and evaluation, which can adversely impact the quality of induced policies. To address data scarcity and the limitations of offline RL, we propose INSIGHT, an online RL framework for training data-driven pedagogical policies that optimize student learning in narrative-centered learning environments. The INSIGHT framework consists of three components: a narrative-centered learning environment simulator, a simulated student agent, and an RL-based pedagogical planner agent, which uses a reward metric associated with effective student learning processes. The framework enables the generation of synthetic data for on-demand exploration and evaluation of RL-based pedagogical planning. We have implemented INSIGHT with OpenAI Gym for a narrative-centered learning environment testbed with rule-based simulated student agents and a deep Q-learning-based pedagogical planner. Our results show that online deep RL algorithms can induce near-optimal pedagogical policies in the INSIGHT framework, while offline deep RL algorithms only find suboptimal policies even with large amounts of data. (A heavily simplified sketch of this simulator-plus-planner arrangement appears after this list.)
  4. Reinforcement learning (RL) has been employed to devise the best course of actions for defending critical infrastructure, such as power networks, against cyberattacks. Nonetheless, even for the smallest power grids, the RL action space grows exponentially, making efficient exploration by the RL agent practically unattainable. Current RL algorithms tailored to power grids are generally not suited to settings where the state-action space becomes large. We address the large action-space problem for power grid security by exploiting temporal graph convolutional neural networks (TGCNs) to develop a parallel but heterogeneous RL framework. In particular, we divide the action space into smaller subspaces, each explored by its own RL agent. Efficiently organizing the resulting spatiotemporal action sequences then becomes a major challenge. We invoke a TGCN to meet this challenge by accurately predicting the performance of each individual RL agent in the event of an attack; the top-performing agent is selected, yielding the optimal sequence of actions. First, we compare action-space sizes for the IEEE 5-bus and 14-bus systems. We then use the IEEE 14-bus and IEEE 118-bus systems, coupled with the Grid2Op platform, to illustrate performance and the influence of action-space division on training times and grid survival rates, using agents trained with deep Q-learning and Soft Actor-Critic as well as Grid2Op's default greedy agents. Our TGCN framework provides a computationally reasonable approach for generating the best course of actions to defend cyber-physical systems against attacks. (A toy sketch of the action-space division and agent-selection idea appears after this list.)
  5. Despite the potential of reinforcement learning (RL) for building general-purpose robotic systems, training RL agents to solve robotics tasks remains challenging due to the difficulty of exploration in purely continuous action spaces. Addressing this problem is an active area of research, with most of the focus on improving RL methods via better optimization or more efficient exploration. An alternative but important component to improve is the interface between the RL algorithm and the robot. In this work, we manually specify a library of robot action primitives (RAPS), parameterized with arguments that are learned by an RL policy. These parameterized primitives are expressive, simple to implement, enable efficient exploration, and can be transferred across robots, tasks, and environments. We perform a thorough empirical study across challenging tasks in three distinct domains with image input and a sparse terminal reward. We find that our simple change to the action interface substantially improves both learning efficiency and task performance irrespective of the underlying RL algorithm, significantly outperforming prior methods that learn skills from offline expert data. Code and videos are available at https://mihdalal.github.io/raps/. (A hypothetical sketch of such a parameterized-primitive action interface appears after this list.)
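Following up on item 3 (INSIGHT): that abstract describes a narrative-centered learning-environment simulator exposed through OpenAI Gym, a rule-based simulated student, and a deep Q-learning pedagogical planner. The sketch below is a heavily simplified, hypothetical stand-in for that arrangement; the observation features, actions, student dynamics, and reward are all invented for illustration and are not taken from the paper.

    import gym
    import numpy as np

    class SimulatedStudentEnv(gym.Env):
        """Hypothetical narrative-centered learning simulator: a crude rule-based
        student model, with one scaffolding decision by the planner per step."""

        def __init__(self):
            super().__init__()
            # Observation: [student_knowledge, frustration, narrative_progress]
            self.observation_space = gym.spaces.Box(0.0, 1.0, shape=(3,), dtype=np.float32)
            # Actions: 0 = no support, 1 = give hint, 2 = easier scenario, 3 = harder scenario
            self.action_space = gym.spaces.Discrete(4)

        def reset(self):
            self.state = np.array([0.2, 0.1, 0.0], dtype=np.float32)
            return self.state

        def step(self, action):
            knowledge, frustration, progress = self.state
            if action == 1:      # hint: small knowledge gain, calms the student a little
                knowledge, frustration = knowledge + 0.05, max(0.0, frustration - 0.05)
            elif action == 2:    # easier scenario: mainly reduces frustration
                frustration = max(0.0, frustration - 0.1)
            elif action == 3:    # harder scenario: larger knowledge gain, more frustration
                knowledge, frustration = knowledge + 0.1, frustration + 0.1
            progress += 0.1
            self.state = np.clip([knowledge, frustration, progress], 0.0, 1.0).astype(np.float32)
            reward = float(knowledge - frustration)  # crude proxy for effective learning
            done = bool(progress >= 1.0)
            return self.state, reward, done, {}

An off-the-shelf deep Q-learning implementation (for example, DQN from Stable-Baselines3) could then be trained against an environment like this, which is what makes on-demand exploration and evaluation possible without involving real students.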
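Following up on item 4: the core idea there is to partition a very large action space into subspaces, train one RL agent per subspace, and let a TGCN-based predictor pick the agent expected to perform best when an attack occurs. The toy code below shows only that division-and-selection skeleton; the agents are placeholders rather than trained deep Q-learning or Soft Actor-Critic policies, and the predictor is a stub where TGCN inference would go.

    import random

    def split_action_space(actions, n_agents):
        """Partition a flat action list into roughly equal subspaces, one per agent."""
        return [actions[i::n_agents] for i in range(n_agents)]

    class SubspaceAgent:
        """Placeholder for an RL agent trained to act within a single subspace."""
        def __init__(self, subspace):
            self.subspace = subspace

        def act(self, observation):
            return random.choice(self.subspace)  # a trained policy would choose here

    def predict_agent_scores(observation, agents):
        """Stub for the TGCN performance predictor: one predicted score per agent."""
        return [random.random() for _ in agents]

    def defend(observation, agents):
        # Select the agent the predictor expects to perform best, then act with it.
        scores = predict_agent_scores(observation, agents)
        best = agents[max(range(len(agents)), key=scores.__getitem__)]
        return best.act(observation)

    grid_actions = list(range(1000))  # stand-in for a large topology/redispatch action space
    agents = [SubspaceAgent(s) for s in split_action_space(grid_actions, n_agents=4)]
    chosen_action = defend(observation={"grid_state": "..."}, agents=agents)

In the setting described by the abstract, the placeholder agents would be Grid2Op policies trained with deep Q-learning or Soft Actor-Critic, and the score stub would be replaced by TGCN predictions of each agent's performance under the observed attack.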
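Following up on item 5 (RAPS): the action interface consists of hand-specified primitives whose arguments the RL policy outputs. The snippet below is a minimal, hypothetical version of such an interface; the primitive names, argument layout, and decoding scheme are invented for illustration and are not the paper's actual library.

    import numpy as np

    # Hypothetical primitive library: each primitive turns a short argument
    # vector (produced by the policy) into a low-level end-effector command.
    def reach(args):   # args = (dx, dy, dz)
        return {"delta_pos": np.asarray(args[:3], dtype=float)}

    def grasp(args):   # args = (gripper_width,)
        return {"gripper": float(args[0])}

    def lift(args):    # args = (height,)
        return {"delta_pos": np.array([0.0, 0.0, float(args[0])])}

    PRIMITIVES = [("reach", reach, 3), ("grasp", grasp, 1), ("lift", lift, 1)]

    def decode_action(policy_output):
        """Split a flat policy output into (primitive choice, primitive command).

        The first len(PRIMITIVES) entries act as selection logits; the rest is a
        shared argument vector from which the chosen primitive reads its slice."""
        n = len(PRIMITIVES)
        logits, args = policy_output[:n], policy_output[n:]
        idx = int(np.argmax(logits))
        name, fn, arity = PRIMITIVES[idx]
        offset = sum(a for _, _, a in PRIMITIVES[:idx])
        return name, fn(args[offset:offset + arity])

    # Example: a policy output that selects 'reach' with a small displacement.
    print(decode_action(np.array([2.0, 0.1, 0.3, 0.05, 0.0, -0.02, 0.7, 0.1])))

Because exploration then happens over primitive choices and their arguments rather than raw continuous commands, an interface like this is what the abstract credits with improved exploration efficiency and transfer across robots, tasks, and environments.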