Waypoint-Based Reinforcement Learning for Robot Manipulation Tasks

Mehta, Shaunak A; Habibian, Soheil; Losey, Dylan P

doi:10.1109/IROS58592.2024.10802681

Citation Details

Waypoint-Based Reinforcement Learning for Robot Manipulation Tasks

Robot arms should be able to learn new tasks. One framework here is reinforcement learning, where the robot is given a reward function that encodes the task, and the robot autonomously learns actions to maximize its reward. Existing approaches to reinforcement learning often frame this problem as a Markov decision process, and learn a policy (or a hierarchy of policies) to complete the task. These policies reason over hundreds of fine-grained actions that the robot arm needs to take: e.g., moving slightly to the right or rotating the end-effector a few degrees. But the manipulation tasks that we want robots to perform can often be broken down into a small number of high-level motions: e.g., reaching an object or turning a handle. In this paper we therefore propose a waypoint-based approach for model-free reinforcement learning. Instead of learning a low-level policy, the robot now learns a trajectory of waypoints, and then interpolates between those waypoints using existing controllers. Our key novelty is framing this waypoint-based setting as a sequence of multi-armed bandits: each bandit problem corresponds to one waypoint along the robot’s motion. We theoretically show that an ideal solution to this reformulation has lower regret bounds than standard frameworks. We also introduce an approximate posterior sampling solution that builds the robot’s motion one waypoint at a time. Results across benchmark simulations and two real-world experiments suggest that this proposed approach learns new tasks more quickly than state-of-the-art baselines. See our website here: https://collab.me.vt.edu/rl-waypoints/ more »

Award ID(s):: 2129201 2205241

PAR ID:: 10567710

Author(s) / Creator(s):: Mehta, Shaunak A; Habibian, Soheil; Losey, Dylan P

Publisher / Repository:: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Date Published:: 2024-10-14

Journal Name:: Proceedings of the International Conference on Intelligent Robots and Systems

ISSN:: 2153-0866

ISBN:: 979-8-3503-7770-5

Page Range / eLocation ID:: 541 to 548

Format(s):: Medium: X

Location:: Abu Dhabi, United Arab Emirates

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/IROS58592.2024.10802681

More Like this