Learning Exploration Strategies to Solve Real-World Marble Runs

Allaire, Alisa; Atkeson, Christopher G.

doi:10.1109/ICRA48891.2023.10160759

Tasks involving locally unstable or discontinuous dynamics (such as bifurcations and collisions) remain challenging in robotics, because small variations in the environment can have a significant impact on task outcomes. For such tasks, learning a robust deterministic policy is difficult. We focus on structuring exploration with multiple stochastic policies based on a mixture of experts (MoE) policy representation that can be efficiently adapted. The MoE policy is composed of stochastic sub-policies that allow exploration of multiple distinct regions of the action space (or strategies) and a high-level selection policy to guide exploration towards the most promising regions. We develop a robot system to evaluate our approach in a real-world physical problem solving domain. After training the MoE policy in simulation, online learning in the real world demonstrates efficient adaptation within just a few dozen attempts, with a minimal sim2real gap. Our results confirm that representing multiple strategies promotes efficient adaptation in new environments, and that strategies learned under different dynamics can still provide useful information about where to look for good strategies.
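
The policy structure described in the abstract (several stochastic sub-policies plus a high-level selection policy) can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `MixtureOfExpertsPolicy`, the one-dimensional Gaussian sub-policies, the softmax selection over preference values, and the gradient-bandit style update are all assumptions chosen for illustration.

```python
import math
import random


class MixtureOfExpertsPolicy:
    """Toy MoE policy: Gaussian sub-policies plus a softmax selection policy.

    Hypothetical illustration only; the paper's actual parameterization
    and learning rule are not given in this record.
    """

    def __init__(self, means, stds, seed=0):
        self.means = list(means)             # centre of each strategy (assumed 1-D action)
        self.stds = list(stds)               # exploration noise of each sub-policy
        self.prefs = [0.0] * len(self.means) # selection preferences, initially uniform
        self.rng = random.Random(seed)

    def selection_probs(self):
        # Numerically stable softmax over the selection preferences.
        m = max(self.prefs)
        exps = [math.exp(p - m) for p in self.prefs]
        total = sum(exps)
        return [e / total for e in exps]

    def sample(self):
        # High-level policy picks a strategy; the sub-policy samples an action.
        probs = self.selection_probs()
        k = self.rng.choices(range(len(probs)), weights=probs)[0]
        action = self.rng.gauss(self.means[k], self.stds[k])
        return k, action

    def update(self, k, reward, lr=0.5):
        # Gradient-bandit style update (no baseline): raise the preference of
        # the chosen strategy in proportion to the reward it obtained.
        probs = self.selection_probs()
        for i in range(len(self.prefs)):
            indicator = 1.0 if i == k else 0.0
            self.prefs[i] += lr * reward * (indicator - probs[i])
```

In use, one would call `sample()` for each attempt, observe a task reward, and pass it to `update()`, so that selection probability shifts towards the strategies that succeed in the new environment while the other strategies remain available for exploration.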

Award ID(s): 1849287

PAR ID: 10496012

Author(s) / Creator(s): Allaire, Alisa; Atkeson, Christopher G.

Publisher / Repository: IEEE

Date Published: 2023-05-29

Journal Name: Proceedings IEEE International Conference on Robotics and Automation

ISSN: 1050-4729

ISBN: 979-8-3503-2365-8

Page Range / eLocation ID: 7243 to 7249

Location: London, United Kingdom

Sponsoring Org: National Science Foundation

Conference Paper: https://doi.org/10.1109/ICRA48891.2023.10160759
