Compared with capital improvement projects, real-time control of stormwater systems may be a more effective and efficient approach to addressing the increasing risk of flooding in urban areas. One way to automate the design of control policies is through reinforcement learning (RL). Recently, RL methods have been applied to small stormwater systems and have demonstrated better performance than passive systems and simple rule-based strategies. However, it remains unclear how effective RL methods are for larger and more complex systems. Current RL-based control policies also suffer from poor convergence and stability, which may be due to overly large updates made by the underlying RL algorithm. In this study, we use the Proximal Policy Optimization (PPO) algorithm to develop control policies for a medium-sized stormwater system that significantly mitigate flooding during large storm events. Our approach demonstrates good convergence behavior and stability, and achieves robust out-of-sample performance.
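To make the control setup concrete, here is a minimal sketch of training a PPO agent on a toy two-pond stormwater environment written against the Gymnasium API. The pond dynamics, depth limits, and flooding penalty are illustrative placeholders rather than the study's simulator, and stable-baselines3 is assumed only as one readily available PPO implementation.

```python
# Minimal sketch: PPO on a toy two-pond stormwater environment.
# The dynamics and reward below are illustrative placeholders, not the
# study's stormwater simulator.
import numpy as np
import gymnasium as gym
from gymnasium import spaces
from stable_baselines3 import PPO  # assumes stable-baselines3 >= 2.0

class ToyStormwaterEnv(gym.Env):
    """Two storage ponds; actions set outlet-valve openings in [0, 1]."""

    def __init__(self, horizon=96):
        self.horizon = horizon
        self.observation_space = spaces.Box(0.0, 10.0, shape=(2,), dtype=np.float32)
        self.action_space = spaces.Box(0.0, 1.0, shape=(2,), dtype=np.float32)

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        self.t = 0
        self.depth = np.array([1.0, 1.0], dtype=np.float32)  # pond depths (m)
        return self.depth.copy(), {}

    def step(self, action):
        inflow = self.np_random.uniform(0.0, 0.5, size=2)        # toy storm runoff
        outflow = 0.6 * action * np.sqrt(self.depth)              # toy valve release
        self.depth = np.clip(self.depth + inflow - outflow, 0.0, 10.0).astype(np.float32)
        flooding = float(np.maximum(self.depth - 8.0, 0.0).sum())  # depth above crest
        self.t += 1
        return self.depth.copy(), -flooding, False, self.t >= self.horizon, {}

env = ToyStormwaterEnv()
model = PPO("MlpPolicy", env, verbose=0)
model.learn(total_timesteps=20_000)  # clipped updates keep each policy step small
```

PPO's clipped surrogate objective is what bounds the size of each policy update, which is the property credited above for the improved convergence and stability.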
MAGICS: Adversarial RL with Minimax Actors Guided by Implicit Critic Stackelberg for Convergent Neural Synthesis of Robot Safety
While robust optimal control theory provides a rigorous framework to compute robot control policies that are provably safe, it struggles to scale to high-dimensional problems, leading to increased use of deep learning for tractable synthesis of robot safety. Unfortunately, existing neural safety synthesis methods often lack convergence guarantees and solution interpretability. In this paper, we present Minimax Actors Guided by Implicit Critic Stackelberg (MAGICS), a novel adversarial reinforcement learning (RL) algorithm that guarantees local convergence to a minimax equilibrium solution. We then build on this approach to provide local convergence guarantees for a general deep RL-based robot safety synthesis algorithm. Through both simulation studies on OpenAI Gym environments and hardware experiments with a 36-dimensional quadruped robot, we show that MAGICS can yield robust control policies outperforming the state-of-the-art neural safety synthesis methods.
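As a rough intuition for the minimax structure referred to above, the toy value iteration below computes the max-min (Stackelberg-style) value of a small zero-sum Markov game: the controller commits to an action and the disturbance best-responds. The states, rewards, and transitions are arbitrary placeholders; this tabular backup only stands in for what MAGICS approximates with neural actors and an implicit critic in high dimensions.

```python
# Toy illustration of the zero-sum minimax backup that adversarial safety RL
# approximates with neural networks. The 4-state game, rewards, and
# transitions are arbitrary placeholders, not the robot setting.
import numpy as np

n_states, n_ctrl, n_dstb, gamma = 4, 3, 3, 0.95
rng = np.random.default_rng(0)
reward = rng.uniform(-1.0, 1.0, size=(n_states, n_ctrl, n_dstb))         # stand-in safety margin
next_state = rng.integers(0, n_states, size=(n_states, n_ctrl, n_dstb))  # deterministic transitions

V = np.zeros(n_states)
for _ in range(500):
    # Controller commits first (max over u); disturbance best-responds (min over d).
    Q = reward + gamma * V[next_state]      # shape: (state, ctrl, dstb)
    V_new = Q.min(axis=2).max(axis=1)       # max_u min_d Q(s, u, d)
    if np.max(np.abs(V_new - V)) < 1e-8:
        break
    V = V_new

policy = (reward + gamma * V[next_state]).min(axis=2).argmax(axis=1)  # greedy minimax control
print("max-min value per state:", np.round(V, 3))
```

In the safety-synthesis setting, the reward would encode a safety margin and the two players would be the robot controller and an adversarial disturbance.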
- Award ID(s): 2340851
- PAR ID: 10621715
- Publisher / Repository: Springer Proceedings in Advanced Robotics (SPAR)
- Date Published:
- Volume: XVI
- Subject(s) / Keyword(s): adversarial reinforcement learning; robot safety; game theory
- Format(s): Medium: X
- Location: Chicago, IL
- Sponsoring Org: National Science Foundation
More Like this
- We propose a deductive synthesis framework for constructing reinforcement learning (RL) agents that provably satisfy temporal reach-avoid specifications over infinite horizons. Our approach decomposes these temporal specifications into a sequence of finite-horizon subtasks, for which we synthesize individual RL policies. Using formal verification techniques, we ensure that the composition of a finite number of subtask policies guarantees satisfaction of the overall specification over infinite horizons. Experimental results on a suite of benchmarks show that our synthesized agents outperform standard RL methods in both task performance and compliance with safety and temporal requirements. (A minimal sketch of this composition idea appears after this list.)
- Deep reinforcement learning (RL) has led to encouraging successes in numerous challenging robotics applications. However, the lack of inductive biases to support logical deduction and generalization in the representation of a deep RL model makes it less effective at exploring complex long-horizon robot-control tasks with sparse reward signals. Existing program synthesis algorithms for RL problems inherit the same limitation, as they either adapt conventional RL algorithms to guide program search or synthesize robot-control programs to imitate an RL model. We propose ReGuS, a reward-guided synthesis paradigm, to unlock the potential of program synthesis to overcome these exploration challenges. We develop a novel hierarchical synthesis algorithm with a decomposed search space for loops, on-demand synthesis of conditional statements, and curriculum synthesis for procedure calls, to effectively compress the exploration space for long-horizon, multi-stage, and procedural robot-control tasks that are difficult to address with conventional RL techniques. Experimental results demonstrate that ReGuS significantly outperforms state-of-the-art RL algorithms and standard program synthesis baselines on challenging robot tasks including autonomous driving, locomotion control, and object manipulation. CCS Concepts: • Software and its engineering → Automatic programming.
- The ability to reuse trained models in reinforcement learning (RL) holds substantial practical value, in particular for complex tasks. While model reusability is widely studied for supervised models in data management, to the best of our knowledge this is the first principled study of model reusability for RL. To capture trained policies, we develop a framework based on an expressive and lossless graph data model that accommodates Temporal Difference Learning and Deep-RL based algorithms. Our framework can capture arbitrary reward functions that can be composed at inference time. The framework comes with theoretical guarantees, showing that it yields the same result as policies trained from scratch. We design a parameterized algorithm that strikes a balance between efficiency and quality w.r.t. cumulative reward. Our experiments with two common RL tasks (query refinement and robot movement) corroborate our theory and show the effectiveness and efficiency of our algorithms.
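As referenced in the deductive-synthesis item above, the sketch below illustrates the subtask-composition idea: a temporal reach-avoid specification is split into finite-horizon subtasks, each handled by its own policy, and a simple supervisor chains them so the behavior continues over an infinite horizon. The Subtask fields, goal predicates, and the 1-D example are hypothetical stand-ins for formally verified subtask policies.

```python
# Minimal sketch of composing finite-horizon subtask policies to satisfy a
# reach-avoid specification over an infinite horizon. The Subtask fields,
# predicates, and policies are hypothetical stand-ins for verified policies.
from dataclasses import dataclass
from typing import Any, Callable, List

@dataclass
class Subtask:
    policy: Callable[[Any], Any]    # maps observation -> action
    reached: Callable[[Any], bool]  # goal predicate for this subtask
    horizon: int                    # verified finite-horizon budget

class SequentialSupervisor:
    """Runs subtask policies in order; cycles to cover an infinite horizon."""

    def __init__(self, subtasks: List[Subtask]):
        self.subtasks, self.idx, self.steps = subtasks, 0, 0

    def act(self, obs):
        task = self.subtasks[self.idx]
        if task.reached(obs):                            # subtask goal met: advance
            self.idx = (self.idx + 1) % len(self.subtasks)
            self.steps = 0
            task = self.subtasks[self.idx]
        assert self.steps < task.horizon, "finite-horizon budget exceeded"
        self.steps += 1
        return task.policy(obs)

# Hypothetical usage: two trivial subtask policies on a 1-D state.
go_right = Subtask(policy=lambda s: +1, reached=lambda s: s >= 5, horizon=20)
go_left  = Subtask(policy=lambda s: -1, reached=lambda s: s <= 0, horizon=20)
agent = SequentialSupervisor([go_right, go_left])

s = 0
for _ in range(40):
    s += agent.act(s)  # alternates between reaching s >= 5 and returning to s <= 0
```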