Enhancing Safety in Learning from Demonstration Algorithms via Control Barrier Function Shielding

Yang, Yue; Chen, Letian; Zaidi, Zulfiqar; van Waveren, Sanne; Krishna, Arjun; Gombolay, Matthew

doi:10.1145/3610977.3635002

Citation Details

Enhancing Safety in Learning from Demonstration Algorithms via Control Barrier Function Shielding

Learning from Demonstration (LfD) is a powerful method for nonroboticists end-users to teach robots new tasks, enabling them to customize the robot behavior. However, modern LfD techniques do not explicitly synthesize safe robot behavior, which limits the deployability of these approaches in the real world. To enforce safety in LfD without relying on experts, we propose a new framework, ShiElding with Control barrier fUnctions in inverse REinforcement learning (SECURE), which learns a customized Control Barrier Function (CBF) from end-users that prevents robots from taking unsafe actions while imposing little interference with the task completion. We evaluate SECURE in three sets of experiments. First, we empirically validate SECURE learns a high-quality CBF from demonstrations and outperforms conventional LfD methods on simulated robotic and autonomous driving tasks with improvements on safety by up to 100%. Second, we demonstrate that roboticists can leverage SECURE to outperform conventional LfD approaches on a real-world knife-cutting, meal-preparation task by 12.5% in task completion while driving the number of safety violations to zero. Finally, we demonstrate in a user study that non-roboticists can use SECURE to efectively teach the robot safe policies that avoid collisions with the person and prevent cofee from spilling. more »

Award ID(s):: 2219755

PAR ID:: 10499423

Author(s) / Creator(s):: Yang, Yue; Chen, Letian; Zaidi, Zulfiqar; van Waveren, Sanne; Krishna, Arjun; Gombolay, Matthew

Publisher / Repository:: ACM

Date Published:: 2024-03-11

Journal Name:: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction

ISBN:: 9798400703225

Page Range / eLocation ID:: 820 to 829

Subject(s) / Keyword(s):: Learning from Demonstration, Control Barrier Function, Safety

Format(s):: Medium: X

Location:: Boulder CO USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3610977.3635002

More Like this