Title: Certifiably-correct Control Policies for Safe Learning and Adaptation in Assistive Robotics
Guaranteeing safety in human-centric applications is critical in robot learning, as the learned policies may demonstrate unsafe behaviors in previously unseen scenarios. We present a framework to locally repair an erroneous policy network so that it satisfies a set of formal safety constraints, using Mixed Integer Quadratic Programming (MIQP). Our MIQP formulation explicitly imposes the safety constraints on the learned policy while minimizing the original loss function. The repaired policy network is then verified to be locally safe. We demonstrate the application of our framework by deriving safe control policies for a robotic lower-leg prosthesis.
Award ID(s): 1932068
PAR ID: 10491306
Author(s) / Creator(s):
Publisher / Repository: JMLR NeuRIPS
Date Published:
Format(s): Medium: X
Sponsoring Org: National Science Foundation
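As a rough illustration of the kind of MIQP repair formulation described in the abstract above, the sketch below repairs the hidden layer of a small ReLU policy network so that, on a batch of sampled states, the output action respects formal safety bounds while the weights stay close to the original ones (a proxy for the original loss). This is an interpretation, not the paper's implementation; the network size, safety bounds, big-M constant, and use of cvxpy with an external MIQP-capable solver are all illustrative assumptions.

```python
# A minimal sketch, not the paper's implementation: the ReLU is encoded with
# big-M binary variables, which is what makes the program a MIQP.
import numpy as np
import cvxpy as cp

rng = np.random.default_rng(0)

# Hypothetical pre-trained 2-layer policy: a = W2 @ relu(W1 @ s + b1) + b2
n_s, n_h, n_a = 4, 8, 1                        # state, hidden, action dimensions
W1, b1 = rng.normal(size=(n_h, n_s)), rng.normal(size=n_h)
W2, b2 = rng.normal(size=(n_a, n_h)), rng.normal(size=n_a)

S = rng.normal(size=(16, n_s))                 # sampled states to repair/certify on
a_lo, a_hi = -1.0, 1.0                         # formal safety bounds on the action
M = 100.0                                      # big-M constant (assumed valid bound)

# Decision variables: repaired first-layer weights plus ReLU encodings
W1r, b1r = cp.Variable((n_h, n_s)), cp.Variable(n_h)
H = cp.Variable((len(S), n_h))                 # post-ReLU hidden activations
Z = cp.Variable((len(S), n_h), boolean=True)   # ReLU on/off indicators

cons = []
for i, s in enumerate(S):
    pre = W1r @ s + b1r                        # pre-activation for state s
    # Big-M encoding of H[i] = max(pre, 0)
    cons += [H[i] >= 0, H[i] >= pre,
             H[i] <= pre + M * (1 - Z[i]),
             H[i] <= M * Z[i]]
    # Safety constraint imposed on the repaired policy's output
    a = W2 @ H[i] + b2
    cons += [a >= a_lo, a <= a_hi]

# Quadratic objective: stay close to the original (erroneous) weights
obj = cp.Minimize(cp.sum_squares(W1r - W1) + cp.sum_squares(b1r - b1))
prob = cp.Problem(obj, cons)
prob.solve(solver=cp.GUROBI)                   # any installed MIQP-capable solver works
print("repair status:", prob.status)
```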
More Like this
1. Goal-conditioned policies, such as those learned via imitation learning, provide an easy way for humans to influence what tasks robots accomplish. However, these robot policies are not guaranteed to execute safely or to succeed when faced with out-of-distribution goal requests. In this work, we enable robots to know when they can confidently execute a user's desired goal, and automatically suggest safe alternatives when they cannot. Our approach is inspired by control-theoretic safety filtering, wherein a safety filter minimally adjusts a robot's candidate action to be safe. Our key idea is to pose alternative suggestion as a safe control problem in goal space, rather than in action space. Offline, we use reachability analysis to compute a goal-parameterized reach-avoid value network which quantifies the safety and liveness of the robot's pre-trained policy. Online, our robot uses the reach-avoid value network as a safety filter, monitoring the human's given goal and actively suggesting alternatives that are similar but meet the safety specification. We demonstrate our Safe ALTernatives (SALT) framework in simulation experiments with indoor navigation and Franka Panda tabletop manipulation, and with both discrete and continuous goal representations. We find that SALT is able to learn to predict successful and failed closed-loop executions, is a less pessimistic monitor than open-loop uncertainty quantification, and proposes alternatives that consistently align with those that people find acceptable.
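The sketch below illustrates the goal-space filtering idea from the abstract above using stand-in components: a hypothetical reach-avoid value function certifies the requested goal, and if it fails, the nearest safe candidate goal is suggested instead. The sign convention (a nonnegative value means safe and live) and all numbers are assumptions, not SALT's actual model.

```python
# A minimal sketch of goal-space safety filtering with stand-in components.
import numpy as np

def reach_avoid_value(state, goal):
    # Stand-in for a learned reach-avoid value network; assumed convention:
    # value >= 0 means the pre-trained policy can reach `goal` while staying safe.
    obstacle = np.array([0.5, 0.5])
    return np.linalg.norm(goal - obstacle) - 0.3   # "safe" if goal is far from the obstacle

def filter_goal(state, requested_goal, candidate_goals):
    if reach_avoid_value(state, requested_goal) >= 0:
        return requested_goal                       # confident: execute as asked
    # Otherwise rank safe candidates by similarity to the request
    safe = [g for g in candidate_goals if reach_avoid_value(state, g) >= 0]
    if not safe:
        return None                                 # no safe alternative to suggest
    return min(safe, key=lambda g: np.linalg.norm(g - requested_goal))

state = np.zeros(2)
candidates = [np.array([x, y]) for x in np.linspace(0, 1, 5) for y in np.linspace(0, 1, 5)]
print(filter_goal(state, np.array([0.55, 0.55]), candidates))   # suggests a nearby safe goal
```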
2. Offline safe reinforcement learning (OSRL) aims to learn policies with high rewards while satisfying safety constraints solely from data collected offline. However, the learned policies often struggle to handle states and actions that are absent from, or out-of-distribution (OOD) with respect to, the offline dataset, which can result in violations of the safety constraints or overly conservative behavior during online deployment. Moreover, many existing methods are unable to learn policies that can adapt to varying constraint thresholds. To address these challenges, we propose constraint-conditioned actor-critic (CCAC), a novel OSRL method that models the relationship between state-action distributions and safety constraints, and leverages this relationship to regularize critics and policy learning. CCAC learns policies that can effectively handle OOD data and adapt to varying constraint thresholds. Empirical evaluations on benchmark tasks show that CCAC significantly outperforms existing methods for learning adaptive, safe, and high-reward policies.
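One common way to realize constraint conditioning, sketched below as an interpretation rather than CCAC itself, is to feed the cost threshold to the actor (and critics) as an extra input, so that a single trained model can be deployed under different budgets at test time. The network sizes and the scalar-budget encoding are illustrative assumptions.

```python
# A minimal sketch of a constraint-conditioned actor (not the CCAC architecture).
import torch
import torch.nn as nn

class ConstraintConditionedActor(nn.Module):
    def __init__(self, state_dim, action_dim, hidden=64):
        super().__init__()
        # +1 input for the scalar constraint threshold kappa
        self.net = nn.Sequential(
            nn.Linear(state_dim + 1, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, action_dim), nn.Tanh(),
        )

    def forward(self, state, kappa):
        # Concatenate the threshold so the policy can adapt its behavior to it
        return self.net(torch.cat([state, kappa], dim=-1))

actor = ConstraintConditionedActor(state_dim=8, action_dim=2)
s = torch.randn(32, 8)
kappa = torch.full((32, 1), 5.0)   # per-episode cost budget (illustrative value)
a = actor(s, kappa)                # actions depend on the requested threshold
print(a.shape)                     # torch.Size([32, 2])
```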
3. Reinforcement learning (RL) in low-data and risk-sensitive domains requires performant and flexible deployment policies that can readily incorporate constraints at deployment time. One such class comprises semi-parametric H-step lookahead policies, which select actions using trajectory optimization over a dynamics model for a fixed horizon with a terminal value function. In this work, we investigate a novel instantiation of H-step lookahead with a learned model and a terminal value function learned by a model-free off-policy algorithm, named Learning Off-Policy with Online Planning (LOOP). We provide a theoretical analysis of this method, suggesting a tradeoff between model errors and value function errors, and empirically demonstrate that this tradeoff is beneficial in deep reinforcement learning. Furthermore, we identify the "Actor Divergence" issue in this framework and propose Actor Regularized Control (ARC), a modified trajectory optimization procedure. We evaluate our method on a set of robotic tasks for offline and online RL and demonstrate improved performance. We also show the flexibility of LOOP to incorporate safety constraints during deployment with a set of navigation environments. We demonstrate that LOOP is a desirable framework for robotics applications based on its strong performance in various important RL settings.
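The core structure of an H-step lookahead policy with a terminal value function can be sketched as below. The dynamics model, reward, and value function are stand-ins, and plain random shooting replaces LOOP's regularized trajectory optimizer (ARC) for brevity; none of this is the authors' code.

```python
# A minimal MPC-style sketch of H-step lookahead with a terminal value function.
import numpy as np

def dynamics_model(s, a):   return s + 0.1 * a                        # stand-in learned model
def reward_fn(s, a):        return -np.sum(s**2) - 0.01 * np.sum(a**2) # stand-in reward
def terminal_value(s):      return -np.sum(s**2)                       # stand-in model-free value

def h_step_lookahead(state, horizon=5, num_samples=256, action_dim=2, gamma=0.99):
    best_score, best_first_action = -np.inf, None
    for _ in range(num_samples):
        s, score = state.copy(), 0.0
        actions = np.random.uniform(-1, 1, size=(horizon, action_dim))
        for t in range(horizon):
            score += (gamma**t) * reward_fn(s, actions[t])
            s = dynamics_model(s, actions[t])
        score += (gamma**horizon) * terminal_value(s)   # value beyond the planning horizon
        if score > best_score:
            best_score, best_first_action = score, actions[0]
    return best_first_action                            # execute only the first action

print(h_step_lookahead(np.array([1.0, -0.5])))
```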
4. Among approaches for provably safe reinforcement learning, Model Predictive Shielding (MPS) has proven effective at complex tasks in continuous, high-dimensional state spaces, by leveraging a backup policy to ensure safety when the learned policy attempts to take risky actions. However, while MPS can ensure safety both during and after training, it often hinders task progress due to the conservative and task-oblivious nature of backup policies. This paper introduces Dynamic Model Predictive Shielding (DMPS), which optimizes reinforcement learning objectives while maintaining provable safety. DMPS employs a local planner to dynamically select safe recovery actions that maximize both short-term progress and long-term rewards. Crucially, the planner and the neural policy play a synergistic role in DMPS. When planning recovery actions for ensuring safety, the planner utilizes the neural policy to estimate long-term rewards, allowing it to observe beyond its short-term planning horizon. Conversely, the neural policy under training learns from the recovery plans proposed by the planner, converging to policies that are both high-performing and safe in practice. This approach guarantees safety during and after training, with bounded recovery regret that decreases exponentially with planning horizon depth. Experimental results demonstrate that DMPS converges to policies that rarely require shield interventions after training and achieve higher rewards compared to several state-of-the-art baselines.
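The sketch below illustrates the dynamic-shielding idea with toy one-dimensional dynamics: the learned action is executed only if a backup rollout certifies the resulting state recoverable; otherwise candidate recovery actions are scored by short-term progress plus the neural policy's value estimate. Every component here is a stand-in, not the DMPS implementation.

```python
# A minimal sketch of dynamic model predictive shielding with stand-in components.
import numpy as np

def step(s, a):        return s + 0.1 * a                 # stand-in dynamics model
def is_safe(s):        return bool(np.all(np.abs(s) < 1.0))
def backup_policy(s):  return -np.sign(s)                 # drive back toward the origin
def learned_policy(s): return np.array([1.0])             # possibly risky learned action
def value_estimate(s): return -float(np.sum(s**2))        # stand-in neural value estimate

def safe_under_backup(s, horizon=10):
    # A state is considered recoverable if the backup policy keeps it safe
    for _ in range(horizon):
        if not is_safe(s):
            return False
        s = step(s, backup_policy(s))
    return is_safe(s)

def shielded_action(s, gamma=0.99):
    a = learned_policy(s)
    if safe_under_backup(step(s, a)):
        return a                                          # risky action is still recoverable
    # Dynamic recovery planning: score safe candidate actions by short-term progress
    # plus the policy's long-term value estimate at the planning frontier.
    best_a, best_score = backup_policy(s), -np.inf
    for cand in np.linspace(-1, 1, 21):
        a_c = np.array([cand])
        s_next = step(s, a_c)
        if not safe_under_backup(s_next):
            continue
        score = -np.sum(s_next**2) + gamma * value_estimate(s_next)
        if score > best_score:
            best_a, best_score = a_c, score
    return best_a

print(shielded_action(np.array([0.95])))   # overrides the risky learned action
```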
5. Cyber-Physical Systems (CPS) integrate sensing, control, computation, and networking with physical components and infrastructure connected by the internet. Their autonomy and reliability are enhanced by recent developments in safe reinforcement learning (safe RL). However, the vulnerability of safe RL to adversarial conditions has received minimal exploration. To truly ensure safety in physical-world applications, it is crucial to understand and address these potential weaknesses in learned control policies. In this work, we demonstrate a novel attack that violates safety by inducing unsafe behaviors through adversarial models trained with reversed safety constraints. Experimental results show that the proposed method is more effective than existing approaches.
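A toy rendering of the reversed-constraint idea is sketched below: an adversary searches for an observation perturbation that maximizes the very safety cost the victim policy was trained to keep small. The naive random-search loop, the victim policy, and the cost function are all illustrative stand-ins, not the paper's attack.

```python
# A minimal sketch of a reversed-safety-constraint observation attack (illustrative only).
import numpy as np

def victim_policy(obs):          return -0.5 * obs                      # stand-in safe policy
def safety_cost(state, action):  return float(np.abs(state + action) > 0.5)  # 1 if unsafe

def train_observation_attack(state, eps=0.2, iters=200, rng=np.random.default_rng(0)):
    # Reversed objective: the constraint the victim minimizes is what the adversary maximizes.
    best_delta, best_cost = np.zeros_like(state), -np.inf
    for _ in range(iters):
        delta = rng.uniform(-eps, eps, size=state.shape)
        cost = safety_cost(state, victim_policy(state + delta))
        if cost > best_cost:
            best_delta, best_cost = delta, cost
    return best_delta, best_cost

state = np.array([0.9])
delta, cost = train_observation_attack(state)
print("perturbation:", delta, "induced safety cost:", cost)
```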