


Title: Safe Exploration in Reinforcement Learning by Reachability Analysis over Learned Models
We introduce VELM, a reinforcement learning (RL) framework grounded in verification principles for safe exploration in unknown environments. VELM ensures that an RL agent systematically explores its environment, adhering to safety properties throughout the learning process. VELM learns environment models as symbolic formulas and conducts formal reachability analysis over the learned models for safety verification. An online shielding layer is then constructed to confine the RL agent’s exploration solely within a state space verified as safe in the learned model, thereby bolstering the overall safety profile of the RL system. Our experimental results demonstrate the efficacy of VELM across diverse RL environments, highlighting its capacity to significantly reduce safety violations in comparison to existing safe learning techniques, all without compromising the RL agent’s reward performance.
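The abstract describes VELM only at a high level. As a rough, hypothetical sketch of the online shielding step it mentions, the Python fragment below checks whether the agent's action keeps a crude one-step "reachable box" (computed from a learned one-step model) inside a verified-safe region, and falls back to a backup action otherwise. The names `model_step` and `backup_action`, the box-shaped safe region, and the corner-sampling reachability are all illustrative assumptions; corner sampling is not a sound over-approximation for general nonlinear models and merely stands in for the formal reachability analysis the paper performs over symbolic models.

```python
import itertools
import numpy as np

def one_step_box(model_step, state, action, eps):
    """Push the corners of an eps-box around `state` through the learned one-step
    model and return the bounding box of the images.  Illustrative stand-in only:
    corner sampling is NOT a sound over-approximation for nonlinear models."""
    dim = len(state)
    corners = (state + np.array(d) for d in itertools.product((-eps, eps), repeat=dim))
    images = np.array([model_step(c, action) for c in corners])
    return images.min(axis=0), images.max(axis=0)

def shielded_action(agent_action, backup_action, model_step, state,
                    safe_lo, safe_hi, eps=0.05):
    """Online shield: keep the agent's action only if the predicted box under the
    learned model stays inside the verified-safe region [safe_lo, safe_hi];
    otherwise substitute a backup action."""
    lo, hi = one_step_box(model_step, state, agent_action, eps)
    if np.all(lo >= safe_lo) and np.all(hi <= safe_hi):
        return agent_action      # exploration stays inside the verified-safe set
    return backup_action         # intervene before an unsafe transition can occur
```

In the paper's setting the learned model is a symbolic formula and both the safe region and the backup controller come out of the reachability analysis itself; the heuristic above only shows the control flow of the shield.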
Award ID(s):
2007799 2124155
PAR ID:
10511029
Author(s) / Creator(s):
Publisher / Repository:
Springer-Verlag Lecture Notes in Computer Science series
Date Published:
Journal Name:
36th International Conference on Computer Aided Verification (CAV)
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Matni, Nikolai ; Morari, Manfred ; Pappas, George J. (Ed.)
    Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training has recently received a lot of attention. Safety filters, e.g., based on control barrier functions (CBFs), provide a promising way to achieve safe RL by modifying the unsafe actions of an RL agent on the fly. Existing safety filter-based approaches typically involve learning the uncertain dynamics and quantifying the learned model error, which yields conservative filters until enough data has been collected to learn a good model, thereby hindering efficient exploration. This paper presents a method for safe and efficient RL using disturbance observers (DOBs) and CBFs. Unlike most existing safe RL methods that deal with hard state constraints, our method does not involve model learning; it leverages DOBs to accurately estimate the pointwise value of the uncertainty, which is then incorporated into a robust CBF condition to generate safe actions. The DOB-based CBF can be used as a safety filter with model-free RL algorithms by minimally modifying the actions of an RL agent whenever necessary to ensure safety throughout the learning process. Simulation results on a unicycle and a 2D quadrotor demonstrate that the proposed method outperforms a state-of-the-art safe RL algorithm using CBFs and Gaussian process-based model learning in terms of safety violation rate, and sample and computational efficiency.
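Item 1 describes the DOB-based CBF filter only in prose. Purely as a hedged illustration of how such a filter can "minimally modify" an RL action, the sketch below assumes a control-affine system x_dot = f(x) + g(x)u + d, a barrier function `h` with gradient `grad_h`, and a disturbance estimate `d_hat` supplied by an observer (not implemented here). With a single affine constraint, the minimal modification reduces to a closed-form projection, so no QP solver is needed. None of these names are taken from the paper.

```python
import numpy as np

def dob_cbf_filter(u_rl, x, f, g, h, grad_h, d_hat, alpha=1.0, margin=0.0):
    """Enforce a robust CBF condition
        grad_h(x) . (f(x) + g(x) u + d_hat) + alpha * h(x) - margin >= 0
    by minimally modifying the RL action u_rl: a closed-form projection onto
    the half-space  a . u >= b  in action space."""
    a = grad_h(x) @ g(x)                                    # constraint direction
    b = margin - grad_h(x) @ (f(x) + d_hat) - alpha * h(x)  # threshold: a.u >= b
    if a @ u_rl >= b or a @ a < 1e-12:
        return u_rl               # already safe, or the input cannot affect the constraint
    return u_rl + a * (b - a @ u_rl) / (a @ a)              # smallest correction restoring safety
```

Used as a wrapper around a model-free RL agent, such a filter leaves the learning algorithm untouched and only intervenes when the proposed action would violate the robust CBF condition.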
  2.
    We present Revel, a partially neural reinforcement learning (RL) framework for provably safe exploration in continuous state and action spaces. A key challenge for provably safe deep RL is that repeatedly verifying neural networks within a learning loop is computationally infeasible. We address this challenge using two policy classes: a general, neurosymbolic class with approximate gradients and a more restricted class of symbolic policies that allows efficient verification. Our learning algorithm is a mirror descent over policies: in each iteration, it safely lifts a symbolic policy into the neurosymbolic space, performs safe gradient updates to the resulting policy, and projects the updated policy into the safe symbolic subset, all without requiring explicit verification of neural networks. Our empirical results show that Revel enforces safe exploration in many scenarios in which Constrained Policy Optimization does not, and that it can discover policies that outperform those learned through prior approaches to verified exploration. 
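The Revel abstract sketches a lift / update / project loop. The toy below is a hypothetical, self-contained illustration of that loop structure only: symbolic policies are linear gains on a 1-D double integrator, the "neurosymbolic" lift adds a small tanh residual, the gradient update is a crude random-search estimator, and the projection is least-squares imitation. It deliberately omits the verification and shielding machinery that makes Revel's exploration provably safe; every name and constant is an assumption for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def rollout_return(policy, steps=50):
    """Negative quadratic cost on a 1-D double integrator (toy environment)."""
    x, ret = np.array([1.0, 0.0]), 0.0
    for _ in range(steps):
        u = float(np.clip(policy(x), -1.0, 1.0))
        x = np.array([x[0] + 0.1 * x[1], x[1] + 0.1 * u])
        ret -= x[0] ** 2 + 0.1 * u ** 2
    return ret

def lift(K):
    """Lift a symbolic (linear) policy into a 'neurosymbolic' one: the same gain
    plus a small tanh residual, initialized to stay close to K."""
    return K.copy(), 0.01 * rng.standard_normal((4, 2)), 0.01 * rng.standard_normal(4)

def neurosym(params):
    K, W, v = params
    return lambda x: K @ x + v @ np.tanh(W @ x)

def approx_gradient_update(params, lr=0.05, sigma=0.1, samples=16):
    """Random-search (ES-style) stand-in for the paper's approximate-gradient step."""
    K, W, v = params
    flat = np.concatenate([K.ravel(), W.ravel(), v.ravel()])
    base, grad = rollout_return(neurosym(params)), np.zeros_like(flat)
    for _ in range(samples):
        d = rng.standard_normal(flat.shape)
        pert = flat + sigma * d
        cand = (pert[:2], pert[2:10].reshape(4, 2), pert[10:])
        grad += (rollout_return(neurosym(cand)) - base) / sigma * d
    flat += lr * grad / samples
    return flat[:2], flat[2:10].reshape(4, 2), flat[10:]

def project_to_symbolic(params, n=200):
    """Project back to the symbolic class by least-squares imitation of the
    neurosymbolic policy; the real algorithm also verifies the result."""
    pi, X = neurosym(params), rng.uniform(-1, 1, size=(n, 2))
    y = np.array([pi(x) for x in X])
    return np.linalg.lstsq(X, y, rcond=None)[0]

K = np.array([-1.0, -1.0])          # initial (assumed verified-safe) symbolic policy
for _ in range(5):                  # mirror-descent-style loop: lift, update, project
    K = project_to_symbolic(approx_gradient_update(lift(K)))
```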
  3. Safe reinforcement learning (RL) has recently been employed to train a control policy that maximizes the task reward while satisfying safety constraints in a simulated secure cyber-physical environment. However, the vulnerability of safe RL has barely been studied in an adversarial setting. We argue that understanding the safety vulnerability of learned control policies is essential to achieving true safety in the physical world. To fill this research gap, we first formally define the adversarial safe RL problem and show that the optimal policies are vulnerable under observation perturbations. Then, we propose novel safety violation attacks that induce unsafe behaviors through adversarial models trained using reversed safety constraints. Finally, we show both theoretically and experimentally that our method is more effective at violating safety than existing adversarial RL works, which merely seek to decrease the task reward rather than violate safety constraints.
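Item 3 above (and item 4 below) describe attacks that train an adversary with "reversed" safety constraints, i.e., the adversary maximizes the victim's safety cost rather than degrading its reward. The self-contained toy below illustrates that objective with a hypothetical fixed victim controller, a bounded observation perturbation, and simple hill-climbing in place of the adversarial RL training used in the papers; every name and constant is an assumption for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)

def victim_policy(obs):
    """Fixed, pretrained 'safe' controller under attack (a toy linear law)."""
    return float(np.clip(-1.5 * obs[0] - 1.0 * obs[1], -1.0, 1.0))

def safety_cost(x):
    """Reversed safety constraint: cost accrues whenever |position| leaves the safe band."""
    return max(0.0, abs(x[0]) - 0.8)

def induced_cost(adv_params, eps=0.1, steps=60):
    """Roll out the victim while the adversary perturbs its observations within an
    eps-ball, and accumulate the safety cost those perturbations induce."""
    x, total = np.array([0.2, 0.0]), 0.0
    for _ in range(steps):
        delta = eps * np.tanh(adv_params @ x)   # bounded observation perturbation
        u = victim_policy(x + delta)            # victim acts on the perturbed observation
        x = np.array([x[0] + 0.1 * x[1], x[1] + 0.1 * u])
        total += safety_cost(x)
    return total

# "Reversed" objective: the adversary is trained to MAXIMIZE accumulated safety
# cost (hill-climbing here as a stand-in for adversarial RL training).
adv = np.zeros((2, 2))
for _ in range(200):
    cand = adv + 0.1 * rng.standard_normal(adv.shape)
    if induced_cost(cand) > induced_cost(adv):
        adv = cand
```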
  4. Cyber-Physical Systems (CPS) integrate sensing, control, computation, and networking with physical components and infrastructure connected by the Internet. Their autonomy and reliability have been enhanced by recent developments in safe reinforcement learning (safe RL). However, the vulnerability of safe RL to adversarial conditions has received minimal attention. To truly ensure safety in physical-world applications, it is crucial to understand and address these potential safety weaknesses in learned control policies. In this work, we demonstrate a novel attack that violates safety by inducing unsafe behaviors through adversarial models trained using reversed safety constraints. The experimental results show that the proposed method is more effective than existing works.