Safe adaptive output‐feedback optimal control of a class of linear systems

Mahmud, S_M Nahid; Abudia, Moad; Nivison, Scott A; Bell, Zachary I; Kamalapurkar, Rushikesh

doi:10.1002/rnc.7334

Citation Details

The objective of this research is to enable safety‐critical systems to simultaneously learn and execute optimal control policies in a safe manner to achieve complex autonomy. Learning optimal policies via trial and error, that is, traditional reinforcement learning, is difficult to implement in safety‐critical systems, particularly when task restarts are unavailable. Safe model‐based reinforcement learning techniques based on a barrier transformation have recently been developed to address this problem. However, these methods rely on full‐state feedback, limiting their usability in a real‐world environment. In this work, an output‐feedback safe model‐based reinforcement learning technique based on a novel barrier‐aware dynamic state estimator has been designed to address this issue. The developed approach facilitates simultaneous learning and execution of safe control policies for safety‐critical linear systems. Simulation results indicate that barrier transformation is an effective approach to achieve online reinforcement learning in safety‐critical systems using output feedback.

Award ID(s): 2027999

PAR ID: 10530445

Author(s) / Creator(s): Mahmud, S_M Nahid; Abudia, Moad; Nivison, Scott A; Bell, Zachary I; Kamalapurkar, Rushikesh

Publisher / Repository: Wiley

Date Published: 2024-07-25

Journal Name: International Journal of Robust and Nonlinear Control

Volume: 34

Issue: 11

ISSN: 1049-8923

Page Range / eLocation ID: 7082 to 7095

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.1002/rnc.7334
