Learning-Based Adaptive Optimal Control of Linear Time-Delay Systems: A Policy Iteration Approach

Cui, Leilei; Pang, Bo; Jiang, Zhong-Ping

doi:10.1109/TAC.2023.3273786

Citation Details

Learning-Based Adaptive Optimal Control of Linear Time-Delay Systems: A Policy Iteration Approach

This paper studies the adaptive optimal control problem for a class of linear time-delay systems described by delay differential equations (DDEs). A crucial strategy is to take advantage of recent developments in reinforcement learning (RL) and adaptive dynamic programming (ADP) and develop novel methods to learn adaptive optimal controllers from finite samples of input and state data. In this paper, the data-driven policy iteration (PI) is proposed to solve the infinite-dimensional algebraic Riccati equation (ARE) iteratively in the absence of exact model knowledge. Interestingly, the proposed recursive PI algorithm is new in the present context of continuous-time time-delay systems, even when the model knowledge is assumed known. The efficacy of the proposed learning-based control methods is validated by means of practical applications arising from metal cutting and autonomous driving. more »

Award ID(s):: 1903781 2148309

PAR ID:: 10479220

Author(s) / Creator(s):: Cui, Leilei; Pang, Bo; Jiang, Zhong-Ping

Publisher / Repository:: IEEE

Date Published:: 2023-05-01

Journal Name:: IEEE Transactions on Automatic Control

ISSN:: 0018-9286

Page Range / eLocation ID:: 1 to 8

Subject(s) / Keyword(s):: Adaptive dynamic programming, optimal control, linear time-delay systems, policy iteration

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1109/TAC.2023.3273786

More Like this