An intermittent, model-free optimal control algorithm that enables an autonomous vehicle to track a nonpredetermined trajectory at high speed is presented. The approach is bandwidth and energy efficient in that communication between actuators is limited to instances when it is needed rather than performing unnecessary periodic updates. We formulate the problem by properly augmenting the system and reference (trajectory) data and then designing a triggering mechanism for the controller to work with a sampled version of the augmented states at some triggering instants. In order to obtain a model-free solution, we leverage a Q-learning framework with a zero-order-hold actor network and a critic network to approximate the optimal intermittent controller and the optimal cost, respectively, resulting in appropriate tuning laws. Finally, we provide a numerical example of a ground vehicle driving autonomously at high speed on a race track.
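As a rough illustration of the structure described in this abstract, the following Python sketch pairs a quadratic-basis Q-function critic with a zero-order-hold actor that is refreshed only at triggering instants. It is a minimal sketch, not the paper's exact tuning laws: the dimensions, gains, trigger threshold, and discount factor are all illustrative assumptions.

```python
import numpy as np

# Illustrative dimensions: augmented state (plant + reference) and inputs.
n_aug, m = 6, 2
d = n_aug + m
W_critic = 0.01 * np.random.randn(d * (d + 1) // 2)  # quadratic-basis critic weights
W_actor = np.zeros((m, n_aug))                       # linear zero-order-hold actor

def phi(z):
    """Quadratic basis, so Q(x, u) ~ W_critic . phi([x; u])."""
    return np.outer(z, z)[np.triu_indices(d)]

def greedy_action(x):
    """Recover the quadratic kernel from the critic and minimize Q over u."""
    P = np.zeros((d, d))
    P[np.triu_indices(d)] = W_critic
    P = 0.5 * (P + P.T)                               # symmetrize the kernel
    Quu, Qux = P[n_aug:, n_aug:], P[n_aug:, :n_aug]
    return -np.linalg.solve(Quu + 1e-6 * np.eye(m), Qux @ x)

def control(x, x_held, u_held, delta=0.1):
    """Zero-order-hold control: a new input is transmitted only when triggered."""
    if np.linalg.norm(x - x_held) > delta:            # assumed triggering condition
        return x, W_actor @ x                         # triggering instant: resample
    return x_held, u_held                             # otherwise hold the last input

def learn(transitions, gamma=0.99, alpha_c=0.05, alpha_a=0.01):
    """Temporal-difference tuning from (x, u, cost, x_next, u_next) samples."""
    global W_critic, W_actor
    for x, u, cost, x_next, u_next in transitions:
        z, z_next = np.concatenate([x, u]), np.concatenate([x_next, u_next])
        td = cost + gamma * (W_critic @ phi(z_next)) - W_critic @ phi(z)
        W_critic += alpha_c * td * phi(z)             # critic tuning law
        err = W_actor @ x - greedy_action(x)          # actor vs. Q-greedy input
        W_actor -= alpha_a * np.outer(err, x)         # actor tuning law
```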
Dynamic intermittent Q-learning-based model-free suboptimal co-design of ℒ2-stabilization
                        
                    
    
Summary: This paper proposes an intermittent model-free learning algorithm for linear time-invariant systems, where the control policy and transmission decisions are co-designed simultaneously while also being subjected to worst-case disturbances. The control policy is designed by introducing an internal dynamical system to further reduce the transmission rate and provide bandwidth flexibility in cyber-physical systems. Moreover, a Q-learning algorithm with two actors and a single critic structure is developed to learn the optimal parameters of a Q-function. It is shown by using an impulsive system approach that the closed-loop system has an asymptotically stable equilibrium and that no Zeno behavior occurs. Furthermore, a qualitative performance analysis of the model-free dynamic intermittent framework is given, showing the degree of suboptimality with respect to the optimal continuously updated controller. Finally, a numerical simulation of an unknown system is carried out to highlight the efficacy of the proposed framework.
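The internal dynamical system mentioned in this summary plays the role of a dynamic triggering variable. The sketch below follows the standard dynamic event-triggering template rather than the paper's learned co-design; the constants lam, sigma, theta, and dt are illustrative assumptions.

```python
import numpy as np

class DynamicTrigger:
    """Dynamic event-trigger with an internal state eta that accumulates slack."""

    def __init__(self, lam=1.0, sigma=0.05, theta=1.0, dt=1e-3):
        self.lam, self.sigma, self.theta, self.dt = lam, sigma, theta, dt
        self.eta = 1.0                         # internal dynamic variable, eta(0) > 0

    def transmit(self, x, x_last_sent):
        e = x - x_last_sent                    # error accumulated since last transmission
        slack = self.sigma * (x @ x) - e @ e   # static-trigger margin
        # Internal dynamics eta' = -lam * eta + slack, integrated by forward Euler.
        self.eta += self.dt * (-self.lam * self.eta + slack)
        # Transmit only when the accumulated slack can no longer absorb the error.
        return self.eta + self.theta * slack < 0.0
```

Because eta stores past margin, transmissions occur no more often than under the corresponding static condition sigma*||x||^2 >= ||e||^2, which is the sense in which the internal system reduces the transmission rate.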
- Award ID(s): 1851588
- PAR ID: 10461456
- Publisher / Repository: Wiley Blackwell (John Wiley & Sons)
- Date Published:
- Journal Name: International Journal of Robust and Nonlinear Control
- Volume: 29
- Issue: 9
- ISSN: 1049-8923
- Page Range / eLocation ID: p. 2673-2694
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- The robust 𝜙-regularized Markov Decision Process (RRMDP) framework focuses on designing control policies that are robust against parameter uncertainties due to mismatches between the simulator (nominal) model and real-world settings. This work makes two important contributions. First, we propose a model-free algorithm called Robust 𝜙-regularized fitted Q-iteration for learning an 𝜖-optimal robust policy that uses only the historical data collected by rolling out a behavior policy (satisfying a robust exploratory requirement) on the nominal model. To the best of our knowledge, we provide the first unified analysis for a class of 𝜙-divergences achieving robust optimal policies in high-dimensional systems with arbitrarily large state spaces and general function approximation. Second, we introduce the hybrid robust 𝜙-regularized reinforcement learning framework to learn an optimal robust policy using both historical data and online sampling. For this framework, we propose a model-free algorithm called Hybrid robust Total-variation-regularized Q-iteration. To the best of our knowledge, we provide the first improved out-of-data-distribution assumption in large-scale problems with arbitrarily large state spaces and general function approximation under the hybrid robust 𝜙-regularized reinforcement learning framework.
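As a concrete, simplified instance of the first contribution, the sketch below runs fitted Q-iteration on historical data with a regularized robust backup, specialized to the KL divergence, for which the soft worst case over next states has the closed form -rho * log E[exp(-V(s')/rho)]. The random-forest regressor, rho, and gamma are assumptions for illustration, not the paper's general 𝜙-divergence function class.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def robust_fqi(dataset, n_actions, iters=30, gamma=0.99, rho=1.0):
    """Robust regularized fitted Q-iteration (KL special case) on offline data.

    dataset: list of (s, a, r, s_next) collected by a behavior policy
    rolled out on the nominal model; s is a feature vector, a an int.
    """
    S = np.array([t[0] for t in dataset])
    A = np.array([t[1] for t in dataset])
    R = np.array([t[2] for t in dataset])
    S2 = np.array([t[3] for t in dataset])
    X = np.column_stack([S, A])                       # regression features for Q(s, a)
    Q = None
    for _ in range(iters):
        if Q is None:
            V = np.zeros(len(dataset))                # V_0 = 0
        else:
            qs = [Q.predict(np.column_stack([S2, np.full(len(S2), a)]))
                  for a in range(n_actions)]
            V = np.max(np.stack(qs, axis=1), axis=1)  # greedy value at next states
        # Two-stage robust target: regress w(s, a) ~ E[exp(-V(s')/rho) | s, a],
        # then the KL-regularized robust value is -rho * log w(s, a).
        w_fit = RandomForestRegressor(n_estimators=50).fit(X, np.exp(-V / rho))
        w = np.clip(w_fit.predict(X), 1e-12, None)
        y = R + gamma * (-rho * np.log(w))            # robust Bellman target
        Q = RandomForestRegressor(n_estimators=50).fit(X, y)
    return Q
```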
- This paper proposes a novel robust reinforcement learning framework for discrete-time linear systems with model mismatch that may arise from the sim-to-real gap. A key strategy is to invoke advanced techniques from control theory. Using the formulation of classical risk-sensitive linear quadratic Gaussian control, a dual-loop policy optimization algorithm is proposed to generate a robust optimal controller. The dual-loop policy optimization algorithm is shown to be globally and uniformly convergent, and robust against disturbances during the learning process. This robustness property is called small-disturbance input-to-state stability and guarantees that the proposed policy optimization algorithm converges to a small neighborhood of the optimal controller as long as the disturbance at each learning step is relatively small. In addition, when the system dynamics are unknown, a novel model-free off-policy policy optimization algorithm is proposed. Finally, numerical examples are provided to illustrate the proposed algorithm.
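A minimal model-based sketch of the dual-loop idea, stated on the discrete-time LQ zero-sum game that underlies risk-sensitive LQG control: the inner loop computes the worst-case disturbance gain for the current controller, and the outer loop performs a policy-improvement step. It assumes known matrices (A, B, D), an admissible attenuation level gamma2, and stabilizing iterates; the paper's model-free off-policy algorithm replaces these Lyapunov solves with data.

```python
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

def dual_loop_pi(A, B, D, Q, R, gamma2, outer=30, inner=30):
    """Dual-loop policy iteration for min_K max_L of x'Qx + u'Ru - gamma2 * w'w."""
    n, m = B.shape
    q = D.shape[1]
    K = np.zeros((m, n))                       # assumed initial stabilizing gain
    L = np.zeros((q, n))                       # disturbance gain, w = L x
    for _ in range(outer):
        for _ in range(inner):                 # inner loop: worst-case disturbance
            Acl = A - B @ K + D @ L            # closed loop with u = -K x, w = L x
            Qk = Q + K.T @ R @ K - gamma2 * L.T @ L
            P = solve_discrete_lyapunov(Acl.T, Qk)        # evaluate the gain pair
            L = np.linalg.solve(gamma2 * np.eye(q) - D.T @ P @ D,
                                D.T @ P @ (A - B @ K))    # disturbance improvement
        K = np.linalg.solve(R + B.T @ P @ B,
                            B.T @ P @ (A + D @ L))        # controller improvement
    return K, P
```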
- We consider model-free reinforcement learning for infinite-horizon discounted Markov Decision Processes (MDPs) with a continuous state space and unknown transition kernel, when only a single sample path under an arbitrary policy of the system is available. We consider the Nearest Neighbor Q-Learning (NNQL) algorithm, which learns the optimal Q-function using a nearest neighbor regression method. As the main contribution, we provide a tight finite-sample analysis of the convergence rate. In particular, for MDPs with a d-dimensional state space and a discount factor in (0, 1), given an arbitrary sample path with "covering time" L, we establish that the algorithm is guaranteed to output an 𝜖-accurate estimate of the optimal Q-function with nearly optimal sample complexity.
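A minimal sketch of the nearest-neighbor Q-learning update on a continuous state space: Q-values are stored on a finite set of anchor states and every observed state is mapped to its nearest anchor, which stands in for the covering used in the paper's analysis. The anchors, step-size schedule, and discount factor are illustrative assumptions.

```python
import numpy as np

class NNQL:
    """Tabular Q-learning over nearest-neighbor anchors of a continuous space."""

    def __init__(self, anchors, n_actions, gamma=0.95):
        self.anchors = np.asarray(anchors)            # (k, d) anchor states
        self.Q = np.zeros((len(self.anchors), n_actions))
        self.N = np.zeros_like(self.Q)                # visit counts per (anchor, action)
        self.gamma = gamma

    def _nn(self, s):
        """Index of the anchor nearest to state s."""
        return int(np.argmin(np.linalg.norm(self.anchors - s, axis=1)))

    def update(self, s, a, r, s_next):
        i, j = self._nn(s), self._nn(s_next)
        self.N[i, a] += 1
        alpha = 1.0 / self.N[i, a]                    # decaying step size
        target = r + self.gamma * self.Q[j].max()     # nearest-neighbor bootstrap
        self.Q[i, a] += alpha * (target - self.Q[i, a])

    def act(self, s):
        """Greedy action at the anchor nearest to s."""
        return int(np.argmax(self.Q[self._nn(s)]))
```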
- Summary: Malaria is an infectious disease affecting a large population across the world, and interventions need to be efficiently applied to reduce the burden of malaria. We develop a framework to help policy-makers decide how to allocate limited resources in real time for malaria control. We formalize a policy for the resource allocation as a sequence of decisions, one per intervention, that map up-to-date disease-related information to a resource allocation. An optimal policy must control the spread of the disease while being interpretable and viewed as equitable to stakeholders. We construct an interpretable class of resource allocation policies that can accommodate allocation of resources residing in a continuous domain, and we combine a hierarchical Bayesian spatiotemporal model for disease transmission with a policy-search algorithm to estimate an optimal policy for resource allocation within the pre-specified class. The estimated optimal policy under the proposed framework improves the cumulative long-term outcome compared with naive approaches in both simulation experiments and an application to malaria interventions in the Democratic Republic of the Congo.
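A hedged sketch of the policy-search step: the interpretable class below splits a continuous budget across regions in proportion to a softmax of linear priority scores of their current disease features, and the score weights are chosen by simulating long-run outcomes. The function simulate_outcome stands in for posterior-predictive simulation from the fitted hierarchical transmission model and is assumed here, not shown.

```python
import numpy as np

def allocate(weights, features, budget):
    """Interpretable policy: budget shares via a softmax of linear scores."""
    scores = features @ weights                # one priority score per region
    shares = np.exp(scores - scores.max())     # numerically stable softmax
    return budget * shares / shares.sum()

def policy_search(features, budget, simulate_outcome, n_candidates=500, seed=0):
    """Random search over score weights within the interpretable class.

    simulate_outcome(policy) -> expected cumulative disease burden (assumed).
    """
    rng = np.random.default_rng(seed)
    best_w, best_val = None, np.inf
    for _ in range(n_candidates):
        w = rng.normal(size=features.shape[1])
        val = simulate_outcome(lambda f: allocate(w, f, budget))
        if val < best_val:                     # keep the lowest simulated burden
            best_w, best_val = w, val
    return best_w
```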