Title: Meta-learning linear quadratic regulators: a policy gradient MAML approach for model-free LQR
We investigate the problem of learning linear quadratic regulators (LQR) in a multi-task, heterogeneous, and model-free setting. We characterize the stability and personalization guarantees of a policy gradient-based (PG) model-agnostic meta-learning (MAML) (Finn et al., 2017) approach for the LQR problem under different task-heterogeneity settings. We show that our MAML-LQR algorithm produces a stabilizing controller close to each task-specific optimal controller up to a task-heterogeneity bias in both model-based and model-free learning scenarios. Moreover, in the model-based setting, we show that such a controller is achieved with a linear convergence rate, which improves upon sub-linear rates from existing work. Our theoretical guarantees demonstrate that the learned controller can efficiently adapt to unseen LQR tasks.
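To make the update structure concrete, below is a minimal model-based sketch of a first-order MAML-style policy gradient loop over LQR tasks: each task takes one inner adaptation step from the shared gain, and the shared gain is then updated with the averaged post-adaptation gradients. Exact discrete-time LQR gradients are used here purely for illustration; the paper's model-free algorithm instead relies on zeroth-order gradient estimates, and the example systems, step sizes, and first-order meta-gradient approximation below are assumptions, not the paper's exact algorithm.

```python
# Minimal model-based sketch of a first-order MAML update over LQR tasks.
# Illustrative only: the paper's MAML-LQR is model-free (zeroth-order gradients).
import numpy as np
from scipy.linalg import solve_discrete_lyapunov, solve_discrete_are

def lqr_cost_and_grad(A, B, Q, R, K):
    """Cost C(K) = tr(P_K) and exact policy gradient for u = -K x, x0 ~ N(0, I)."""
    M = A - B @ K
    assert np.max(np.abs(np.linalg.eigvals(M))) < 1, "K must be stabilizing"
    P = solve_discrete_lyapunov(M.T, Q + K.T @ R @ K)   # value matrix P_K
    S = solve_discrete_lyapunov(M, np.eye(A.shape[0]))  # state correlation Sigma_K
    grad = 2 * ((R + B.T @ P @ B) @ K - B.T @ P @ A) @ S
    return np.trace(P), grad

# Hypothetical heterogeneous tasks: small perturbations of a nominal system.
rng = np.random.default_rng(0)
n, m = 2, 1
A0 = np.array([[1.0, 0.1], [0.0, 1.0]])
B0 = np.array([[0.0], [0.1]])
Q, R = np.eye(n), np.eye(m)
tasks = [(A0 + 0.01 * rng.standard_normal((n, n)), B0) for _ in range(5)]

# Initialize at the nominal LQR gain (stabilizing for all nearby tasks).
P0 = solve_discrete_are(A0, B0, Q, R)
K = np.linalg.solve(R + B0.T @ P0 @ B0, B0.T @ P0 @ A0)

alpha, beta = 1e-3, 1e-3      # inner (adaptation) and outer (meta) step sizes
for it in range(200):
    meta_grad = np.zeros_like(K)
    for A, B in tasks:
        _, g = lqr_cost_and_grad(A, B, Q, R, K)
        K_adapted = K - alpha * g                # one inner PG step per task
        _, g_adapted = lqr_cost_and_grad(A, B, Q, R, K_adapted)
        meta_grad += g_adapted / len(tasks)      # first-order MAML meta-gradient
    K = K - beta * meta_grad                     # meta-update of the shared gain
```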
Award ID(s):
2231350
PAR ID:
10542303
Author(s) / Creator(s):
Editor(s):
Abate, A; Cannon, M; Margellos, K; Papachristodoulou, A
Publisher / Repository:
Proceedings of Machine Learning Research (PMLR)
Date Published:
Volume:
242
Page Range / eLocation ID:
902-915
Format(s):
Medium: X
Location:
Oxford, United Kingdom
Sponsoring Org:
National Science Foundation
More Like this
  1. It is well-known that linear quadratic regulators (LQR) enjoy guaranteed stability margins, whereas linear quadratic Gaussian regulators (LQG) do not. In this letter, we consider systems and compensators defined over directed acyclic graphs. In particular, there are multiple decision-makers, each with access to a different part of the global state. In this setting, the optimal LQR compensator is dynamic, similar to classical LQG. We show that when sub-controller input costs are decoupled (but there is possible coupling between sub-controller state costs), the decentralized LQR compensator enjoys similar guaranteed stability margins to classical LQR. However, these guarantees disappear when cost coupling is introduced. 
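For context on the classical guarantee this result extends, the short sketch below numerically checks the textbook continuous-time LQR gain margin: scaling the input gain by any factor greater than 1/2 preserves closed-loop stability. The example system is made up, and the letter's decentralized, DAG-structured setting is not reproduced here.

```python
# Numerical illustration of the classical centralized LQR gain margin:
# scaling the loop gain by any k > 1/2 keeps the closed loop stable.
import numpy as np
from scipy.linalg import solve_continuous_are

A = np.array([[0.0, 1.0], [2.0, -1.0]])   # open-loop unstable example
B = np.array([[0.0], [1.0]])
Q, R = np.eye(2), np.eye(1)

P = solve_continuous_are(A, B, Q, R)
K = np.linalg.solve(R, B.T @ P)           # optimal LQR gain, u = -K x

for k in [0.6, 1.0, 5.0, 50.0]:           # loop-gain perturbations, all > 1/2
    eigs = np.linalg.eigvals(A - k * B @ K)
    print(f"gain scale {k:5.1f}: max Re(eig) = {eigs.real.max():.3f}")  # stays < 0
```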
  2. Common reinforcement learning methods seek optimal controllers for unknown dynamical systems by searching in the "policy" space directly. A recent line of research, starting with [1], aims to provide theoretical guarantees for such direct policy-update methods by exploring their performance in classical control settings, such as the infinite-horizon linear quadratic regulator (LQR) problem. A key property these analyses rely on is that the LQR cost function satisfies the "gradient dominance" property with respect to the policy parameters. Gradient dominance helps guarantee that the optimal controller can be found by running gradient-based algorithms on the LQR cost. The gradient dominance property has so far been verified on a case-by-case basis for several control problems, including continuous- and discrete-time LQR, LQR with decentralized controllers, and H2/H∞ robust control. In this paper, we make a connection between this line of work and classical convex parameterizations based on linear matrix inequalities (LMIs). Using this, we propose a unified framework for showing that gradient dominance indeed holds for a broad class of control problems, such as continuous- and discrete-time LQR, minimizing the L2 gain, and problems using system-level parameterization. Our unified framework provides insights into the landscape of the cost function as a function of the policy, and enables extending convergence results for policy gradient descent to a much larger class of problems.
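As a concrete instance of the direct policy search these analyses study, the sketch below runs plain gradient descent on the discrete-time LQR cost using the exact policy gradient; gradient dominance is what guarantees that this non-convex search still converges to the global optimum. The system, step size, and initialization are illustrative assumptions.

```python
# Direct policy gradient descent on the discrete-time LQR cost, with the
# optimality gap to the Riccati solution shrinking toward zero.
import numpy as np
from scipy.linalg import solve_discrete_lyapunov, solve_discrete_are

A = np.array([[0.9, 0.2], [0.0, 0.8]])    # stable, so K = 0 is a valid start
B = np.array([[0.0], [1.0]])
Q, R = np.eye(2), np.eye(1)

def cost_and_grad(K):
    M = A - B @ K
    assert np.max(np.abs(np.linalg.eigvals(M))) < 1, "iterate left stabilizing set"
    P = solve_discrete_lyapunov(M.T, Q + K.T @ R @ K)   # Bellman value matrix P_K
    S = solve_discrete_lyapunov(M, np.eye(2))           # state correlation Sigma_K
    return np.trace(P), 2 * ((R + B.T @ P @ B) @ K - B.T @ P @ A) @ S

C_star = np.trace(solve_discrete_are(A, B, Q, R))       # globally optimal cost

K, eta = np.zeros((1, 2)), 1e-3                         # illustrative step size
for t in range(2000):
    C, G = cost_and_grad(K)
    K -= eta * G                                        # gradient step in policy space
    if t % 500 == 0:
        print(f"iter {t:4d}: optimality gap {C - C_star:.2e}")
```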
  3. We study a model-free federated linear quadratic regulator (LQR) problem where M agents with unknown, distinct yet similar dynamics collaboratively learn an optimal policy to minimize an average quadratic cost while keeping their data private. To exploit the similarity of the agents' dynamics, we propose to use federated learning (FL) to allow the agents to periodically communicate with a central server to train policies by leveraging a larger dataset from all the agents. With this setup, we seek to understand the following questions: (i) Is the learned common policy stabilizing for all agents? (ii) How close is the learned common policy to each agent's own optimal policy? (iii) Can each agent learn its own optimal policy faster by leveraging data from all agents? To answer these questions, we propose a federated and model-free algorithm named FedLQR. Our analysis overcomes numerous technical challenges, such as heterogeneity in the agents' dynamics, multiple local updates, and stability concerns. We show that FedLQR produces a common policy that, at each iteration, is stabilizing for all agents. We provide bounds on the distance between the common policy and each agent's local optimal policy. Furthermore, we prove that when learning each agent's optimal policy, FedLQR achieves a sample complexity reduction proportional to the number of agents M in a low-heterogeneity regime, compared to the single-agent setting. 
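The sketch below illustrates only the communication pattern described in the abstract: each agent performs a few local policy-gradient steps on its own LQR cost and a central server averages the resulting gains. Exact model-based gradients are used for brevity, whereas FedLQR itself is model-free with zeroth-order estimates and additional stability safeguards; the agent dynamics and hyperparameters are made-up assumptions.

```python
# Structural sketch of a federated LQR policy-gradient loop:
# local updates on each agent's own cost, then server-side averaging.
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

rng = np.random.default_rng(1)
n, m, M_agents = 2, 1, 4
A0 = np.array([[0.9, 0.2], [0.0, 0.8]])
B0 = np.array([[0.0], [1.0]])
Q, R = np.eye(n), np.eye(m)
# Similar-but-distinct agent dynamics (low-heterogeneity regime).
agents = [(A0 + 0.02 * rng.standard_normal((n, n)), B0) for _ in range(M_agents)]

def grad(A, B, K):
    M = A - B @ K
    P = solve_discrete_lyapunov(M.T, Q + K.T @ R @ K)
    S = solve_discrete_lyapunov(M, np.eye(n))
    return 2 * ((R + B.T @ P @ B) @ K - B.T @ P @ A) @ S

K = np.zeros((m, n))            # stabilizing for all agents, since each A is stable
eta, local_steps = 1e-3, 5
for round_ in range(50):        # communication rounds
    local_gains = []
    for A, B in agents:
        K_local = K.copy()
        for _ in range(local_steps):          # local policy-gradient updates
            K_local -= eta * grad(A, B, K_local)
        local_gains.append(K_local)
    K = np.mean(local_gains, axis=0)          # server averages the local gains
```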
  4. This paper studies data-driven optimal control design for the traffic signals of oversaturated urban road networks. The signal control system based on the store-and-forward model is generally uncontrollable, so a controllable decomposition is needed. Instead of identifying unknown parameters such as saturation rates and turning ratios, a finite number of measured trajectories can be used to parametrize the system and to directly construct a transformation matrix for the Kalman controllable decomposition through the fundamental lemma of J. C. Willems. On top of that, an infinite-horizon linear quadratic regulator (LQR) problem is formulated that accounts for the green-time constraints on traffic signals. The problem can be solved through a two-phase data-driven learning process, where one phase solves an infinite-horizon unconstrained LQR problem and the other solves a finite-horizon constrained LQR problem. Simulation results show that the theoretical analysis is effective and that the proposed data-driven controller yields the desired performance in reducing traffic congestion.
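As a simplified illustration of building a controllable decomposition directly from data, the sketch below collects a state trajectory from a zero initial condition (so every visited state lies in the controllable subspace) and takes an orthonormal basis of the state-data matrix as the transformation for a Kalman controllable decomposition. The uncontrollable example system is made up and is used only to simulate data; the paper's store-and-forward traffic model, green-time constraints, and two-phase LQR learning are not reproduced.

```python
# Data-driven Kalman controllable decomposition: states reached from x0 = 0 span
# the controllable subspace when the inputs are sufficiently exciting.
import numpy as np

rng = np.random.default_rng(2)
# Uncontrollable example: the third state is unaffected by the input.
A = np.array([[0.8, 0.1, 0.0],
              [0.0, 0.7, 0.0],
              [0.0, 0.0, 0.9]])
B = np.array([[1.0], [0.5], [0.0]])
n, m, T = 3, 1, 50

# Collect a state trajectory from x0 = 0 under random (exciting) inputs.
X = np.zeros((n, T + 1))
U = rng.standard_normal((m, T))
for t in range(T):
    X[:, t + 1] = A @ X[:, t] + B @ U[:, t]

# Orthonormal basis of the reachable (controllable) subspace, straight from data.
U_svd, s, _ = np.linalg.svd(X[:, 1:], full_matrices=True)
r = int(np.sum(s > 1e-8 * s[0]))          # numerical rank = controllable dimension
Tmat = U_svd                               # transformation for the decomposition
A_dec = Tmat.T @ A @ Tmat                  # block upper-triangular in new coordinates
B_dec = Tmat.T @ B                         # zero rows in the uncontrollable block
print("controllable dimension:", r)        # expected: 2
```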
  5. We consider the problem of controlling a Linear Quadratic Regulator (LQR) system over a finite horizon T with fixed and known cost matrices Q, R, but unknown and non-stationary dynamics A_t, B_t. The sequence of dynamics matrices can be arbitrary, but with a total variation V_T that is assumed to be o(T) and unknown to the controller. Under the assumption that a sequence of stabilizing, but potentially sub-optimal, controllers is available for all t, we present an algorithm that achieves the optimal dynamic regret of O(V_T^{2/5} T^{3/5}). With piecewise constant dynamics, our algorithm achieves the optimal regret of O(√(ST)), where S is the number of switches. The crux of our algorithm is an adaptive non-stationarity detection strategy, which builds on an approach recently developed for contextual multi-armed bandit problems. We also argue that non-adaptive forgetting (e.g., restarting or using sliding-window learning with a static window size) may not be regret optimal for the LQR problem, even when the window size is optimally tuned with knowledge of V_T. The main technical challenge in the analysis of our algorithm is to prove that the ordinary least squares (OLS) estimator has a small bias when the parameter to be estimated is non-stationary. Our analysis also highlights that the key motif driving the regret is that the LQR problem is in spirit a bandit problem with linear feedback and locally quadratic cost. This motif is more universal than the LQR problem itself, and therefore we believe our results should find wider application.
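To ground the estimation primitive this analysis revolves around, the sketch below runs ordinary least squares on state/input data collected under slowly drifting dynamics, with periodic restarts as a simple non-adaptive forgetting mechanism. The drift model, window length, and noise level are illustrative assumptions; the paper's adaptive non-stationarity detection and regret-optimal tuning are not reproduced.

```python
# OLS estimation of drifting dynamics [A_t B_t] with periodic restarts, so that
# drift does not accumulate in the estimate (non-adaptive forgetting).
import numpy as np

rng = np.random.default_rng(3)
n, m, T, window = 2, 1, 600, 150
B = np.array([[0.0], [1.0]])

def A_t(t):            # slowly drifting dynamics (total variation o(T))
    return np.array([[0.7 + 0.2 * t / T, 0.1], [0.0, 0.6]])

x = np.zeros(n)
data = []              # (x_t, u_t, x_{t+1}) triples within the current window
for t in range(T):
    u = rng.standard_normal(m)                       # exploratory input
    x_next = A_t(t) @ x + B @ u + 0.01 * rng.standard_normal(n)
    data.append((x.copy(), u.copy(), x_next.copy()))
    x = x_next
    if (t + 1) % window == 0:                        # restart: estimate, then forget
        Z = np.array([np.concatenate([xi, ui]) for xi, ui, _ in data])  # (N, n+m)
        Y = np.array([xn for _, _, xn in data])                          # (N, n)
        Theta, *_ = np.linalg.lstsq(Z, Y, rcond=None)                    # OLS fit
        A_hat, B_hat = Theta.T[:, :n], Theta.T[:, n:]
        err = np.linalg.norm(A_hat - A_t(t), "fro")
        print(f"window ending t={t+1}: ||A_hat - A_t||_F = {err:.3f}")
        data = []                                    # drop the old window's data
```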