Duality between estimation and optimal control is a problem of rich historical significance. The first duality principle appears in the seminal paper of Kalman and Bucy, where the problem of minimum variance estimation is shown to be dual to a linear quadratic (LQ) optimal control problem. Duality offers a constructive proof technique to derive the Kalman filter equation from the optimal control solution. This paper generalizes the classical duality result of Kalman-Bucy to the nonlinear filter: the state evolves as a continuous-time Markov process and the observation is a nonlinear function of the state corrupted by additive Gaussian noise. A dual process is introduced as a backward stochastic differential equation (BSDE). The process is used to transform the problem of minimum variance estimation into an optimal control problem. Its solution is obtained from an application of the maximum principle and is subsequently used to derive the equation of the nonlinear filter. The classical duality result of Kalman-Bucy is shown to be a special case.
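For context, the classical Kalman-Bucy duality referenced in this abstract admits a compact statement; the notation below is a standard textbook form and is an assumption made for illustration, not taken from the paper itself.

```latex
% Classical Kalman-Bucy duality, in a standard textbook form (notation assumed
% here for illustration). Linear-Gaussian estimation model:
\[
\mathrm{d}X_t = A X_t\,\mathrm{d}t + \mathrm{d}B_t, \qquad
\mathrm{d}Z_t = H X_t\,\mathrm{d}t + \mathrm{d}W_t .
\]
% Dual LQ control problem: for a fixed vector f, the control u_t drives a
% backward (co-state) ODE, and the optimal cost equals the minimum error
% variance for estimating f^\top X_T from the observation path Z:
\[
-\frac{\mathrm{d}y_t}{\mathrm{d}t} = A^{\top} y_t + H^{\top} u_t, \quad y_T = f, \qquad
J(u) = y_0^{\top}\Sigma_0\,y_0
 + \int_0^T \bigl( y_t^{\top} Q\,y_t + u_t^{\top} R\,u_t \bigr)\,\mathrm{d}t ,
\]
% where Q and R are the process- and measurement-noise covariances and
% \Sigma_0 is the prior covariance of X_0. Minimizing J over u yields the
% Kalman-Bucy gain.
```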
Duality-Based Stochastic Policy Optimization for Estimation with Unknown Noise Covariances
Duality of control and estimation allows mapping recent advances in data-guided control to the estimation setup. This paper formalizes and utilizes such a mapping to consider learning the optimal (steady-state) Kalman gain when the process and measurement noise statistics are unknown. Specifically, building on the duality between synthesizing optimal control and estimation gains, the filter design problem is formalized as direct policy learning. In this direction, the duality is used to extend existing theoretical guarantees of direct policy updates for the Linear Quadratic Regulator (LQR) to establish global convergence of the Gradient Descent (GD) algorithm for the estimation problem, while addressing subtle differences between the two synthesis problems. Subsequently, a Stochastic Gradient Descent (SGD) approach is adopted to learn the optimal Kalman gain without knowledge of the noise covariances. The results are illustrated via several numerical examples.
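As a rough, hypothetical sketch of the direct policy-learning viewpoint described above (not the paper's implementation; the system matrices, noise levels, and the two-point zeroth-order gradient estimator are assumptions made for illustration), one can treat the steady-state filter gain as the policy and descend the empirical one-step prediction error with SGD:

```python
# Hypothetical sketch (not the paper's code): learn a steady-state filter gain L
# for x_{t+1} = A x_t + w_t,  y_t = C x_t + v_t  directly from simulated
# measurements, with the noise covariances Qw, Rv hidden from the learner.
import numpy as np

rng = np.random.default_rng(0)
A = np.array([[0.9, 0.1], [0.0, 0.8]])      # assumed system matrices
C = np.array([[1.0, 0.0]])
Qw, Rv = 0.1 * np.eye(2), 0.05 * np.eye(1)  # "true" covariances, unknown to the learner

def rollout(T):
    """Simulate one trajectory of measurements from the true (unknown) system."""
    x, ys = np.zeros(2), []
    for _ in range(T):
        x = A @ x + rng.multivariate_normal(np.zeros(2), Qw)
        ys.append(C @ x + rng.multivariate_normal(np.zeros(1), Rv))
    return ys

def pred_error(L, ys):
    """Mean squared one-step prediction error of xhat' = A xhat + L (y - C xhat)."""
    xhat, err = np.zeros(2), 0.0
    for y in ys:
        e = y - C @ xhat
        err += float(e @ e)
        xhat = A @ xhat + L @ e
    return err / len(ys)

L = np.zeros((2, 1))                  # the "policy": a candidate filter gain
step, delta = 0.05, 1e-2
for k in range(300):                  # SGD with a two-point zeroth-order gradient
    ys = rollout(T=100)
    U = rng.standard_normal(L.shape)  # random perturbation direction
    g = (pred_error(L + delta * U, ys) - pred_error(L - delta * U, ys)) / (2 * delta) * U
    L -= step * g
print("learned gain L:\n", L)
```

The point of the sketch is only the shape of the method: the gain is the decision variable, and the noise statistics enter only through sampled data rather than through known covariance matrices.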
- Award ID(s): 2149470
- PAR ID: 10422993
- Date Published:
- Journal Name: Proceedings of the American Control Conference
- ISSN: 0743-1619
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
-
Common reinforcement learning methods seek optimal controllers for unknown dynamical systems by searching in the "policy" space directly. A recent line of research, starting with [1], aims to provide theoretical guarantees for such direct policy-update methods by exploring their performance in classical control settings, such as the infinite-horizon linear quadratic regulator (LQR) problem. A key property these analyses rely on is that the LQR cost function satisfies the "gradient dominance" property with respect to the policy parameters. Gradient dominance helps guarantee that the optimal controller can be found by running gradient-based algorithms on the LQR cost. The gradient dominance property has so far been verified on a case-by-case basis for several control problems, including continuous/discrete-time LQR, LQR with a decentralized controller, and H2/H∞ robust control. In this paper, we make a connection between this line of work and classical convex parameterizations based on linear matrix inequalities (LMIs). Using this, we propose a unified framework for showing that gradient dominance indeed holds for a broad class of control problems, such as continuous- and discrete-time LQR, minimizing the L2 gain, and problems using system-level parameterization. Our unified framework provides insights into the landscape of the cost function as a function of the policy, and enables extending convergence results for policy gradient descent to a much larger class of problems.
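For reference, the gradient dominance property invoked above is a Polyak-Lojasiewicz-type inequality; the statement below is the generic standard form, with a constant mu and notation that are not specific to this paper.

```latex
% Gradient dominance (Polyak-Lojasiewicz-type) inequality for the LQR cost J(K)
% over the set of stabilizing feedback gains K, for some constant mu > 0:
\[
J(K) - J(K^{\star}) \;\le\; \frac{1}{\mu}\,\bigl\lVert \nabla J(K) \bigr\rVert_F^{2},
\]
% so gradient descent, K_{k+1} = K_k - \eta \nabla J(K_k), converges to the
% global minimizer K^{\star} at a linear rate even though J is non-convex in K.
```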
-
We investigate the problem of persistently monitoring a finite set of targets with internal states that evolve with linear stochastic dynamics using a finite set of mobile agents. We approach the problem from the infinite-horizon perspective, looking for periodic movement schedules for the agents. Under linear dynamics and some standard assumptions on the noise distribution, the optimal estimator is a Kalman-Bucy filter. It is shown that when the agents are constrained to move only over a line and they can see at most one target at a time, the optimal movement policy is such that the agent is always either moving with maximum speed or dwelling at a fixed position. Periodic trajectories of this form admit a finite parameterization, and we show how to compute a stochastic gradient estimate of the performance with respect to the parameters that define the trajectory using Infinitesimal Perturbation Analysis. A gradient-descent scheme is used to compute locally optimal parameters. This approach allows us to deal with a very long persistent monitoring horizon using a small number of parameters.
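The following toy sketch illustrates the move/dwell parameterization and the gradient-descent step described above for a single agent and two scalar targets. Everything here (positions, target dynamics, the sensing model, and the finite-difference gradient standing in for the Infinitesimal Perturbation Analysis estimator) is an assumption made for illustration, not the paper's algorithm.

```python
# Toy sketch: a periodic move/dwell trajectory on a line, parameterized by the
# dwell times at two target locations. The monitoring cost is the time-averaged
# estimation-error variance of scalar targets tracked with Kalman-Bucy-type
# covariance updates. A finite-difference gradient stands in for IPA.
import numpy as np

POS = np.array([0.0, 1.0])      # target positions on the line (assumed)
V, R_SENSE = 1.0, 0.2           # agent max speed and sensing radius (assumed)
A_T, Q_T, R_T = 0.1, 1.0, 0.5   # scalar target drift, process and sensing noise (assumed)
DT = 0.01

def avg_variance(dwell, cycles=10):
    """Average error variance over a periodic dwell/move schedule."""
    travel = abs(POS[1] - POS[0]) / V
    period = dwell.sum() + 2 * travel
    P = np.ones(2)               # error variances of the two targets
    total, steps, t = 0.0, 0, 0.0
    for _ in range(int(cycles * period / DT)):
        tau = t % period
        # schedule: dwell at target 0, travel, dwell at target 1, travel back
        if tau < dwell[0]:
            x = POS[0]
        elif tau < dwell[0] + travel:
            x = POS[0] + V * (tau - dwell[0])
        elif tau < dwell[0] + travel + dwell[1]:
            x = POS[1]
        else:
            x = POS[1] - V * (tau - dwell[0] - travel - dwell[1])
        for i in range(2):
            P[i] += DT * (2 * A_T * P[i] + Q_T)     # prediction (Riccati drift)
            if abs(x - POS[i]) <= R_SENSE:          # target visible: correction
                P[i] -= DT * P[i] ** 2 / R_T
        total += P.sum()
        steps += 1
        t += DT
    return total / steps

dwell = np.array([1.0, 1.0])     # trajectory parameters: dwell time at each target
eps, step = 0.05, 0.2
for k in range(30):              # gradient descent on the dwell times
    g = np.zeros(2)
    for i in range(2):
        e = np.zeros(2)
        e[i] = eps
        g[i] = (avg_variance(dwell + e) - avg_variance(dwell - e)) / (2 * eps)
    dwell = np.maximum(dwell - step * g, 0.0)
print("locally optimal dwell times:", dwell)
```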
-
In this paper, we study the robustness property of policy optimization (particularly the Gauss–Newton gradient descent algorithm, which is equivalent to policy iteration in reinforcement learning) subject to noise at each iteration. By invoking the concept of input-to-state stability and utilizing Lyapunov's direct method, it is shown that, if the noise is sufficiently small, the policy iteration algorithm converges to a small neighborhood of the optimal solution even in the presence of noise at each iteration. Explicit expressions for the upper bound on the noise and the size of the neighborhood to which the policies ultimately converge are provided. Based on Willems' fundamental lemma, a learning-based policy iteration algorithm is proposed. The persistent excitation condition can be readily guaranteed by checking the rank of the Hankel matrix related to an exploration signal. The robustness of the learning-based policy iteration to measurement noise and unknown system disturbances is theoretically demonstrated by the input-to-state stability of the policy iteration. Several numerical simulations are conducted to demonstrate the efficacy of the proposed method.
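The rank test mentioned above is simple to state concretely. The sketch below (generic notation; the signal length and excitation order are arbitrary choices, not taken from the paper) checks whether an exploration input is persistently exciting of a given order by forming its block-Hankel matrix, in the sense of Willems' fundamental lemma.

```python
# Minimal sketch: check persistent excitation of order L for an exploration
# input by testing the rank of its block-Hankel matrix.
import numpy as np

def block_hankel(u, L):
    """Stack length-L windows of the input sequence u (shape T x m) as columns."""
    T, m = u.shape
    cols = T - L + 1
    return np.column_stack([u[j:j + L].reshape(-1) for j in range(cols)])

rng = np.random.default_rng(1)
m, T, L = 1, 60, 8
u = rng.standard_normal((T, m))          # random exploration signal
H = block_hankel(u, L)
print("persistently exciting of order", L, ":", np.linalg.matrix_rank(H) == m * L)
```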
-
Distributed feedback design and complexity-constrained control are examples of problems posed within the domain of structured optimal feedback synthesis. The optimal feedback gain is typically a non-convex function of system primitives. However, in recent years, algorithms have been proposed to obtain locally optimal solutions. In applications to large-scale distributed control, the major obstacle is computational complexity. This paper addresses complexity through a combination of linear-algebraic techniques and computational methods adapted from both machine learning and reinforcement learning. It is shown that for general classes of optimal control problems, the objective function and its gradient can be computed from data. Transformations borrowed from the theory of reinforcement learning are adapted to obtain simulation-based algorithms for computing the structured optimal H2 feedback gain. Customized proximal algorithms based on gradient descent and incremental gradient are tested in computational experiments and their relative merits are discussed.
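As a generic illustration of a proximal gradient step for structured H2 feedback design (not the paper's customized algorithm; the problem data, sparsity weight, and step size below are made up), one can alternate a gradient step on the closed-loop H2 cost with a soft-thresholding step that promotes a sparse gain:

```python
# Generic sketch: proximal gradient for a sparsity-promoting H2 design
#   min_K  J(K) + gamma * ||K||_1 ,  J(K) = closed-loop H2 cost of A - B K.
# Problem data are arbitrary; this is not the paper's implementation.
import numpy as np
from scipy.linalg import solve_continuous_lyapunov as lyap

rng = np.random.default_rng(2)
n, m = 4, 2
A = rng.standard_normal((n, n))
A -= (np.max(np.linalg.eigvals(A).real) + 0.5) * np.eye(n)  # shift to make A stable
B = rng.standard_normal((n, m))
Q, R, W = np.eye(n), np.eye(m), np.eye(n)   # state/input weights, disturbance covariance

def h2_cost_and_grad(K):
    Acl = A - B @ K
    if np.max(np.linalg.eigvals(Acl).real) >= 0:   # only stabilizing gains are admissible
        return np.inf, None
    X = lyap(Acl, -W)                              # Acl X + X Acl' + W = 0
    P = lyap(Acl.T, -(Q + K.T @ R @ K))            # Acl' P + P Acl + Q + K'RK = 0
    J = np.trace((Q + K.T @ R @ K) @ X)
    grad = 2 * (R @ K - B.T @ P) @ X               # standard H2 policy gradient
    return J, grad

def soft(K, tau):                                  # prox operator of tau * ||.||_1
    return np.sign(K) * np.maximum(np.abs(K) - tau, 0.0)

K, step, gamma = np.zeros((m, n)), 1e-2, 0.1
for _ in range(300):
    J, g = h2_cost_and_grad(K)
    K_next = soft(K - step * g, step * gamma)
    if h2_cost_and_grad(K_next)[0] < np.inf:       # crude safeguard: keep iterates stabilizing
        K = K_next
print("sparse feedback gain K:\n", np.round(K, 3))
```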