Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence | NSF Public Access Repository

skip to main content

An official website of the United States government Here's how you know

Official websites use .gov

A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS

A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Citation Details

Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence

Award ID(s):: 2221009 2106739 1907661 2106778 2134080 2007911 2148212 1901199

PAR ID:: 10424937

Author(s) / Creator(s):: Zhan, Wenhao; Cen, Shicong; Huang, Baihe; Chen, Yuxin; Lee, Jason D.; Chi, Yuejie

Date Published:: 2023-06-30

Journal Name:: SIAM Journal on Optimization

Volume:: 33

Issue:: 2

ISSN:: 1052-6234

Page Range / eLocation ID:: 1061 to 1091

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1137/21M1456789