This content will become publicly available on June 30, 2024
Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence
- NSF-PAR ID:
- 10424937
- Date Published:
- Journal Name:
- SIAM Journal on Optimization
- Volume:
- 33
- Issue:
- 2
- ISSN:
- 1052-6234
- Page Range / eLocation ID:
- 1061 to 1091
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
No document suggestions found