skip to main content


Title: Softmax Policy Gradient Methods Can Take Exponential Time to Converge
Award ID(s):
2106778 2007911
NSF-PAR ID:
10340932
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Proceedings of Thirty Fourth Conference on Learning Theory
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
No document suggestions found