skip to main content

Title: Softmax Policy Gradient Methods Can Take Exponential Time to Converge
Authors:
; ; ; ;
Award ID(s):
2106778 2007911
Publication Date:
NSF-PAR ID:
10340932
Journal Name:
Proceedings of Thirty Fourth Conference on Learning Theory
Sponsoring Org:
National Science Foundation
More Like this
No document suggestions found