Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods
- Award ID(s): 1750483
- Publication Date:
- NSF-PAR ID: 10174466
- Journal Name: Proceedings of the Conference on Robot Learning
- Sponsoring Org: National Science Foundation