Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property
- Award ID(s): 2312342
- PAR ID: 10517573
- Publisher / Repository: AAAI
- Date Published:
- Format(s): Medium: X
- Location: Vancouver, Canada
- Sponsoring Org: National Science Foundation