Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
- Award ID(s): 1741341
- PAR ID: 10204911
- Date Published:
- Journal Name: Proceedings of Machine Learning Research
- Volume: 119
- ISSN: 2640-3498
- Page Range / eLocation ID: 4860-4869
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation

