Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation
- Award ID(s):
- 2112471
- NSF-PAR ID:
- 10326121
- Date Published:
- Journal Name:
- The 25th International Conference on Artificial Intelligence and Statistics (AISTATS)
- Volume:
- 151
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation