Reinforcement learning: a comparison of UCB versus alternative adaptive policies
- Award ID(s):
- 1662629
- PAR ID:
- 10669717
- Publisher / Repository:
- De Gruyter
- Date Published:
- Page Range / eLocation ID:
- 127 to 138
- Subject(s) / Keyword(s):
- Reinforcement Learning, UCB policies, Thompson Sampling
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
No document suggestions found
An official website of the United States government

