Yin, Ming, Bai, Yu, and Wang, Yu-Xiang. Near Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning. Retrieved from https://par.nsf.gov/biblio/10298757. Proceedings of Machine Learning Research 130.