Deng, Zihao, Devic, Siddartha, and Juba, Brendan. Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions. Retrieved from https://par.nsf.gov/biblio/10357218. Proceedings of Machine Learning Research 151.
Deng, Zihao, Devic, Siddartha, & Juba, Brendan. Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions. Proceedings of Machine Learning Research, 151 (). Retrieved from https://par.nsf.gov/biblio/10357218.
Deng, Zihao, Devic, Siddartha, and Juba, Brendan.
"Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions". Proceedings of Machine Learning Research 151 (). Country unknown/Code not available. https://par.nsf.gov/biblio/10357218.
@article{osti_10357218,
place = {Country unknown/Code not available},
title = {Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions},
url = {https://par.nsf.gov/biblio/10357218},
abstractNote = {},
journal = {Proceedings of Machine Learning Research},
volume = {151},
author = {Deng, Zihao and Devic, Siddartha and Juba, Brendan},
}
Warning: Leaving National Science Foundation Website
You are now leaving the National Science Foundation website to go to a non-government website.
Website:
NSF takes no responsibility for and exercises no control over the views expressed or the accuracy of
the information contained on this site. Also be aware that NSF's privacy policy does not apply to this site.