Zhang, Qining, and Ying, Lei. Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis. Retrieved from https://par.nsf.gov/biblio/10547269. Reinforcement Learning Journal .
Zhang, Qining, & Ying, Lei. Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis. Reinforcement Learning Journal, (). Retrieved from https://par.nsf.gov/biblio/10547269.
Zhang, Qining, and Ying, Lei.
"Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis". Reinforcement Learning Journal (). Country unknown/Code not available: RLC. https://par.nsf.gov/biblio/10547269.
Warning: Leaving National Science Foundation Website
You are now leaving the National Science Foundation website to go to a non-government website.
Website:
NSF takes no responsibility for and exercises no control over the views expressed or the accuracy of
the information contained on this site. Also be aware that NSF's privacy policy does not apply to this site.