Reward-Agnostic Fine-Tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning | NSF Public Access Repository

skip to main content

An official website of the United States government Here's how you know

Official websites use .gov

A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS

A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Citation Details

Reward-Agnostic Fine-Tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning

Award ID(s):: 2218713 2221009 2218773 2014279 1907661

PAR ID:: 10516496

Author(s) / Creator(s):: Li, Gen; Zhan, Wenhao; Lee, Jason; Chi, Yuejie; Chen, Yuxin

Publisher / Repository:: Neural Information Processing Systems

Date Published:: 2023-12-31

Journal Name:: Neural Information Processing Systems

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Conference Paper:
The DOI is not currently available.