Unified algorithms for RL with Decision-Estimation Coefficients: PAC, reward-free, preference-based learning and beyond | NSF Public Access Repository

Citation Details

This content will become publicly available on February 1, 2026

Unified algorithms for RL with Decision-Estimation Coefficients: PAC, reward-free, preference-based learning and beyond

Award ID(s): 2315725; 2339904

PAR ID: 10580898

Author(s) / Creator(s): Chen, Fan; Mei, Song; Bai, Yu

Publisher / Repository: Annals of Statistics

Date Published: 2025-02-01

Journal Name: The Annals of Statistics

Volume: 53

Issue: 1

ISSN: 0090-5364

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Journal Article:
https://doi.org/10.1214/24-AOS2483