On The Statistical Complexity of Offline Decision-Making

Nguyen-Tang, Thanh; Arora, Raman

Citation Details

We study the statistical complexity of offline decision-making with function approximation, establishing (near) minimax-optimal rates for stochastic contextual bandits and Markov decision processes. The performance limits are captured by the pseudo-dimension of the (value) function class and a new characterization of the behavior policy that strictly subsumes all the previous notions of data coverage in the offline decision-making literature. In addition, we seek to understand the benefits of using offline data in online decisionmaking and show nearly minimax-optimal rates in a wide range of regimes. more »

Award ID(s):: 1943251

PAR ID:: 10572972

Author(s) / Creator(s):: Nguyen-Tang, Thanh; Arora, Raman

Publisher / Repository:: Proceedings of the 41st International Conference on Machine Learning, PMLR 235, 2024

Date Published:: 2024-07-01

ISSN:: 2640-3498

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this