Breaking the log(1/Δ_2) Barrier: Better Batched Best Arm Identification with Adaptive Grids

Jin, Tianyuan; Zhang, Qin; Zhou, Dongruo

Citation Details

This content will become publicly available on April 1, 2026

Breaking the log(1/Δ_2) Barrier: Better Batched Best Arm Identification with Adaptive Grids

We investigate the problem of batched best arm identification in multi-armed bandits, where we aim to identify the best arm from a set of n arms while minimizing both the number of samples and batches. We introduce an algorithm that achieves near-optimal sample complexity and features an instance-sensitive batch complexity, which breaks the log(1/Δ_2) barrier. The main contribution of our algorithm is a novel sample allocation scheme that effectively balances exploration and exploitation for batch sizes. Experimental results indicate that our approach is more batch-efficient across various setups. We also extend this framework to the problem of batched best arm identification in linear bandits and achieve similar improvements. more »

Award ID(s):: 1844234

PAR ID:: 10574905

Author(s) / Creator(s):: Jin, Tianyuan; Zhang, Qin; Zhou, Dongruo

Publisher / Repository:: International Conference on Learning Representations (ICLR) 2025

Date Published:: 2025-04-01

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on April 1, 2026
Conference Paper:
The DOI is not currently available.

More Like this