Title: Choosing the Best Arm with Guaranteed Confidence
Vol. 16, Article 71, 2022
We consider the problem of finding, through adaptive sampling, which of n populations (arms) has the largest mean. Our objective is to determine a rule that identifies the best arm with a fixed minimum confidence using as few observations as possible. We study such problems when the population distributions are either Bernoulli or normal. We take a Bayesian approach that assumes the unknown means are the values of independent random variables having a common specified distribution. We propose to use the classical vector-at-a-time rule, which samples each remaining arm once in each round, eliminating arms whose cumulative sum falls k below that of another arm. We show how this rule can be implemented and analyzed in our Bayesian setting and how it can be improved by early elimination. We also propose and analyze a variant of the classical play-the-winner algorithm. Numerical results show that these rules perform quite well, even in cases where the set of means does not look like it comes from the specified prior.
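The elimination step of the vector-at-a-time rule described above can be sketched as follows. This is a minimal illustration, not the paper's Bayesian implementation: the function name and the representation of arms as zero-argument samplers are my own assumptions, and the Bayesian prior and early-elimination refinements are not reproduced.

```python
def vector_at_a_time(arms, k):
    """Sketch of the vector-at-a-time rule: sample every surviving arm once
    per round, and eliminate any arm whose cumulative sum falls k (or more)
    below that of another surviving arm.  `arms` is a list of zero-argument
    samplers (e.g. Bernoulli reward draws); returns the index of the last
    surviving arm."""
    alive = list(range(len(arms)))
    sums = [0.0] * len(arms)
    while len(alive) > 1:
        for i in alive:                      # one observation per surviving arm
            sums[i] += arms[i]()
        best = max(sums[i] for i in alive)
        # keep only arms within k of the current leader (the leader always stays)
        alive = [i for i in alive if best - sums[i] < k]
    return alive[0]
```

With well-separated arms the gap grows roughly linearly in the number of rounds, so elimination happens quickly; the paper's contribution is analyzing the confidence guarantee this gap parameter k buys under the Bayesian prior.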
Award ID(s):
2132759
PAR ID:
10430247
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Journal of Statistical Theory and Practice
Volume:
16
Issue:
71
ISSN:
1559-8608
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1.
    In this paper, we introduce a new online decision making paradigm that we call Thresholding Graph Bandits. The main goal is to efficiently identify a subset of arms in a multi-armed bandit problem whose means are above a specified threshold. While traditionally in such problems the arms are assumed to be independent, in our paradigm we further suppose that we have access to the similarity between the arms in the form of a graph, allowing us to gain information about the arm means in fewer samples. Such settings play a key role in a wide range of modern decision making problems where rapid decisions need to be made in spite of the large number of options available at each time. We present GrAPL, a novel algorithm for the thresholding graph bandit problem. We demonstrate theoretically that this algorithm is effective in taking advantage of the graph structure when available and of the reward function homophily (that strongly connected arms have similar rewards) when favorable. We confirm these theoretical findings via experiments on both synthetic and real data.
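The thresholding objective in the abstract above can be illustrated by a graph-free baseline: classify each arm as above or below the threshold once an anytime Hoeffding confidence interval separates from it. This is only a sketch of the objective, not GrAPL itself — GrAPL's use of the similarity graph to share information across arms is not reproduced, and the function name and interval constants are my own assumptions.

```python
import math
import random

def threshold_bandit(arms, tau, delta=0.05, rng=random.Random(0)):
    """Graph-free baseline for the thresholding objective: sample every
    undecided arm each round and classify it as above/below threshold tau
    once its Hoeffding confidence interval no longer contains tau.
    `arms` is a list of samplers taking an rng; returns the set of arm
    indices declared above tau."""
    n = len(arms)
    counts, sums = [0] * n, [0.0] * n
    undecided, above = set(range(n)), set()
    t = 0
    while undecided:
        t += 1
        for i in list(undecided):
            counts[i] += 1
            sums[i] += arms[i](rng)
            mean = sums[i] / counts[i]
            # anytime confidence radius (union bound over arms and rounds)
            rad = math.sqrt(math.log(2 * n * t * t / delta) / (2 * counts[i]))
            if mean - rad > tau:
                above.add(i)
                undecided.discard(i)
            elif mean + rad < tau:
                undecided.discard(i)
    return above
```

GrAPL improves on this kind of per-arm baseline precisely by letting strongly connected arms share samples through the graph, so that well-connected arms are decided with far fewer pulls.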
In bandit multiple hypothesis testing, each arm corresponds to a different null hypothesis that we wish to test, and the goal is to design adaptive algorithms that correctly identify a large set of interesting arms (true discoveries) while only mistakenly identifying a few uninteresting ones (false discoveries). One common metric in non-bandit multiple testing is the false discovery rate (FDR). We propose a unified, modular framework for bandit FDR control that emphasizes the decoupling of exploration and summarization of evidence. We utilize the powerful martingale-based concept of "e-processes" to ensure FDR control for arbitrary composite nulls, exploration rules and stopping times in generic problem settings. In particular, valid FDR control holds even if the reward distributions of the arms are dependent, multiple arms may be queried simultaneously, and multiple (cooperating or competing) agents may be querying arms, covering combinatorial semi-bandit settings as well. Prior work has considered in great detail the setting where each arm's reward distribution is independent and sub-Gaussian, and a single arm is queried at each step. Our framework recovers matching sample complexity guarantees in this special case, and performs comparably or better in practice. For other settings, sample complexities will depend on the finer details of the problem (composite nulls being tested, exploration algorithm, data dependence structure, stopping rule) and we do not explore these; our contribution is to show that the FDR guarantee is clean and entirely agnostic to these details.
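The summarization half of the pipeline described above can be sketched with the e-BH procedure: given one e-value per hypothesis (e.g. the final value of a stopped e-process, a nonnegative variable with expectation at most 1 under its null), reject the k hypotheses with the largest e-values, where k is the largest i such that the i-th largest e-value is at least n/(αi). This is a sketch of standard e-BH, not the paper's full framework; the exploration side that builds the e-processes is not shown, and the function name is my own.

```python
def e_bh(e_values, alpha=0.05):
    """e-Benjamini-Hochberg: given e-values e_1..e_n (each with expectation
    <= 1 under its null), reject the k hypotheses with the largest e-values,
    where k = max { i : e_(i) >= n / (alpha * i) } and e_(i) is the i-th
    largest e-value.  Returns the set of rejected hypothesis indices."""
    n = len(e_values)
    order = sorted(range(n), key=lambda i: e_values[i], reverse=True)
    k = 0
    for i, idx in enumerate(order, start=1):
        if e_values[idx] >= n / (alpha * i):
            k = i
    return set(order[:k])
```

The appeal of this decoupling is exactly what the abstract emphasizes: the FDR guarantee of the summarization step holds regardless of how the exploration step gathered the evidence, including under dependent rewards and multiple querying agents.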
The pure-exploration problem in stochastic multi-armed bandits aims to find one or more arms with the largest (or near largest) means. Examples include finding an ϵ-good arm, best-arm identification, top-k arm identification, and finding all arms with means above a specified threshold. However, the problem of finding all ϵ-good arms has been overlooked in past work, although arguably this may be the most natural objective in many applications. For example, a virologist may conduct preliminary laboratory experiments on a large candidate set of treatments and move all ϵ-good treatments into more expensive clinical trials. Since the ultimate clinical efficacy is uncertain, it is important to identify all ϵ-good candidates. Mathematically, the all-ϵ-good arm identification problem presents significant new challenges and surprises that do not arise in the pure-exploration objectives studied in the past. We introduce two algorithms to overcome these and demonstrate their great empirical performance on a large-scale crowd-sourced dataset of 2.2M ratings collected by the New Yorker Caption Contest as well as a dataset testing hundreds of possible cancer drugs.
We propose a new variant of the top arm identification problem, top feasible arm identification, where there are K arms associated with D-dimensional distributions and the goal is to find m arms that maximize some known linear function of their means subject to the constraint that their means belong to a given set P ⊂ R^D. This problem has many applications since in many settings, feedback is multi-dimensional and it is of interest to perform constrained maximization. We present problem-dependent lower bounds for top feasible arm identification and upper bounds for several algorithms. Our most broadly applicable algorithm, TF-LUCB-B, has an upper bound that is loose by a factor of O(D log(K)). Many problems of practical interest are two dimensional and, for these, it is loose by a factor of O(log(K)). Finally, we conduct experiments on synthetic and real-world datasets that demonstrate the effectiveness of our algorithms. Our algorithms are superior both in theory and in practice to a naive two-stage algorithm that first identifies the feasible arms and then applies a best arm identification algorithm to the feasible arms.
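The objective in the abstract above, stripped of the sampling problem, reduces to a simple selection: among arms whose D-dimensional mean vector lies in the feasible set P, pick the m with the largest linear score. The sketch below shows only this offline objective — the bandit part (estimating the means from samples, as TF-LUCB-B does with confidence bounds) is not reproduced, and the function name and the predicate representation of P are my own assumptions.

```python
def top_feasible(means, c, feasible, m):
    """Offline version of top feasible arm identification: given the true
    D-dim mean vector of each arm, a linear objective c, and a feasibility
    predicate for the set P, return the m feasible arms maximizing c . mu."""
    scores = {i: sum(ci * mi for ci, mi in zip(c, mu))
              for i, mu in enumerate(means) if feasible(mu)}
    return sorted(scores, key=scores.get, reverse=True)[:m]
```

The abstract's point about the naive two-stage baseline is that resolving feasibility and optimality separately wastes samples: arms that are far from optimal need not have their feasibility resolved precisely, and vice versa.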
We study the multi-agent multi-armed bandit (MAMAB) problem, where agents are factored into overlapping groups. Each group represents a hyperedge, forming a hypergraph over the agents. At each round of interaction, the learner pulls a joint arm (composed of individual arms for each agent) and receives a reward according to the hypergraph structure. Specifically, we assume there is a local reward for each hyperedge, and the reward of the joint arm is the sum of these local rewards. Previous work introduced the multi-agent Thompson sampling (MATS) algorithm and derived a Bayesian regret bound. However, it remains an open problem how to derive a frequentist regret bound for Thompson sampling in this multi-agent setting. To address this, we propose an efficient variant of MATS, the epsilon-exploring Multi-Agent Thompson Sampling (eps-MATS) algorithm, which performs MATS exploration with probability epsilon and adopts a greedy policy otherwise. We prove that eps-MATS achieves a worst-case frequentist regret bound that is sublinear in both the time horizon and the local arm size. We also derive a lower bound for this setting, which implies that our frequentist regret upper bound is optimal up to constant and logarithmic factors when the hypergraph is sufficiently sparse. Thorough experiments on standard MAMAB problems demonstrate the superior performance and the improved computational efficiency of eps-MATS compared with existing algorithms in the same setting.
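The epsilon-exploring idea above can be sketched in a single-agent Bernoulli simplification: with probability eps, draw an index from the Beta posteriors (Thompson sampling); otherwise act greedily on the posterior means. This is not eps-MATS itself — the hypergraph factorization over agents and the additive local rewards are not reproduced — and the function name and Beta(1,1) priors are my own assumptions.

```python
import random

def eps_ts(arms, horizon, eps=0.1, rng=random.Random(0)):
    """Single-agent sketch of epsilon-exploring Thompson sampling for
    Bernoulli arms: explore via posterior sampling with probability eps,
    otherwise exploit the posterior-mean leader.  `arms` is a list of
    samplers (taking an rng) with rewards in {0, 1}; returns total reward."""
    n = len(arms)
    a, b = [1] * n, [1] * n          # Beta(1, 1) priors per arm
    total = 0.0
    for _ in range(horizon):
        if rng.random() < eps:
            # Thompson step: one posterior draw per arm, play the argmax
            idx = max(range(n), key=lambda i: rng.betavariate(a[i], b[i]))
        else:
            # greedy step: play the arm with the largest posterior mean
            idx = max(range(n), key=lambda i: a[i] / (a[i] + b[i]))
        r = arms[idx](rng)
        total += r
        a[idx] += r                  # Beta posterior update
        b[idx] += 1 - r
    return total
```

The computational appeal mirrors the abstract: most rounds are cheap greedy steps, and the randomized posterior draws happen only an eps fraction of the time.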