Title: A conjecture on the Feldman bandit problem
Abstract: We consider the Bernoulli bandit problem where one of the arms has win probability α and the others β, with the identity of the α arm specified by initial probabilities. With u = max(α, β), v = min(α, β), call an arm with win probability u a good arm. Whereas it is known that the strategy of always playing the arm with the largest probability of being a good arm maximizes the expected number of wins in the first n games for all n, we conjecture that it also stochastically maximizes the number of wins. That is, we conjecture that this strategy maximizes the probability of at least k wins in the first n games for all k, n. The conjecture is proven when k = 1, when k = n, and when there are only two arms and k = n - 1.
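As a concrete illustration of the strategy in question, the following sketch simulates the rule that always plays the arm with the largest posterior probability of being the good arm, and uses it to estimate the probability of at least k wins in the first n games by Monte Carlo. The function name, argument layout, and numerical values are illustrative choices, not anything taken from the paper.

```python
import random

def greedy_feldman(prior, u, v, n, rng):
    """Play n games with the rule that always pulls the arm whose posterior
    probability of being the good arm (win probability u) is largest.
    prior[i] is the initial probability that arm i is the good arm; every
    other arm wins with probability v.  Returns the number of wins."""
    post = list(prior)
    good = rng.choices(range(len(prior)), weights=prior)[0]  # true good arm
    wins = 0
    for _ in range(n):
        i = max(range(len(post)), key=lambda j: post[j])     # greedy choice
        win = rng.random() < (u if i == good else v)
        wins += win
        # Bayes update: likelihood of the observed outcome if arm j were good.
        like = [(u if j == i else v) if win else 1 - (u if j == i else v)
                for j in range(len(post))]
        norm = sum(l * q for l, q in zip(like, post))
        post = [l * q / norm for l, q in zip(like, post)]
    return wins

# Crude Monte Carlo estimate of P(at least k wins in the first n games).
rng = random.Random(1)
k, n, runs = 3, 5, 20_000
hits = sum(greedy_feldman([0.5, 0.3, 0.2], 0.7, 0.4, n, rng) >= k
           for _ in range(runs))
print(hits / runs)
```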
Award ID(s): 1662442
PAR ID: 10070434
Author(s) / Creator(s):
Date Published:
Journal Name: Journal of Applied Probability
Volume: 55
Issue: 01
ISSN: 0021-9002
Page Range / eLocation ID: 318 to 324
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Consider a set of n players. We suppose that each game involves two players, that there is some unknown player who wins each game it plays with a probability greater than 1/2, and that our objective is to determine this best player. Under the requirement that the policy employed guarantees a correct choice with a probability of at least some specified value, we look for a policy that has a relatively small expected number of games played before a decision is reached. We consider this problem both under the assumption that the best player wins each game with a probability of at least some specified value >1/2, and under a Bayesian assumption that the probability that player i wins a game against player j is player i's value divided by the sum of the two players' values, where the values are the unknown values of n independent and identically distributed exponential random variables. In the former case, we propose a policy where chosen pairs play a match that ends when one of them has had a specified number of wins more than the other; in the latter case, we propose a Thompson-sampling-type rule.
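A minimal sketch of the first proposed policy's building block, under stated assumptions: a chosen pair of players keeps playing until one of them is some specified number m of wins ahead of the other. The names play_match, p_ij, and m are illustrative placeholders rather than notation from the paper.

```python
import random

def play_match(p_ij, m, rng):
    """Players i and j play repeated games, i winning each game with
    probability p_ij, until one player is m wins ahead of the other.
    Returns (True if i won the match, number of games played)."""
    lead, games = 0, 0          # lead = wins of i minus wins of j
    while abs(lead) < m:
        lead += 1 if rng.random() < p_ij else -1
        games += 1
    return lead > 0, games

# Example: a player who wins 60% of individual games, required lead m = 5.
print(play_match(0.6, 5, random.Random(0)))
```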
  2. Consider a gambler who on each bet either wins 1 with probability p or loses 1 with probability q=1-p, with the results of successive bets being independent. The gambler will stop betting when they are either up k or down k. Letting N be the number of bets made, we show that N is a new better than used random variable. Moreover, we show that if k is even then N/2 has an increasing failure rate, and if k is odd then (N+1)/2 has an increasing failure rate. 
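The stopping time N is easy to sample, so the new-better-than-used property P(N > s + t) ≤ P(N > s)·P(N > t) can at least be eyeballed by simulation. The sketch below is only a rough Monte Carlo illustration under assumed parameter values (p = 0.55, k = 4), with names chosen for this example.

```python
import random

def stopping_time(p, k, rng):
    """Number of +/-1 bets (win probability p) until the gambler's profit
    first reaches +k or -k."""
    pos, n = 0, 0
    while abs(pos) < k:
        pos += 1 if rng.random() < p else -1
        n += 1
    return n

rng = random.Random(0)
samples = [stopping_time(0.55, 4, rng) for _ in range(100_000)]

def surv(t):
    """Empirical survival function P(N > t)."""
    return sum(x > t for x in samples) / len(samples)

# Rough check of the new-better-than-used inequality P(N > s+t) <= P(N > s) P(N > t).
for s, t in [(2, 2), (4, 4), (4, 8)]:
    print(s, t, surv(s + t), surv(s) * surv(t))
```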
  3. Mixed strategies are often evaluated based on the expected payoff that they guarantee. This is not always desirable. In this paper, we consider games for which maximizing the expected payoff deviates from the actual goal of the players. To address this issue, we introduce the notion of a (u, p)-maxmin strategy, which ensures receiving a minimum utility of u with probability at least p. We then give approximation algorithms for the problem of finding a (u, p)-maxmin strategy for these games. The first game that we consider is Colonel Blotto, a well-studied game that was introduced in 1921. In the Colonel Blotto game, two colonels divide their troops among a set of battlefields. Each battlefield is won by the colonel who puts more troops in it. The payoff of each colonel is the weighted number of battlefields that she wins. We show that maximizing the expected payoff of a player does not necessarily maximize her winning probability for certain applications of Colonel Blotto. For example, in presidential elections, the players' goal is to maximize the probability of winning more than half of the votes, rather than maximizing the expected number of votes that they get. We give an exact algorithm for a natural variant of the continuous version of this game. More generally, we provide constant and logarithmic approximation algorithms for finding (u, p)-maxmin strategies. We also introduce a security game version of Colonel Blotto, which we call the auditing game. It is played between two players, a defender and an attacker. The goal of the defender is to prevent the attacker from changing the outcome of an instance of Colonel Blotto. Again, maximizing the expected payoff of the defender is not necessarily optimal. Therefore, we give a constant approximation for (u, p)-maxmin strategies.
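To make the difference between the two criteria concrete, here is a small sketch, with illustrative names and data only, that evaluates a pair of mixed strategies in a discrete Blotto instance under both the expected-payoff criterion and the (u, p) criterion (the probability that the payoff reaches a threshold u).

```python
from itertools import product

def payoff(alloc_a, alloc_b, weights):
    """Weighted number of battlefields won by A; a battlefield is won by the
    colonel who puts strictly more troops on it."""
    return sum(w for a, b, w in zip(alloc_a, alloc_b, weights) if a > b)

def evaluate(mixed_a, mixed_b, weights, u):
    """mixed_a, mixed_b: mixed strategies as lists of (probability, allocation).
    Returns (expected payoff of A, probability that A's payoff is >= u)."""
    exp_pay, prob_u = 0.0, 0.0
    for (pa, xa), (pb, xb) in product(mixed_a, mixed_b):
        val = payoff(xa, xb, weights)
        exp_pay += pa * pb * val
        prob_u += pa * pb * (val >= u)
    return exp_pay, prob_u

# Three equal-weight battlefields, six troops each; A needs payoff >= 2
# (a majority of the battlefields) to "win" in the election sense.
A = [(0.5, (4, 1, 1)), (0.5, (1, 4, 1))]
B = [(1.0, (2, 2, 2))]
print(evaluate(A, B, (1, 1, 1), u=2))   # expected payoff 1.0, P(win) 0.0
```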
  4. Let $u_k$ be a solution of the Helmholtz equation with the wave number $k$, $\Delta u_k + k^2 u_k = 0$, on (a small ball in) either $\mathbb{R}^n$, $\mathbb{S}^n$, or $\mathbb{H}^n$. For a fixed point $p$, we define $M_{u_k}(r) = \max_{d(x,p) \le r} |u_k(x)|$. The three-ball inequality $M_{u_k}(2r) \le C(k,r,\alpha)\, M_{u_k}(r)^{\alpha} M_{u_k}(4r)^{1-\alpha}$ is well known; it holds for some $\alpha \in (0,1)$ and $C(k,r,\alpha) > 0$ independent of $u_k$. We show that the constant $C(k,r,\alpha)$ grows exponentially in $k$ (when $r$ is fixed and small). We also compare our result with the increased stability for solutions of the Cauchy problem for the Helmholtz equation on Riemannian manifolds.
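For readability, the three-ball inequality quoted in the abstract above, in display form (this is only a restatement of what is already stated there, not an additional result):

```latex
% Three-ball inequality for solutions of the Helmholtz equation
% \Delta u_k + k^2 u_k = 0, with M_{u_k}(r) = \max_{d(x,p) \le r} |u_k(x)|.
% The paper shows C(k, r, \alpha) grows exponentially in k for fixed small r.
\[
  M_{u_k}(2r) \;\le\; C(k, r, \alpha)\, M_{u_k}(r)^{\alpha}\, M_{u_k}(4r)^{1-\alpha},
  \qquad \alpha \in (0, 1).
\]
```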
  5. Boosting engagement with educational software has been promoted as a means of improving student performance. Various engagement factors have been explored, including choice, personalization, badges, bonuses, and competition. We examine two promising and relatively understudied manipulations from the realm of gambling: the near-win effect and anticipation. The near-win effect occurs when an individual comes close to achieving a goal, e.g., getting two cherries and a lemon in a slot machine. Anticipation refers to the build-up of suspense as an outcome is revealed, e.g., revealing cherry-cherry-lemon in that order drives expectations of winning more than revealing lemon-cherry-cherry. Gambling psychologists have long studied how near-wins affect engagement in pure-chance games, but it is difficult to do the same in an educational context where outcomes are based on skill. In this paper, we manipulate the display of outcomes in a manner that allows us to introduce artificial near-wins largely independent of a student's performance. In a study involving thousands of students using an online math tutor, we examine how this manipulation affects a behavioral measure of engagement: whether or not a student repeats a lesson. We find a near-win effect on engagement when the 'win' indicates to the student that they have attained critical competence on a lesson, namely the competence that allows them to continue to the next lesson. Nonetheless, when we experimentally induce near wins in a randomized controlled trial, we do not obtain a reliable effect of the near win. We discuss this mismatch of results in terms of the role of anticipation in making near wins effective. We conclude by describing manipulations that might increase the effect of near wins on engagement.