Particle Thompson Sampling with Static Particles

Zhou, Zeyu; Hajek, Bruce

doi:10.1109/CISS56502.2023.10089653

Citation Details

Particle Thompson Sampling with Static Particles

Particle Thompson sampling (PTS) is a simple and flexible approximation of Thompson sampling for solving stochastic bandit problems. PTS circumvents the intractability of maintaining a continuous posterior distribution in Thompson sampling by replacing the continuous distribution with a discrete distribution supported at a set of weighted static particles. We analyze the dynamics of particles' weights in PTS for general stochastic bandits without assuming that the set of particles contains the unknown system parameter. It is shown that fit particles survive and unfit particles decay, with the fitness measured in KL-divergence. For Bernoulli bandit problems, all but a few fit particles decay. more »

Award ID(s):: 1900636

PAR ID:: 10414648

Author(s) / Creator(s):: Zhou, Zeyu; Hajek, Bruce

Date Published:: 2023-03-22

Journal Name:: Proc. 57th Annual Conference on Information Sciences and Systems

Page Range / eLocation ID:: 1 to 6

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/CISS56502.2023.10089653

More Like this