Title: Improving Particle Thompson Sampling through Regenerative Particles
This paper proposes regenerative particle Thompson sampling (RPTS) as an improvement of particle Thompson sampling (PTS) for solving general stochastic bandit problems. PTS approximates Thompson sampling by replacing the continuous posterior distribution with a discrete distribution supported on a set of weighted static particles. PTS is flexible but may suffer from poor performance due to the tendency of the probability mass to concentrate on a small number of particles. RPTS exploits the particle weight dynamics of PTS and uses non-static particles: it deletes a particle if its probability mass gets sufficiently small and regenerates new particles in the vicinity of the surviving particles. Empirical evidence shows uniform improvement across a set of representative bandit problems without increasing the number of particles.
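As a concrete illustration, here is a minimal Python sketch of the delete-and-regenerate loop the abstract describes, on a two-armed Bernoulli bandit. The weight threshold, the resampling of regenerated particles from high-weight survivors, and the Gaussian perturbation are illustrative assumptions, not the paper's exact rules.

```python
import numpy as np

def rpts_bernoulli(true_means, n_particles=20, horizon=500,
                   w_min=1e-3, noise=0.05, seed=0):
    """Toy RPTS loop for a Bernoulli bandit (illustrative assumptions)."""
    rng = np.random.default_rng(seed)
    k = len(true_means)
    # Each particle is a candidate mean vector for the k arms.
    particles = rng.uniform(0.0, 1.0, size=(n_particles, k))
    weights = np.full(n_particles, 1.0 / n_particles)
    total_reward = 0.0
    for _ in range(horizon):
        # Thompson step: sample a particle by weight, play its best arm.
        i = rng.choice(n_particles, p=weights)
        arm = int(np.argmax(particles[i]))
        reward = float(rng.random() < true_means[arm])
        total_reward += reward
        # Bayes update on the discrete particle support.
        lik = particles[:, arm] if reward else 1.0 - particles[:, arm]
        weights = weights * np.clip(lik, 1e-12, None)
        weights /= weights.sum()
        # Regeneration: replace near-dead particles with perturbed copies
        # of surviving particles, drawn proportionally to their weights.
        dead = weights < w_min
        if dead.any() and (~dead).any():
            survivors = np.flatnonzero(~dead)
            p_surv = weights[survivors] / weights[survivors].sum()
            n_dead = int(dead.sum())
            src = survivors[rng.choice(len(survivors), size=n_dead, p=p_surv)]
            particles[dead] = np.clip(
                particles[src] + rng.normal(0.0, noise, size=(n_dead, k)),
                0.0, 1.0)
            weights[dead] = w_min
            weights /= weights.sum()
    return total_reward
```

With static particles the same loop, minus the regeneration block, reduces to plain PTS.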
Award ID(s): 1900636
PAR ID: 10414647
Journal Name: Proc. 57th Annual Conference on Information Sciences and Systems
Page Range / eLocation ID: 1 to 4
Sponsoring Org: National Science Foundation
More Like this
  1. Particle Thompson sampling (PTS) is a simple and flexible approximation of Thompson sampling for solving stochastic bandit problems. PTS circumvents the intractability of maintaining a continuous posterior distribution in Thompson sampling by replacing the continuous distribution with a discrete distribution supported on a set of weighted static particles. We analyze the dynamics of particles' weights in PTS for general stochastic bandits without assuming that the set of particles contains the unknown system parameter. It is shown that fit particles survive and unfit particles decay, with fitness measured in KL divergence. For Bernoulli bandit problems, all but a few fit particles decay.
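The survive-or-decay dynamic can be seen in a toy single-arm experiment (an assumed setup, not the paper's): multiplicative likelihood updates concentrate weight on the particle closest to the truth in KL divergence.

```python
import numpy as np

def kl_bernoulli(p, q):
    # KL divergence between Bernoulli(p) and Bernoulli(q).
    q = np.clip(q, 1e-9, 1 - 1e-9)
    return p * np.log(p / q) + (1 - p) * np.log((1 - p) / (1 - q))

rng = np.random.default_rng(1)
true_p = 0.7
particles = np.array([0.2, 0.5, 0.72, 0.9])  # candidate parameters
weights = np.full(len(particles), 1.0 / len(particles))
for _ in range(2000):
    r = rng.random() < true_p
    # Reweight each particle by the likelihood it assigns to the draw.
    lik = particles if r else 1.0 - particles
    weights = weights * lik
    weights /= weights.sum()

# The particle nearest the truth in KL divergence ends up with
# essentially all the probability mass.
fittest = int(np.argmin(kl_bernoulli(true_p, particles)))
print(fittest, weights[fittest])
```

The per-step expected log-weight gap between two particles is exactly the difference of their KL divergences to the truth, which is why KL is the natural fitness measure here.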
  2. We consider the problem of transmitting at the optimal rate over a rapidly-varying wireless channel with unknown statistics when the feedback about channel quality is very limited. One motivation for this problem is that, in emerging wireless networks, the use of mmWave bands means that the channel quality can fluctuate rapidly and thus, one cannot rely on full channel-state feedback to make transmission rate decisions. Inspired by related problems in the context of multi-armed bandits, we consider a well-known algorithm called Thompson sampling to address this problem. However, unlike the traditional multi-armed bandit problem, a direct application of Thompson sampling results in a computational and storage complexity that grows exponentially with time. Therefore, we propose an algorithm called Modified Thompson sampling (MTS), whose computational and storage complexity is simply linear in the number of channel states and which achieves at most logarithmic regret as a function of time when compared to an optimal algorithm which knows the probability distribution of the channel states. 
  3. Daumé III, Hal; Singh, Aarti (Ed.)
    Thompson sampling for multi-armed bandit problems is known to enjoy favorable performance in both theory and practice. However, its wider deployment is restricted by a significant computational limitation: the need for samples from posterior distributions at every iteration. In practice, this limitation is alleviated by using approximate sampling methods, yet provably incorporating approximate samples into Thompson sampling algorithms remains an open problem. In this work we address this by proposing two efficient Langevin MCMC algorithms tailored to Thompson sampling. The resulting approximate Thompson sampling algorithms are efficiently implementable and provably achieve optimal instance-dependent regret for the multi-armed bandit (MAB) problem. To prove these results we derive novel posterior concentration bounds and MCMC convergence rates for log-concave distributions, which may be of independent interest.
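To make the Langevin ingredient concrete, here is a generic unadjusted Langevin (ULA) chain targeting a log-concave density, not the paper's tailored algorithms. The target is taken to be a Gaussian posterior for one arm's mean, an assumed setup chosen because the exact answer is known and the chain can be checked against it.

```python
import numpy as np

rng = np.random.default_rng(0)
mu, sigma2 = 0.6, 0.04   # assumed Gaussian posterior for one arm's mean

def grad_log_post(theta):
    # Gradient of log N(theta; mu, sigma2); the log-density is concave.
    return -(theta - mu) / sigma2

step = 0.005
theta = 0.0              # arbitrary starting point
burn_in, n_samples = 2000, 20000
samples = np.empty(n_samples)
for t in range(burn_in + n_samples):
    # ULA step: drift up the log-density plus injected Gaussian noise.
    theta = theta + step * grad_log_post(theta) + np.sqrt(2.0 * step) * rng.normal()
    if t >= burn_in:
        samples[t - burn_in] = theta
```

For small step sizes the chain's stationary law approximates the target, so the empirical mean and variance of `samples` land near `mu` and `sigma2`; in Thompson sampling, one such draw per iteration would replace the exact posterior sample.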
  4. Thompson sampling has become a ubiquitous approach to online decision problems with bandit feedback. The key algorithmic task for Thompson sampling is drawing a sample from the posterior of the optimal action. We propose an alternative arm selection rule, which we dub TS-UCB, that requires negligible additional computational effort but provides significant performance improvements relative to Thompson sampling. At each step, TS-UCB computes a score for each arm using two ingredients: posterior sample(s) and upper confidence bounds. TS-UCB can be used in any setting where these two quantities are available, and it is flexible in the number of posterior samples it takes as input. TS-UCB achieves materially lower regret on a comprehensive suite of synthetic and real-world datasets, including a personalized article recommendation dataset from Yahoo! and a suite of benchmark datasets from the deep bandit suite proposed in Riquelme et al. (2018). Finally, from a theoretical perspective, we establish optimal regret guarantees for TS-UCB for both the K-armed and linear bandit models.
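The abstract names the two ingredients but not the score formula, so the following only shows how both quantities are computed for a Beta-Bernoulli bandit; the averaging used to combine them is a placeholder assumption, not the paper's rule.

```python
import numpy as np

rng = np.random.default_rng(0)
successes = np.array([30.0, 5.0])   # hypothetical per-arm counts
failures = np.array([20.0, 5.0])
t = successes.sum() + failures.sum()

# Ingredient 1: a posterior sample per arm (Beta(1,1) prior assumed).
posterior_sample = rng.beta(successes + 1.0, failures + 1.0)

# Ingredient 2: a standard UCB1-style upper confidence bound per arm.
means = successes / (successes + failures)
ucbs = means + np.sqrt(2.0 * np.log(t) / (successes + failures))

# Placeholder combination of the two ingredients (not the paper's score).
scores = 0.5 * (posterior_sample + ucbs)
arm = int(np.argmax(scores))
```

Note how the under-sampled second arm gets a wide confidence bonus, which is the kind of signal a posterior sample alone can miss on any single draw.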
  5. We study the multi-agent multi-armed bandit (MAMAB) problem, where agents are factored into overlapping groups. Each group represents a hyperedge, forming a hypergraph over the agents. At each round of interaction, the learner pulls a joint arm (composed of individual arms for each agent) and receives a reward according to the hypergraph structure. Specifically, we assume there is a local reward for each hyperedge, and the reward of the joint arm is the sum of these local rewards. Previous work introduced the multi-agent Thompson sampling (MATS) algorithm and derived a Bayesian regret bound. However, it remains an open problem how to derive a frequentist regret bound for Thompson sampling in this multi-agent setting. To address this, we propose an efficient variant of MATS, the epsilon-exploring Multi-Agent Thompson Sampling (eps-MATS) algorithm, which performs MATS exploration with probability epsilon while adopting a greedy policy otherwise. We prove that eps-MATS achieves a worst-case frequentist regret bound that is sublinear in both the time horizon and the local arm size. We also derive a lower bound for this setting, which implies our frequentist regret upper bound is optimal up to constant and logarithmic factors when the hypergraph is sufficiently sparse. Thorough experiments on standard MAMAB problems demonstrate the superior performance and the improved computational efficiency of eps-MATS compared with existing algorithms in the same setting.
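A hedged sketch of the eps-MATS switching rule on a factored Bernoulli bandit: with probability epsilon each group samples its local posterior (the MATS step), otherwise it acts greedily on posterior means. To keep the joint argmax trivial, the two groups here are assumed disjoint; coordinating the maximization over overlapping hyperedges, as in the paper, is harder.

```python
import numpy as np

rng = np.random.default_rng(0)
eps = 0.1
# One row of local arm means per group (disjoint groups assumed).
true_means = [np.array([0.2, 0.8]), np.array([0.6, 0.3])]
succ = [np.zeros(2), np.zeros(2)]
fail = [np.zeros(2), np.zeros(2)]

def pull(horizon=2000):
    total = 0.0
    for _ in range(horizon):
        explore = rng.random() < eps
        joint = []
        for g in range(2):
            if explore:
                # MATS step: sample each local Beta posterior.
                theta = rng.beta(succ[g] + 1.0, fail[g] + 1.0)
            else:
                # Greedy step: act on posterior means instead.
                theta = (succ[g] + 1.0) / (succ[g] + fail[g] + 2.0)
            joint.append(int(np.argmax(theta)))
        # Reward of the joint arm is the sum of local rewards.
        for g, a in enumerate(joint):
            r = float(rng.random() < true_means[g][a])
            total += r
            succ[g][a] += r
            fail[g][a] += 1.0 - r
    return total
```

Because the greedy step skips posterior sampling on most rounds, the per-round cost drops, which is one intuition for the improved computational efficiency the abstract reports.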