Chrome Extension
WeChat Mini Program
Use on ChatGLM

Particle Thompson Sampling with Static Particles

2023 57th Annual Conference on Information Sciences and Systems (CISS)(2023)

Cited 1|Views4
No score
Abstract
Particle Thompson sampling (PTS) is a simple and flexible approximation of Thompson sampling for solving stochastic bandit problems. PTS circumvents the intractability of maintaining a continuous posterior distribution in Thompson sampling by replacing the continuous distribution with a discrete distribution supported at a set of weighted static particles. We analyze the dynamics of particles' weights in PTS for general stochastic bandits without assuming that the set of particles contains the unknown system parameter. It is shown that fit particles survive and unfit particles decay, with the fitness measured in KL-divergence. For Bernoulli bandit problems, all but a few fit particles decay.
More
Translated text
Key words
stochastic bandit,Thompson sampling,particles
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined