Kolmogorov–Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits

IEEE Transactions on Artificial Intelligence(2022)

引用 4|浏览1
暂无评分
摘要
We consider the nonstationary multiarmed bandit framework and propose a Kolmogorov–Smirnov (KS) test based Thompson sampling (TS) algorithm named TS-KS that actively detects change points and resets the TS parameters once a change is detected. In particular, for the two-armed bandit case, we derive bounds on the number of samples of the reward distribution to detect the chan...
更多
查看译文
关键词
Change detection algorithms,Portfolios,Heuristic algorithms,Task analysis,Optimization,Clinical trials,Artificial intelligence
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要