Risk-Averse Multi-Armed Bandit Problems under Mean-Variance Measure.

IEEE Journal of Selected Topics in Signal Processing(2016)

引用 45|浏览120
暂无评分
摘要
The multi-armed bandit (MAB) problems have been studied mainly under the measure of expected total reward accrued over a horizon of length T . In this paper, we address the issue of risk in MAB problems and develop parallel results under the measure of mean-variance, a commonly adopted risk measure in economics and mathematical finance. We show that the model-specific regret and the model-independ...
更多
查看译文
关键词
Biological system modeling,Economics,Loss measurement,Time measurement,Portfolios,Analytical models
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要