Overcoming Prior Misspecification in Online Learning to Rank

Javad Azizi,Ofer Meshi,Masrour Zoghi,Maryam Karimzadehgan

arxiv（2023）

引用 0|浏览26

暂无评分

摘要

The recent literature on online learning to rank (LTR) has established the utility of prior knowledge to Bayesian ranking bandit algorithms. However, a major limitation of existing work is the requirement for the prior used by the algorithm to match the true prior. In this paper, we propose and analyze adaptive algorithms that address this issue and additionally extend these results to the linear and generalized linear models. We also consider scalar relevance feedback on top of click feedback. Moreover, we demonstrate the efficacy of our algorithms using both synthetic and real-world experiments.

查看译文

关键词

online learning,prior misspecification

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要