Temperature Regret Matching for Imperfect-Information Games

user-5d4bc4a8530c70a9b361c870(2021)

引用 0|浏览20
暂无评分
摘要
Counterfactual regret minimization (CFR) methods are effective for solving two player zero-sum extensive games with imperfect information. Regret matching (RM) plays a crucial role in CFR and its variants to approach Nash equilibrium. In this paper, we present Temperature Regret Matching (TRM), a novel RM algorithm that adopts a different strategy. Also, we consider not only the opponent's strategy under the current strategy but also the opponent's strategies of the several last iterations for updating the external regret of each iteration. Furthermore, we theoretically demonstrate that the update of TRM converges to Nash Equilibrium. Competitive results in imperfect-information games have verified its effectiveness and efficiency.
更多
查看译文
关键词
Regret,Nash equilibrium,Counterfactual thinking,Perfect information,Mathematical optimization,Computer science,Regret minimization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要