Multilabeled Value Networks for Computer Go.

arXiv: Artificial Intelligence (2018)

Abstract
This paper proposes a novel value network architecture for the game of Go, called a multilabeled (ML) value network. In the ML value network, different values (win rates) are trained simultaneously for different settings of komi, a compensation given to balance the initiative of playing first. The ML value network has three advantages: (1) it outputs values for different komi; (2) it...
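
As a rough illustration of training several komi-specific values at once, the sketch below builds one win/loss label per komi from a single finished game and scores a vector of predicted win rates against those labels. Everything named here is an assumption made for illustration and is not taken from the paper: the komi list `KOMIS`, the helper names `komi_labels` and `multilabel_bce`, the use of Black's final score margin as the label source, and the choice of binary cross-entropy as the loss.

```python
import math

# Hypothetical komi settings; the abstract does not say which ones the paper uses.
KOMIS = [5.5, 6.5, 7.5]


def komi_labels(black_margin, komis=KOMIS):
    """Return one label per komi, from Black's point of view.

    black_margin is Black's score minus White's score before komi is applied
    (assumed to be known once the game is over). Black wins under komi k
    exactly when black_margin > k.
    """
    return [1.0 if black_margin > k else 0.0 for k in komis]


def multilabel_bce(predicted, labels):
    """Mean binary cross-entropy across all komi outputs.

    predicted holds one win rate in (0, 1) per komi. The loss choice is an
    assumption for this sketch, not the paper's stated training objective.
    """
    eps = 1e-7  # keeps log() away from 0
    total = 0.0
    for p, y in zip(predicted, labels):
        p = min(max(p, eps), 1.0 - eps)
        total -= y * math.log(p) + (1.0 - y) * math.log(1.0 - p)
    return total / len(labels)


# One game in which Black leads by 7 points on the board.
labels = komi_labels(7)
loss = multilabel_bce([0.5, 0.5, 0.5], labels)
```

In this setup a single game position supplies a training signal for every komi output at the same time, which is the sense in which the values are trained simultaneously.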
Keywords
Games,Training,Network architecture,Supervised learning,Learning (artificial intelligence),Monte Carlo methods,Servers