谷歌浏览器插件
订阅小程序
在清言上使用

LLaMA-Berry: Pairwise Optimization for Olympiad-level Mathematical Reasoning Via O1-like Monte Carlo Tree Search.

Di Zhang, Jianbo Wu, Jingdi Lei, Tong Che, Jiatong Li, Tong Xie,Xiaoshui Huang,Shufei Zhang,Marco Pavone,Yuqiang Li,Wanli Ouyang,Dongzhan Zhou

North American Chapter of the Association for Computational Linguistics(2025)

引用 0|浏览0
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要