Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs

Shaojie Zhu,Zhaobin Wang, Chengxiang Zhuo,Hui Lu,Bo Hu,Zang Li

CoRR(2023)

引用 0|浏览10
暂无评分
摘要
CoT (Chain-of-Thought) is a way to solve reasoning problems for LLMs . Recently, many researches appear for improving the CoT capability of LLMs. In this work, we also proposed Olapa-MCoT, which is a LLMs based on llama2-13B PLM for finetuning and alignment learning. During the alignment training, we proposed the SimRRHF algorithm and Incorrect Data Relearning and mainly focused on optimizing the Chinese mathematical reasoning ability of Olapa-MCoT. The experiment achieved significant results, with the accuracy of Chinese mathematical reasoning up to 50 the accuracy of English reasoning ability also increased by nearly 4
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要