MOSS: An Open Conversational Large Language Model

Tianxiang Sun, Xiaotian Zhang, Zhengfu He, Peng Li, Qinyuan Cheng, Xiangyang Liu, Hang Yan, Yunfan Shao, Qiong Tang, Shiduo Zhang, Xingjian Zhao, Ke Chen, Yining Zheng, Zhejian Zhou, Ruixiao Li, Jun Zhan, Yunhua Zhou, Linyang Li, Xiaogui Yang, Lingling Wu, Zhangyue Yin, Xuanjing Huang, Yu-Gang Jiang, Xipeng Qiu

Machine Intelligence Research (2024)

Abstract
Conversational large language models (LLMs) such as ChatGPT and GPT-4 have recently exhibited remarkable capabilities across various domains, capturing widespread public attention. To facilitate this line of research, we report the development of MOSS, an open-source conversational LLM that contains 16B parameters and can follow a variety of instructions in multi-turn interactions with humans. The base model of MOSS is pre-trained on large-scale unlabeled English, Chinese, and code data. To optimize the model for dialogue, we generate 1.1M synthetic conversations based on user prompts collected through earlier versions of our model API. We then perform preference-aware training on preference data annotated from AI feedback. Evaluation results on real-world use cases and academic benchmarks demonstrate the effectiveness of the proposed approaches. In addition, we present an effective practice for augmenting MOSS with several external tools. Through the development of MOSS, we have established a complete technical roadmap for large language models, from pre-training and supervised fine-tuning to alignment, verifying the feasibility of building a ChatGPT-like model under resource-limited conditions and providing a reference for both the academic and industrial communities. Model weights and code are publicly available at https://github.com/OpenMOSS/MOSS.
Keywords
Large language models, natural language processing, pre-training, alignment, ChatGPT, MOSS