Chrome Extension
WeChat Mini Program
Use on ChatGLM

Model Merging in Pre-training of Large Language Models

Yunshui Li, Yiyuan Ma,Shen Yan, Chaoyi Zhang,Jing Liu,Jianqiao Lu, Ziwen Xu,Mengzhao Chen, Minrui Wang, Shiyi Zhan, Jin Ma, Xunhao Lai, Deyi Liu, Yao Luo, Xingyan Bin, Hongbin Ren, Mingji Han, Wenhao Hao, Bairen Yi, LingJun Liu, Bole Ma, Xiaoying Jia, Xun Zhou, Siyuan Qiao, Liang Xiang,Yonghui Wu

arxiv(2025)

Cited 0|Views2
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined