Model Merging in Pre-training of Large Language ModelsYunshui Li, Yiyuan Ma,Shen Yan, Chaoyi Zhang,Jing Liu,Jianqiao Lu, Ziwen Xu,Mengzhao Chen, Minrui Wang, Shiyi Zhan, Jin Ma, Xunhao Lai, Deyi Liu, Yao Luo, Xingyan Bin, Hongbin Ren, Mingji Han, Wenhao Hao, Bairen Yi, LingJun Liu, Bole Ma, Xiaoying Jia, Xun Zhou, Siyuan Qiao, Liang Xiang,Yonghui Wuarxiv(2025)Cited 0|Views2AI Read ScienceMust-Reading TreeExampleGenerate MRT to find the research sequence of this paperChat PaperSummary is being generated by the instructions you defined