谷歌浏览器插件
订阅小程序
在清言上使用

MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism

Ruidong Zhu,Ziheng Jiang,Chao Jin, Peng Wu, Cesar A. Stuardo, Dongyang Wang, Xinlei Zhang, Huaping Zhou,Haoran Wei, Yang Cheng, Jianzhe Xiao, Xinyi Zhang, Lingjun Liu,Haibin Lin, Li-Wen Chang, Jianxi Ye, Xiao Yu,Xuanzhe Liu,Xin Jin, Xin Liu

arxiv(2025)

引用 0|浏览12
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要