Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference

Yinghan Li, Yifei Li, Jiejing Zhang, Bujiao Chen, Xiaotong Chen, Lian Duan, Yejun Jin, Zheng Li, Xuanyu Liu, Haoyu Wang, Wente Wang, Yajie Wang, Jiacheng Yang, Peiyang Zhang, Laiwen Zheng, Wenyuan Yu

CoRR (2025)