S-LoRA: Serving Thousands of Concurrent LoRA AdaptersYing Sheng,Shiyi Cao,Dacheng Li,Coleman Hooper,Nicholas Lee, Shuo Yang,Christopher Chou,Banghua Zhu,Lianmin Zheng,Kurt Keutzer,Joseph E. Gonzalez,Ion StoicaarXivorg(2023)引用 102|浏览594关键词Performance Optimization,GPU ComputingAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要