Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling.

Sohaib Ahmad,Hui Guan, Brian D. Friedman, Thomas Williams,Ramesh K. Sitaraman,Thomas Y. C. Woo

International Conference on Architectural Support for Programming Languages and Operating Systems(2024)

引用 0|浏览0
暂无评分
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要