Impact of Synchronization Topology on DML Performance: Both Logical Topology and Physical Topology

IEEE/ACM Transactions on Networking(2022)

引用 8|浏览13
暂无评分
摘要
To tackle the increasingly larger training data and models, researchers and engineers resort to multiple servers in a data center for distributed machine learning (DML). On one hand, DML enables us to leverage the computation power of multiple servers, which can effectively accelerate those computation-intensive tasks. On the other hand, DML also incurs significant communication cost due to parame...
更多
查看译文
关键词
Synchronization,Topology,Servers,Network topology,Computational modeling,Parallel processing,Hip
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要