The 16,384-node Parallelism of 3D-CNN Training on An Arm CPU based Supercomputer

2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC)(2021)

引用 2|浏览4
暂无评分
摘要
As the computational cost and datasets available for deep neural network training continue to increase, there is a significant demand for fast distributed training on supercomputers. However, porting and tuning applications for new advanced supercomputers requires tremendous amount of development efforts. Therefore, we present software tuning best practice for a 3D-CNN model training on a new Arm ...
更多
查看译文
关键词
Training,Computational modeling,Neural networks,Parallel processing,Supercomputers,Software,Computational efficiency
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要