Performance study of HPC applications on an Arm-based cluster using a generic efficiency model

2020 28th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)(2020)

引用 7|浏览11
暂无评分
摘要
HPC systems and parallel applications are increasing their complexity. Therefore the possibility of easily study and project at large scale the performance of scientific applications is of paramount importance. In this paper we describe a performance analysis method and we apply it to four complex HPC applications. We perform our study on a pre-production HPC system powered by the latest Arm-based CPUs for HPC, the Marvell ThunderX2. For each application we spot inefficiencies and factors that limit their scalability. The results show that in several cases the bottlenecks do not come from the hardware but from the way applications are programmed or the way the system software is configured.
更多
查看译文
关键词
Performance analysis,High Performance Computing,Parallel Applications,Arm,ThunderX2
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要