Parallel-algorithm extension for tsunami and earthquake-cycle simulators for massively parallel execution on the K computer.

IJHPCA(2016)

引用 4|浏览10
暂无评分
摘要
This article presents a case study on the extension of parallel algorithms in tsunami and earthquake-cycle simulators for massively parallel execution on the K computer. We use two target applications: a tsunami-simulation program, źJAGURS,ź and an earthquake-cycle program, źRSGDX.ź Our optimization strategy for collective communication is to split the Message Passing Interface MPI communicator and perform multistage localized communication to minimize the communication frequency, transferred data size, and network congestion. Moreover, in the case of severe load imbalances, we apply cyclic distribution and extend the axes for parallelization. For each application, we conduct a performance evaluation with massively parallel execution on the K computer. It is shown that our optimized code enables JAGURS to attain a 21.8× speedup for collective communication and a 7.9× speedup for the time-step loop on 8748 nodes 69,984 cores. RSGDX attains a 4.25× speedup for collective communication and an 18.7× speedup for the time-step loop on 8192 nodes 65,536 cores.
更多
查看译文
关键词
Tsunami simulation, earthquake-cycle simulation, optimization, collective communication, domain decomposition, K computer, massively parallel system
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要