High Performance Gravitational N-body Simulations on Supercomputer Fugaku.

HPC Asia(2022)

引用 0|浏览2
暂无评分
摘要
We report performance results of large astrophysical N-body simulations on supercomputer Fugaku. We use the hybrid TreePM algorithm, in which the short-range force is evaluated by the oct-tree method, and the long-range force is solved by the particle-mesh method. We introduce three innovations that significantly improve the performance on the novel CPU and extremely large system: the near-ultimate optimization of gravity kernel for short-range forces, the domain decomposition with accelerated communications in hardware-oriented ways, and a novel communication algorithm for long-range forces. We demonstrate that our code scales near linearly up to 158,976 nodes (7,630,848 CPU cores) of Fugaku and enables us to integrate about 1.45 trillion particles per second, which is the world fastest time-to-solution. The average performance achieved is 95.82 Pflops, which corresponds to ∼ 10% of the peak in single precision.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要