A Novel Framework for Unstructured Finite-Volume Methods with Optimized Hamiltonian Path Cell Ordering and Parallel Efficiency

AIAA SCITECH 2022 Forum(2022)

引用 0|浏览1
暂无评分
摘要
A novel optimized framework for unstructured finite-volume methods is presented, with an emphasis on memory bandwidth optimizations and parallel efficiency. The same numerical methods as an OpenFOAM-based solver named caafoam are used, which was recently developed for direct computation of aeroacoustic simulations. The current work presents a bottom-up optimization of caafoam. The mesh cells are ordered using Hamiltonian paths, where two consecutive cells in the path are guaranteed to be contiguous in memory, greatly reducing the failed data requests inherent to unstructured grids. To further reduce the memory footprint from the CPU, efficient data structures and compute kernel fusion are used. For optimal parallel efficiency, boundary and interior cells are treated separately to take full advantage of asynchronous MPI communication. Furthermore, the latest MPI-4 persistent neighborhood collective framework was implemented for optimal communications of ghost cells. A series of benchmarks is used to validate the accuracy and performance of the novel solver. For this preliminary study, relatively small grids of up to 648,000 cells were run. A comparative performance study shows a speed-up of 1.8x-6.6x when compared to the original caafoam solver.
更多
查看译文
关键词
hamiltonian path cell ordering,parallel efficiency,finite-volume
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要