An efficient MPI/openMP parallelization of the Hartree-Fock method for the second generation of Intel ® Xeon Phi ™ processor

CoRR(2017)

引用 23|浏览5
暂无评分
摘要
Modern OpenMP threading techniques are used to convert the MPI-only Hartree-Fock code in the GAMESS program to a hybrid MPI/OpenMP algorithm. Two separate implementations that differ by the sharing or replication of key data structures among threads are considered, density and Fock matrices. All implementations are benchmarked on a super-computer of 3,000 Intel® Xeon Phi™ processors. With 64 cores per processor, scaling numbers are reported on up to 192,000 cores. The hybrid MPI/OpenMP implementation reduces the memory footprint by approximately 200 times compared to the legacy code. The MPI/OpenMP code was shown to run up to six times faster than the original for a range of molecular system sizes.
更多
查看译文
关键词
Quantum chemistry,parallel Hartree-Fock,parallel Self Consistent Field,OpenMP,MPI,GAMESS
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要