The analysis of a plane wave pseudopotential density functional theory code on a GPU machine.

Computer Physics Communications (2013)

Abstract
Plane wave pseudopotential (PWP) density functional theory (DFT) calculations are the most widely used material science simulations, and PWP DFT codes are arguably the most important material science codes. We have implemented a PWP DFT code, PEtot, on a multi-node GPU machine. Building on previous work, we have further improved the speed of the code and achieved speedups of 13x to 22x over the CPU calculations for a typical 512-atom system. These speedups are much higher than those reported in similar work for this important class of material simulation codes on GPU clusters. The current achievement is obtained by (1) moving the calculation fully onto the GPU; (2) adopting a new algorithm to reduce the amount of data in MPI communication; and (3) using new GPU and CPU numerical libraries. We also provide a detailed quantitative analysis of the computational times for different physical systems and numbers of GPU units, which helps in understanding the challenges and bottlenecks of PWP DFT simulations on GPU machines. Based on this analysis, we list the machine and library requirements needed to further improve the performance of PWP DFT calculations.
Keywords
Electronic structure, First-principles, Density functional theory, Plane wave pseudopotential, GPU