Implementation of GPU-FFT into Planewave Based First Principles Calculation Method
-
- TOMONO Hidekazu
- Department of Mechanical Engineering Informatics, School of Science and Technology, Meiji University
-
- AOKI Masaru
- School of Management, Shizuoka Sangyo University Department of Mechanical Engineering Informatics, School of Science and Technology, Meiji University
-
- IITAKA Toshiaki
- RIKEN (The Institute of Physical and Chemical Research)
-
- TSUMURAYA Kazuo
- Department of Mechanical Engineering Informatics, School of Science and Technology, Meiji University
説明
We present an implementation of a GPU based FFT routine (Graphics Processing Unit based Fast Fourier Transformation) into a CPU based ab initio periodic DFT (Density Functional Theory) calculation code. The FFT calculation in the CPU based DFT codes is the most time-consuming part; for the 128 silicon system, the fraction of time of a CPU FFT calculation amounts to 0.64 of the whole periodic DFT calculation. The replacement of a double precision FFT in the periodic PWscf code with a single precision FFT gives no appreciable differences in both the numerical total energies and the interatomic forces, guaranteeing the use of a single precision GPU based FFT, CUFFT, for the code. The use of the CUFFT reduces the fraction to 0.20 of the whole PWscf code; the replacement speedups a factor of 2.2 for single CPU system. The use of the multi-CPU system with the GPU FFT accelerates by 2.2f, where f is the acceleration factor of the multi-CPU system. The single precision GPU calculation is implementable in any self-consistent electronic structure code, except for the eigensolver part in the DFT codes.
収録刊行物
-
- Journal of Computational Science and Technology
-
Journal of Computational Science and Technology 5 (3), 89-105, 2011
一般社団法人 日本機械学会
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1390001205274226944
-
- NII論文ID
- 130001091548
-
- ISSN
- 18816894
-
- 本文言語コード
- en
-
- 資料種別
- journal article
-
- データソース種別
-
- JaLC
- Crossref
- CiNii Articles
- KAKEN
- OpenAIRE
-
- 抄録ライセンスフラグ
- 使用不可