Implementation and performance evaluation of a communication-avoiding GMRES method for stencil-based code on GPU cluster

Matsumoto, Kazuya*; Idomura, Yasuhiro; Ina, Takuya*; Mayumi, Akie; Yamada, Susumu

A communication-avoiding generalized minimum residual method (CA-GMRES) is implemented on a hybrid CPU-GPU cluster to accelerate the iterative linear solver in the gyrokinetic toroidal five-dimensional Eulerian code GT5D. In addition to CA-GMRES, we implement and evaluate a modified variant (M-CA-GMRES), proposed in our previous study, that reduces the number of floating-point operations. The study demonstrates two advantages of CA-GMRES: it minimizes the number of collective communications, and it performs highly efficient calculations based on dense matrix-matrix operations. The performance evaluation is conducted on the Reedbush-L GPU cluster, which has four NVIDIA Tesla P100 GPUs per compute node. With 64 GPUs, M-CA-GMRES is 1.09x, 1.22x, and 1.50x faster than CA-GMRES, the generalized conjugate residual method (GCR), and GMRES, respectively.
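
The two advantages named in the abstract (few collective communications, dense matrix-matrix kernels) can be illustrated with a minimal single-process sketch. It is not the authors' GPU implementation. It assumes a toy 1D three-point stencil, a monomial s-step basis, and Cholesky QR as the block orthogonalization; the abstract does not say which variant the paper uses, and every function name below is hypothetical. The reduction count for modified Gram-Schmidt assumes one global reduction per inner product or norm.

```python
import math

def stencil_matvec(x):
    """y = A x for a toy 1D three-point stencil [-1, 2.5, -1], zero boundaries (assumption)."""
    n = len(x)
    return [2.5 * x[i]
            - (x[i - 1] if i > 0 else 0.0)
            - (x[i + 1] if i < n - 1 else 0.0)
            for i in range(n)]

def krylov_basis(v, s):
    """Monomial basis [v, Av, ..., A^(s-1) v]; a stencil needs only neighbour exchanges here."""
    basis = [list(v)]
    for _ in range(s - 1):
        basis.append(stencil_matvec(basis[-1]))
    return basis

def gram(basis):
    """G = V^T V as one dense product; on a cluster this would be a single all-reduce."""
    s = len(basis)
    return [[sum(a * b for a, b in zip(basis[i], basis[j])) for j in range(s)]
            for i in range(s)]

def cholesky_upper(G):
    """Return the upper-triangular R with G = R^T R."""
    s = len(G)
    R = [[0.0] * s for _ in range(s)]
    for i in range(s):
        R[i][i] = math.sqrt(G[i][i] - sum(R[k][i] ** 2 for k in range(i)))
        for j in range(i + 1, s):
            R[i][j] = (G[i][j] - sum(R[k][i] * R[k][j] for k in range(i))) / R[i][i]
    return R

def cholqr(basis):
    """Orthonormalize all s vectors at once: Q = V R^{-1}, using purely local updates."""
    R = cholesky_upper(gram(basis))
    q = []
    for j, v in enumerate(basis):
        w = list(v)
        for k in range(j):
            w = [wi - R[k][j] * qi for wi, qi in zip(w, q[k])]
        q.append([wi / R[j][j] for wi in w])
    return q, R

def mgs_reductions(s):
    """Global reductions if s vectors are orthogonalized one by one (j dots + 1 norm each)."""
    return s * (s + 1) // 2

if __name__ == "__main__":
    n, s = 32, 4
    v0 = [math.sin(i + 1.0) for i in range(n)]
    Q, R = cholqr(krylov_basis(v0, s))
    G = gram(Q)
    err = max(abs(G[i][j] - (1.0 if i == j else 0.0))
              for i in range(s) for j in range(s))
    print("orthogonality error:", err)
    print("reductions: block =", 1, ", vector-by-vector =", mgs_reductions(s))
```

In this sketch the only step that would need a collective operation is `gram`, and it is a dense matrix-matrix product. The monomial basis becomes ill-conditioned as s grows, so the example keeps s small.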

Language	:	English
Journal	:	Journal of Supercomputing
Volume	:	75
Issue	:	12
Pages	:	p.8115 - 8146
Publication date	:	2019/12
Keywords	:	Communication avoiding Krylov subspace method; GPU; CFD

Paper URL	:	https://doi.org/10.1007/s11227-019-02983-7
Facility used	:	Large-scale computer / supercomputer (Tokai)

InCites™	:	Percentile: 20.81; Field: Computer Science, Hardware & Architecture

Registration number : AA20190384
Abstract collection number : 48001307
Paper submission number : 23547