Optimization of fusion kernels on accelerators with indirect or strided memory access patterns

Asahi, Yuichi*; Latu, G.*; Ina, Takuya; Idomura, Yasuhiro; Grandgirard, V.*; Garbet, X.*

High-dimensional stencil computations from fusion plasma turbulence codes are optimized on accelerators such as GPGPUs and Xeon Phi coprocessors. The kernels involve complex memory access patterns: indirect memory access in a Semi-Lagrangian scheme and strided memory access in a Finite-Difference scheme. On both devices, the Array of Structure of Array (AoSoA) data layout is preferable for contiguous memory accesses. On Xeon Phi, effective use of the local cache through improved spatial and temporal data locality is critical. On GPGPU, using texture memory improves the performance of the indirect memory accesses in the Semi-Lagrangian scheme. With these optimizations, the fusion kernels on accelerators run 1.4x - 8.1x faster than the optimized kernels on Sandy Bridge (CPU).
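
As a rough illustration of why an AoSoA layout can give contiguous accesses, the minimal Python sketch below compares the flat-index arithmetic of three layouts. The function names, the field count, and the block width are assumptions chosen for illustration only; the sketch does not reproduce the kernels or the layout parameters used in the paper.

```python
# Hypothetical sketch: flat-index arithmetic for three data layouts.
# All names and sizes here are illustrative assumptions, not taken
# from the paper.

def aos_index(elem, field, n_fields):
    """Array of Structures: the fields of one element are adjacent."""
    return elem * n_fields + field


def soa_index(elem, field, n_elems):
    """Structure of Arrays: all elements of one field are adjacent."""
    return field * n_elems + elem


def aosoa_index(elem, field, n_fields, width):
    """AoSoA: elements are grouped into blocks of `width` elements.

    Inside a block, each field is stored as a contiguous run of
    `width` values, so `width` neighbouring elements of the same
    field sit next to each other in memory.
    """
    block, lane = divmod(elem, width)
    return (block * n_fields + field) * width + lane


if __name__ == "__main__":
    n_fields, width = 3, 4
    # Flat offsets touched when reading field 1 of elements 0..7.
    print([aosoa_index(e, 1, n_fields, width) for e in range(8)])
    # prints [4, 5, 6, 7, 16, 17, 18, 19]
```

In this sketch, reading one field across a block touches `width` consecutive offsets (4 to 7, then 16 to 19), whereas the same read in the AoS layout would touch every third offset. Choosing `width` to match the SIMD or warp width of the target device is one plausible design choice, stated here as an assumption.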

Language	:	English
Journal	:	IEEE Transactions on Parallel and Distributed Systems
Volume	:	28
Issue	:	7
Pages	:	p.1974 - 1988
Publication date	:	2017/07
Conference name	:
Conference date	:
Conference city	:
Conference country	:
Keywords	:	Gyrokinetics; Accelerator; Stencil Computation

Patent data	:
PDF	:

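Paper URL	:	https://doi.org/10.1109/TPDS.2016.2633349
Research data DOI	:
Facilities used	:
Press release	:
Explanatory articles (outreach magazine)	:	Development of power-saving computing techniques using accelerators; Accelerator optimization of nuclear fluid computation kernels
Contract / collaborative research partner	:	Ministry of Education, Culture, Sports, Science and Technology (MEXT)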

InCites™	:	Percentile: 51.40; Field: Computer Science, Theory & Methods

Registration number : AA20160419
Abstract collection number : 45000862
Paper submission number : 19106