GPU optimization of lattice Boltzmann method with local ensemble transform Kalman filter

格子ボルツマン法および局所アンサンブル変換カルマンフィルタのGPU最適化

長谷川雄太 ; 今村俊幸*; 伊奈拓也 ; 小野寺直幸 ; 朝比祐一 ; 井戸村泰宏

Hasegawa, Yuta; Imamura, Toshiyuki*; Ina, Takuya; Onodera, Naoyuki; Asahi, Yuichi; Idomura, Yasuhiro

格子ボルツマン法(LBM)に基づく数値流体力学シミュレーションおよび局所アンサンブル変換カルマンフィルタ(LETKF)によるアンサンブルデータ同化をNVIDIA A100 GPU搭載スパコンに対して実装し、および最適化した。LBMとLETKFの協働のため、データ転置通信を実装し、LETKFのデータ依存性に基づいて計算,ファイルI/O、および通信のオーバーラップにより通信を最適化した。2次元等方乱流,アンサンブル数,格子点数の条件において、通信を最適化した実装は、LETKFを並列化しない単純な実装に対して3.85倍の高速化を達成した。LETKFの主要な計算カーネルはの実対称密行列の固有値分解であり、その計算のため、バッチ形式固有値分解ソルバEigenGを新たに開発した。EigenGによるバッチ形式固有値分解は、既存ライブラリであるcuSolverに対して64倍の高速化を達成した。

The ensemble data assimilation of computational fluid dynamics simulations based on the lattice Boltzmann method (LBM) and the local ensemble transform Kalman filter (LETKF) is implemented and optimized on a GPU supercomputer based on NVIDIA A100 GPUs. To connect the LBM and LETKF parts, data transpose communication is optimized by overlapping computation, file I/O, and communication based on data dependency in each LETKF kernel. In two dimensional forced isotropic turbulence simulations with the ensemble size of and the number of grid points of , the optimized implementation achieved speedup from the naive implementation, in which the LETKF part is not parallelized. The main computing kernel of the local problem is the eigenvalue decomposition (EVD) of real symmetric dense matrices, which is computed by a newly developed batched EVD in EigenG. The batched EVD in EigenG outperforms that in cuSolver, and speedup was achieved.

発表言語	:	English
掲載資料名	:	Proceedings of 13th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH22) (Internet)
巻	:
号	:
ページ数	:	p.10 - 17
発行年月	:	2022/00
発表会議名	:	International Conference for High Performance Computing, Networking, Storage, and Analysis (SC22); 13th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH22)
開催年月	:	2022/11
開催都市	:	Dallas (online)
開催国	:	U.S.A.
キーワード	:	Data Assimilation; Local Ensemble Transform Kalman Filter; Lattice Boltzmann Method; GPU

特許データ	:
PDF	:

論文URL	:	https://doi.org/10.1109/ScalAH56622.2022.00007
研究データの公開先DOI	:	本成果にかかわる研究データのリンクです。
使用施設	:	大型計算機・スパコン(東海)
広報プレスリリース	:
論文解説記事 (成果普及情報誌)	:	GPUを駆使して流体解析と観測データを高速に同化する手法の開発[]
受委託・共同研究相手機関	:	理化学研究所

Access	:	- Accesses
Web of Science® Times Cited Count	:	被引用回数：評価・統計等のため最新の被引用回数を確認したい場合は、直接Web of Science®をご確認ください。 http://www.webofknowledge.com/wos
InCites™	:
Altmetrics	:

登録番号 : BB20220814
抄録集掲載番号 : 51000349
論文投稿番号 : 26315

[CLARIVATE ANALYTICS], [WEB OF SCIENCE], [HIGHLY CITED PAPER & CUP LOGO] and [HOT PAPER & FIRE LOGO] are trademarks of Clarivate Analytics, and/or its affiliated company or companies, and used herein by permission and/or license.