Refine your search�ソスF     
Report No.

Performance measurement of an urban wind simulation code with the Locally Mesh-Refined Lattice Boltzmann Method over NVIDIA and AMD GPUs

Asahi, Yuichi   ; Onodera, Naoyuki   ; Hasegawa, Yuta   ; Shimokawabe, Takashi*; Shiba, Hayato*; Idomura, Yasuhiro   

We have ported the GPU accelerated Lattice Boltzmann Method code "CityLBM" to AMD MI100 GPU. We present the performance of CityLBM achieved on NVIDIA P100, V100, A100 GPUs and AMDMI100 GPU. Using the host to host MPI communications, the performance on MI100 GPU is around 20% better than on V100 GPU. It has turned out that most of the kernels are successfully accelerated except for interpolation kernels for Adaptive Mesh Refinement (AMR) method.



- Accesses





[CLARIVATE ANALYTICS], [WEB OF SCIENCE], [HIGHLY CITED PAPER & CUP LOGO] and [HOT PAPER & FIRE LOGO] are trademarks of Clarivate Analytics, and/or its affiliated company or companies, and used herein by permission and/or license.