Operational improvements of the job scheduling system in the large-scale computer system at the Japan Atomic Energy Agency
Kawazu, Ryohei 
The Japan Atomic Energy Agency (JAEA), a comprehensive research and development organization for nuclear energy, conducts research and development in a wide range of nuclear-related fields, and many of these activities rely on computational science and technology. The HPE SGI8600 supercomputer system (hereinafter referred to as the "supercomputer") was introduced in December 2020 as critical infrastructure to meet the growing computational demand driven by advances in technologies such as digital twins, machine learning, and big data processing, and it has become indispensable to research and development at JAEA. Because the supercomputer underpins JAEA's computational science and technology, operating jobs more efficiently and shortening program waiting times (hereinafter referred to as "job waiting times") helps improve research and development efficiency. This report presents the results of an investigation into how job waiting times changed after the integration of queue classes, which was implemented in fiscal year 2022 to use computational resources more efficiently. It summarizes the process from the analysis of the supercomputer's usage information, through the integration of queue classes, to the resulting improvement in job waiting times.