Search Results: Records 1-20 displayed on this page of 21

Journal Articles

Overlapping communications in gyrokinetic codes on accelerator-based platforms

Asahi, Yuichi*; Latu, G.*; Bigot, J.*; Maeyama, Shinya*; Grandgirard, V.*; Idomura, Yasuhiro

Concurrency and Computation; Practice and Experience, 32(5), p.e5551_1 - e5551_21, 2020/03

 Times Cited Count:0 Percentile:0.01(Computer Science, Software Engineering)

Two five-dimensional gyrokinetic codes, GYSELA and GKV, were ported to modern accelerators: the Xeon Phi KNL and the Tesla P100 GPU. Serial computing kernels of GYSELA on KNL and of GKV on the P100 GPU were 1.3x and 7.4x faster, respectively, than on a single Skylake processor. Scaling tests of GYSELA and GKV were performed from 16 to 512 KNLs and from 32 to 256 P100 GPUs, respectively; data transpose communications in the semi-Lagrangian kernels of GYSELA and in the convolution kernels of GKV were found to be the main bottlenecks. To mitigate these communication costs, pipeline-based and task-based communication overlapping were implemented in the codes.

Journal Articles

Vector performance prediction of kernel loops on Earth Simulator

Yokokawa, Mitsuo; Saito, Minoru*; Hagiwara, Takashi*; Isobe, Yoko*; Jinguji, Satoshi*

Nihon Keisan Kogakkai Rombunshu, 4, p.31 - 36, 2002/00

The Earth Simulator is a distributed-memory parallel system consisting of 640 processor nodes connected by a full crossbar network. Each processor node is a shared-memory system composed of eight vector processors. The total peak performance and main memory capacity are 40 Tflops and 10 TB, respectively. A performance prediction system, GS$$^3$$, has been developed for the Earth Simulator to estimate the sustained performance of programs. To validate the accuracy of vector performance prediction by GS$$^3$$, the processing times that it estimates for three groups of kernel loops are compared with those measured on an SX-4. The absolute relative errors of the processing time are found to be 0.89%, 1.42%, and 6.81% on average for the three groups. The sustained performance of the three groups on a single processor of the Earth Simulator has also been estimated by GS$$^3$$ at 5.94 Gflops, 3.76 Gflops, and 2.17 Gflops on average.

Journal Articles

A New memory allocation method for shared memory multiprocessors with large virtual address space

Koide, Hiroshi; Suzuki, Mitsugu*; Nakayama, Yasuichi*

Concurrency; Practice and Experience, 9(9), p.897 - 914, 1997/09

 Times Cited Count:0 Percentile:0.02(Computer Science, Software Engineering)

no abstracts in English

Journal Articles

The Activities of center for promotion of computational science and engineering

Proc. of Joint Int. Conf. on Mathematical Methods and Supercomputing for Nuclear Applications, 1, p.3 - 16, 1997/00

no abstracts in English

Journal Articles

Development of Monte Carlo machine for particle transport problem

Higuchi, Kenji; ;

Journal of Nuclear Science and Technology, 32(10), p.953 - 964, 1995/10

 Times Cited Count:1 Percentile:17.89(Nuclear Science & Technology)

no abstracts in English

Journal Articles

Development of JAERI Monte Carlo machine and its effective performance

Higuchi, Kenji; ; ; ; Tokuda, Shinji; *; *; *

Comput. Assist. Mech. Eng. Sci., 1, p.191 - 204, 1994/00

no abstracts in English

JAEA Reports

Motion-picture graphic display processor for ROSA-IV/LSTF experimental data

Anoda, Yoshinari; *; *; *; *; *

JAERI-M 91-151, 51 Pages, 1991/09

JAERI-M-91-151.pdf:1.42MB

no abstracts in English

Journal Articles

Parallelization of MCACE, a Monte Carlo code for shielding analysis

*; Minami, Kazuyoshi*; ; Masukawa, Fumihiro; Naito, Yoshitaka

Joho Shori Gakkai Kenkyu Hokoku, 91(61), p.25 - 32, 1991/07

no abstracts in English

Journal Articles

Queuing model analysis of the Fujitsu VP2000 with dual scalar architecture

Int. J. Supercomputer Appl., 5(3), p.46 - 62, 1991/00

 Times Cited Count:1 Percentile:51.37(Computer Science, Hardware & Architecture)

no abstracts in English

JAEA Reports

Performance evaluation of a vectorized KENO IV code

*; Higuchi, Kenji;

JAERI-M 90-135, 54 Pages, 1990/08

JAERI-M-90-135.pdf:1.18MB

no abstracts in English

Journal Articles

A new method of liquid crystal thermometry excluding human color sensation; (Narrow band optical filter method)

*; *; Akino, Norio

Nihon Kikai Gakkai Rombunshu, B, 53(485), p.241 - 249, 1987/00

no abstracts in English

JAEA Reports

Structured Programming Supporting Tool EOS77

; Tsunematsu, Toshihide;

JAERI-M 86-159, 32 Pages, 1986/11

JAERI-M-86-159.pdf:0.81MB

no abstracts in English

Journal Articles

The current techniques for utilization of large-scale computers at JAERI

Asai, Kiyoshi

Genshiryoku Kogyo, 31(10), p.15 - 18, 1985/00

no abstracts in English

Journal Articles

Super-computer and nuclear calculations

; ; *

Nihon Genshiryoku Gakkai-Shi, 25(3), p.164 - 171, 1983/00

 Times Cited Count:1 Percentile:22.74(Nuclear Science & Technology)

no abstracts in English

JAEA Reports

A Design of a Computer Complex Including Vector Processors

JAERI-M 82-200, 111 Pages, 1982/12

JAERI-M-82-200.pdf:3.01MB

no abstracts in English

JAEA Reports

Preprocessor System ``EOS'' for a Variable-Array-Size Program

; Tsunematsu, Toshihide;

JAERI-M 82-097, 64 Pages, 1982/08

JAERI-M-82-097.pdf:1.55MB

no abstracts in English

JAEA Reports

SCAN: A Fortran Syntax Analyzer-Its Structure and Facilities

;

JAERI-M 9719, 71 Pages, 1981/10

JAERI-M-9719.pdf:1.51MB

no abstracts in English

Journal Articles

Topics on '79 Nuclear Science Symposium

Kumahara, Tadashi

Denshi Kogyo Geppo, 22(3), p.44 - 49, 1980/00

no abstracts in English

Journal Articles

Current status of automated radiation monitoring systems

Hoken Butsuri, 15(4), p.269 - 276, 1980/00

no abstracts in English

Journal Articles

Running time delays in processor-sharing system

J.Inf.Process., 3(1), p.38 - 44, 1980/00

no abstracts in English

JAEA Reports

「Micro-8」 micro-Computer System

; ; ; ;

JAERI-M 7786, 79 Pages, 1978/08

JAERI-M-7786.pdf:2.11MB

no abstracts in English
