XcalableMP and XcalableACC for Productivity and Performance in HPC Challenge Award Competition Class 2 at SC 14

semanticscholar(2014)

Abstract
We present XcalableMP [1–4] implementations of High-Performance Linpack (HPL), Fast Fourier Transform (FFT), STREAM, and RandomAccess on the K computer [5]. We also present XcalableACC [6, 7] implementations of HPL, FFT, STREAM, and, as an additional benchmark, the Himeno Benchmark [8] on HA-PACS/TCA [9], a GPU cluster. The highlights of this submission are as follows:
• Table 1 shows the SLOC (source lines of code) of the implementations.
• Table 2 shows the experimental environments.
• Table 3 and Table 4 summarize the performance results.