Efficient Breadth First Search on Multi-GPU Systems Using GPU-Centric OpenSHMEM.

Lecture Notes in Computer Science (2018)

Abstract
NVSHMEM is an implementation of OpenSHMEM for NVIDIA GPUs that allows communication to be issued from inside CUDA kernels. In this work, we present an implementation of Breadth First Search (BFS) for multi-GPU systems using NVSHMEM. We analyze the benefits and bottlenecks of moving fine-grained communication into CUDA kernels. In the best case, our BFS implementation achieves a 75% performance improvement over a CUDA-aware MPI-based implementation.
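As a rough illustration of the communication pattern the abstract refers to, the following is a minimal sequential Python simulation of a level-synchronous BFS over a partitioned graph. It is not the authors' implementation and contains no NVSHMEM or CUDA calls. The function name `partitioned_bfs`, the cyclic vertex-to-owner mapping, and the per-partition frontier lists are assumptions made for this sketch. Each simulated processing element (PE) expands only the frontier vertices it owns, and every newly discovered vertex is sent to its owner's next frontier, which stands in for the fine-grained remote write a GPU kernel might issue.

```python
def partitioned_bfs(adj, num_parts, source):
    """Level-synchronous BFS with vertices split across simulated PEs.

    adj       -- adjacency list: adj[u] is the list of neighbours of u
    num_parts -- number of simulated PEs (hypothetical stand-in for GPUs)
    source    -- start vertex
    Returns a list of BFS distances, with -1 for unreachable vertices.
    """
    n = len(adj)

    def owner(v):
        # Assumed cyclic partitioning: vertex v lives on PE (v mod num_parts).
        return v % num_parts

    dist = [-1] * n
    dist[source] = 0

    # One frontier per PE; a PE only expands vertices it owns.
    frontiers = [[] for _ in range(num_parts)]
    frontiers[owner(source)].append(source)

    level = 0
    while any(frontiers):
        next_frontiers = [[] for _ in range(num_parts)]
        for pe in range(num_parts):
            for u in frontiers[pe]:
                for v in adj[u]:
                    if dist[v] == -1:
                        dist[v] = level + 1
                        # Stand-in for a fine-grained one-sided write
                        # to the PE that owns v.
                        next_frontiers[owner(v)].append(v)
        # Swapping frontiers stands in for a global barrier between levels.
        frontiers = next_frontiers
        level += 1
    return dist
```

In this sketch each discovered vertex triggers one small message to its owner, which mirrors the fine-grained style of communication discussed in the abstract; a bulk-synchronous alternative would instead buffer discoveries per destination and exchange them once per level.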
Key words
efficient breadth first search, multi-GPU, GPU-centric