A Distributed Multi-GPU System for Fast Graph Processing.

International Conference on Very Large Data Bases (2018)

Abstract
We present Lux, a distributed multi-GPU system that achieves fast graph processing by exploiting the aggregate memory bandwidth of multiple GPUs and taking advantage of locality in the memory hierarchy of multi-GPU clusters. Lux provides two execution models that optimize algorithmic efficiency and enable important GPU optimizations, respectively. Lux also uses a novel dynamic load balancing strategy that is cheap and achieves good load balance across GPUs. In addition, we present a performance model that quantitatively predicts the execution times and automatically selects the runtime configurations for Lux applications. Experiments show that Lux achieves up to 20x speedup over state-of-the-art shared memory systems and up to two orders of magnitude speedup over distributed systems.
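
The abstract does not spell out how the load balancing strategy works. As a generic illustration only, and not Lux's actual algorithm, one inexpensive way to balance graph work across GPUs is to split the vertex range into contiguous partitions whose estimated work (for example, edge counts per vertex) is roughly equal, and to recompute the boundaries when fresh estimates arrive. The function name `balanced_ranges` and the `work` input are assumptions made for this sketch.

```python
from bisect import bisect_left
from itertools import accumulate


def balanced_ranges(work, num_parts):
    """Split vertices 0..n-1 into contiguous ranges of roughly equal work.

    Hypothetical helper for illustration; `work[v]` is an estimated cost
    for vertex v (e.g. its edge count). Returns half-open (start, end) pairs.
    """
    prefix = list(accumulate(work))  # prefix[i] = total work of vertices 0..i
    total = prefix[-1] if prefix else 0
    bounds = [0]
    for p in range(1, num_parts):
        target = total * p / num_parts
        # Cut just after the first vertex whose cumulative work reaches the target.
        cut = bisect_left(prefix, target) + 1
        cut = max(cut, bounds[-1])   # keep boundaries non-decreasing
        cut = min(cut, len(work))    # never cut past the last vertex
        bounds.append(cut)
    bounds.append(len(work))
    return [(bounds[i], bounds[i + 1]) for i in range(num_parts)]


if __name__ == "__main__":
    # One heavy vertex followed by four light ones, split across two GPUs.
    print(balanced_ranges([4, 1, 1, 1, 1], 2))
```

Calling the function again with updated per-vertex estimates yields new boundaries, which is the sense in which such a scheme can be re-run cheaply between iterations; only the prefix sums and a binary search per boundary are needed.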
Key words
graph, processing, multi-GPU