Fast Sparse Deep Neural Network Inference with Flexible SpMM Optimization Space Exploration

2021 IEEE High Performance Extreme Computing Conference (HPEC)(2021)

Cited 10|Views10
No score
Abstract
Deep neural networks (DNN) have been widely used in many fields. With the ever-increasing model size, the DNN scalability suffers. Sparse deep neural networks (SpDNN) are promising to resolve this problem, but the sparse data makes it difficult to execute efficiently on GPUs due to load imbalance and irregular memory accesses. The recent MIT/IEEE/Amazon GraphChallenge has shown several big advance...
More
Translated text
Key words
Deep learning,Scalability,Memory management,Graphics processing units,Throughput,Inference algorithms,Space exploration
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined