Fast Sparse Deep Neural Network Inference with Flexible SpMM Optimization Space Exploration
2021 IEEE High Performance Extreme Computing Conference (HPEC)(2021)
Abstract
Deep neural networks (DNN) have been widely used in many fields. With the ever-increasing model size, the DNN scalability suffers. Sparse deep neural networks (SpDNN) are promising to resolve this problem, but the sparse data makes it difficult to execute efficiently on GPUs due to load imbalance and irregular memory accesses. The recent MIT/IEEE/Amazon GraphChallenge has shown several big advance...
MoreTranslated text
Key words
Deep learning,Scalability,Memory management,Graphics processing units,Throughput,Inference algorithms,Space exploration
AI Read Science
Must-Reading Tree
Example
![](https://originalfileserver.aminer.cn/sys/aminer/pubs/mrt_preview.jpeg)
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined