Accelerating Matrix Multiplication in Deep Learning by Using Low-Rank Approximation

2017 International Conference on High Performance Computing & Simulation (HPCS) (2017)

Abstract
Open-source deep learning frameworks such as TensorFlow, Caffe, and Torch are widely used all over the world, and accelerating them is of great importance. In these frameworks, a large part of the computation time is spent on convolution, and highly tuned libraries such as cuDNN play an important role in accelerating it. In these libraries, however, the convolution computation is performed on dense matrices without any approximation. In this research, we propose a method that introduces low-rank approximation, widely used in the field of scientific and technical computation, into the convolution computation. Investigating the influence on the recognition accuracy of existing models shows that the rank of the data matrices can be reduced by up to about 90% while keeping recognition accuracy within 2% of the baseline.
Keywords
low-rank approximation,deep learning,image recognition
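The abstract does not spell out which factorization the authors use inside the convolution. The following is a minimal NumPy sketch of the general idea only: a dense product A·B (e.g., an im2col-unfolded data matrix times a filter matrix) is replaced by a product through a rank-k truncated SVD of A. The function name, matrix sizes, and the chosen rank are illustrative assumptions, not values from the paper.

```python
import numpy as np

def lowrank_matmul(A, B, rank):
    """Approximate A @ B via a rank-k truncated SVD of A.

    A: (m, n) data matrix (e.g., im2col-unfolded input patches)
    B: (n, p) weight/filter matrix
    rank: number of singular values kept
    """
    # Truncated SVD: A ~= U_k diag(s_k) Vt_k
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    U_k, s_k, Vt_k = U[:, :rank], s[:rank], Vt[:rank, :]
    # Multiply through the low-rank factors: O((m + p) * n * k) flops
    # instead of O(m * n * p) for the exact product (ignoring the cost
    # of the SVD itself; in practice a cheaper factorization would be used).
    return U_k @ (s_k[:, None] * (Vt_k @ B))

# Illustrative example: a 512 x 4608 data matrix with a rapidly decaying
# spectrum (feature matrices are often approximately low rank) times a
# 4608 x 256 filter matrix, keeping ~12% of the full rank.
rng = np.random.default_rng(0)
A = rng.standard_normal((512, 64)) @ rng.standard_normal((64, 4608)) \
    + 0.01 * rng.standard_normal((512, 4608))
B = rng.standard_normal((4608, 256))
C_exact = A @ B
C_approx = lowrank_matmul(A, B, rank=64)
rel_err = np.linalg.norm(C_exact - C_approx) / np.linalg.norm(C_exact)
print(f"relative error at rank 64 of 512: {rel_err:.4f}")
```

The trade-off sketched here is the one the abstract evaluates: lowering the kept rank reduces the multiplication cost, at the price of approximation error that can affect recognition accuracy.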