Trained Rank Pruning for Efficient Deep Neural Networks

neural information processing systems(2019)

引用 5|浏览0
暂无评分
摘要
To accelerate DNNs inference, low-rank approximation has been widely adopted because of its solid theoretical rationale and efficient implementations. Several previous works attempted to directly approximate a pre-trained model by low-rank decomposition; however, small approximation errors in parameters can ripple over a large prediction loss. Apparently, it is not optimal to separate low-rank app...
更多
查看译文
关键词
low-rank,decomposition,acceleration,pruning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要