A 400MHz NPU with 7.8TOPS 2 /W High-PerformanceGuaranteed Efficiency in 55nm for Multi-Mode Pruning and Diverse Quantization Using Pattern-Kernel Encoding and Reconfigurable MAC Units

2021 IEEE Custom Integrated Circuits Conference (CICC)(2021)

引用 2|浏览14
暂无评分
摘要
Deep neural networks present a promising future in applications, ranging from face ID on mobile phones to self-driving cars. Weight pruning and quantization act as valuable solutions to release the burden of computation and memory. Figure 1 shows the family of weight pruning, including the fine-grained and several structural pruning methods. With similar compression rates, coarse-grained pruning r...
更多
查看译文
关键词
Quantization (signal),Convolution,Neural networks,Mobile handsets,Hardware,Encoding,Distance measurement
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要