RePIM: Joint Exploitation of Activation and Weight Repetitions for In-ReRAM DNN Acceleration

2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC)(2021)

引用 11|浏览16
暂无评分
摘要
Eliminating redundant computations is a common approach to improve the performance of ReRAM-based DNN accelerators. While existing practical ReRAM-based accelerators eliminate part of the redundant computations by exploiting sparsity in inputs and weights or utilizing weight patterns of DNN models, they fail to identify all the redundancy, resulting in many unnecessary computations. Thus, we propose a practical design, RePIM, that is the first to jointly exploit the repetition of both inputs and weights. Our evaluation shows that RePIM is effective in eliminating unnecessary computations, achieving an average of 15.24x speedup and 96.07% energy savings over the state-of-the-art practical ReRAM-based accelerator.
更多
查看译文
关键词
repetition,redundancy,DNN models,weight patterns,practical ReRAM-based accelerators,ReRAM-based DNN accelerators,redundant computations,In-ReRAM DNN Acceleration,joint exploitation,RePIM
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要