Accelerating Sparse Matrix-Matrix Multiplication With 3d-Stacked Logic-In-Memory Hardware

2013 IEEE CONFERENCE ON HIGH PERFORMANCE EXTREME COMPUTING (HPEC)(2013)

引用 69|浏览52
暂无评分
摘要
This paper introduces a 3D-stacked logic-in-memory (LiM) system to accelerate the processing of sparse matrix data that is held in a 3D DRAM system. We build a customized content addressable memory (CAM) hardware structure to exploit the inherent sparse data patterns and model the LiM based hardware accelerator layers that are stacked in between DRAM dies for the efficient sparse matrix operations. Through silicon vias (TSVs) are used to provide the required high inter-layer bandwidth. Furthermore, we adapt the algorithm and data structure to fully leverage the underlying hardware capabilities, and develop the necessary design framework to facilitate the design space evaluation and LiM hardware synthesis. Our simulation demonstrates more than two orders of magnitude of performance and energy efficiency improvements compared with the traditional multi-threaded software implementation on modern processors.
更多
查看译文
关键词
3D-Stacked DRAM, TSV, Logic-in-Memory, CAM, Sparse Matrix Matrix Multiplication
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要