A Scalable Cluster-based Hierarchical Hardware Accelerator for a Cortically Inspired Algorithm

ACM Journal on Emerging Technologies in Computing Systems (2021)

Abstract
This article describes a scalable, configurable, cluster-based hierarchical hardware accelerator with a custom hardware architecture for Sparsey, a cortical learning algorithm. Sparsey is inspired by the operation of the human cortex and uses a Sparse Distributed Representation to perform unsupervised learning and inference within the same algorithm. A distributed on-chip memory organization is designed and implemented in custom hardware to improve memory bandwidth and accelerate read/write operations on the synaptic weight matrices. Bit-level data are processed from the distributed on-chip memory, and custom multiply-accumulate hardware is implemented for binary and fixed-point multiply-accumulate operations. Fixed-point arithmetic and fixed-point storage are also adopted in this implementation. At 16 nm, the custom Sparsey hardware achieves an overall 24.39× speedup, 353.12× higher energy efficiency per frame, and a 1.43× reduction in silicon area compared with a state-of-the-art GPU.
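The binary-by-fixed-point multiply-accumulate operation mentioned in the abstract can be illustrated with a short software sketch. Because the input vector is a sparse binary code, each multiplication degenerates into a conditional addition of a fixed-point synaptic weight. The Q1.15 format, function name, and array layout below are illustrative assumptions, not the paper's actual hardware datapath.

```c
#include <stdint.h>
#include <stddef.h>

/* Assumed Q1.15 fixed-point weight format: 16-bit signed, 15 fractional bits. */
typedef int16_t q15_t;

/*
 * Binary-by-fixed-point multiply-accumulate (illustrative sketch).
 * With a sparse binary input code, each "multiply" reduces to a
 * conditional add of the corresponding fixed-point synaptic weight.
 */
static int32_t mac_binary_q15(const uint8_t *x_bits, const q15_t *w, size_t n)
{
    int32_t acc = 0;                  /* wide accumulator to avoid overflow */
    for (size_t i = 0; i < n; i++) {
        if (x_bits[i])                /* active bit -> accumulate weight    */
            acc += (int32_t)w[i];
    }
    return acc;                       /* Q1.15 sum with 32-bit headroom     */
}
```

In hardware, this reduction of the multiplier to a gated adder is one plausible way a bit-level datapath over distributed on-chip memory can save area and energy, consistent with the gains the abstract reports.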
Keywords
Neuromorphic computing, accelerator, cortical processor, hierarchical temporal memory, sparse distributed memory