A 28nm 16kb Aggregation and Combination Computing-in-Memory Macro with Dual-level Sparsity Modulation and Sparse-Tracking ADCs for GCNs.

Zhaoyang Zhang, Zhichao Liu, Feiran Liu, Yinhai Gao, Yuchen Ma, Yutong Zhang,An Guo,Tianzhu Xiong, Jinwu Chen,Xi Chen,Bo Wang, Yuchen Tang, Xingyu Pu, Xing Wang,Jun Yang,Xin Si

IEEE Custom Integrated Circuits Conference(2024)

Cited 0|Views8
No score
Abstract
Graph convolutional neural networks (GCNs) are neural networks for graph structures with large vertex features [1]. GCNs have two phases: aggregation and combination. During the combination phase, matrix multiplications are performed using trained weights and aggregated features. The aggregation phase, however, requires traversing the graph to gather features from neighboring vertices. However, performing aggregation on graphs with high irregularity and sparsity presents challenges in terms of memory bandwidth utilization. SRAM computing-in-memory (CIM) can address this issue by reducing data movement and increasing parallelism. Nevertheless, applying GCNs to CIM faces several challenges: (1) The aggregation operation is highly irregular due to the varying number of neighbors for each vertex in the graph, which leads to a huge amount of PSUM overhead. Additionally, it is challenging to obtain the degree matrix data required for refined aggregation in CIM. (2) Aggregation and combination are two distinct operations with different operator types, data sparsity, and operation scales. This makes it challenging to perform these operations within a standard memory architecture. (3) The efficiency of the operations is affected by input sparsity in aggregation and bit sparsity in both aggregation and combination.
More
Translated text
Key words
Graph Convolutional Network,Convolutional Neural Network,Energy Efficiency,Matrix Multiplication,Access Time,Vertices,Capacitive Coupling,Combination Of Operators,Aggregation Operators,Maximum Energy Efficiency,Compact Array,Distinct Operations
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined