Towards designing a hardware accelerator for 3D convolutional neural networks

Computers and Electrical Engineering(2023)

引用 3|浏览0
暂无评分
摘要
•The hardware design of 3D CNNs requires massive compute and memory resources.•A 3-dimensional design space exploration is performed to reduce the complexity of the model.•First, an efficient word-length is found that reduces the feature map and kernel sizes.•Next, input data tiling is performed to efficiently utilized the on-chip memory.•Then, different modes of parallelization are explored to minimize the off-chip accesses.•Finally, an FPGA-based complete hardware accelerator is proposed that can achieve a throughput of 1.29TOPs/s in inference stage.
更多
查看译文
关键词
3D CNN,Hardware architecture,Design space,FPGA implementation,I3D network,Input tiling,Memory access optimization,Time space mapping
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要