Efficient Inference Of Image-Based Neural Network Models In Reconfigurable Systems With Pruning And Quantization.

José Flich,Laura Medina,Izan Catalán,Carles Hernández,Andrea Bragagnolo,Fabrice Auzanneau,David Briand

ICIP（2022）

引用 3|浏览8

暂无评分

摘要

Neural networks (NN) for image processing in embedded systems expose two conflicting requirements: increasing computing power needs as models become more complex and constrained resource budget. In order to alleviate this problems, model compression based on quantization and pruning techniques are common. Derived models then need to fit on reconfigurable systems such as FPGAs for the embedded system to work properly. In this paper, we present HLSinf, an open source framework for the development of custom NN accelerators for FPGAs which provides efficient support to quantized and pruned NN models. With HLSinf, significant inference speedups can be obtained for typical medical image-based applications. In particular, we obtain up to 90x speedup factor when we combine quantization/pruning with the flexibility of HLSinf compared to CPU.

查看译文

关键词

reconfigurable systems,efficient inference,neural network models,pruning,neural network,image-based

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要