谷歌Chrome浏览器插件
订阅小程序
在清言上使用

VideoTime3: A 40-uJ/frame 38 FPS Video Understanding Accelerator With Real-Time DiffFrame Temporal Redundancy Reduction and Temporal Modeling

IEEE Solid-State Circuits Letters(2023)

引用 0|浏览88
暂无评分
摘要
VideoTime(3) is an accelerator for state-of-the-art video understanding with deep learning on the edge. Different from prior work, it highlights a real-time DiffFrame convolution achieving $2.2\times $ DRAM access reduction compared to conventional convolution, a sorter-free architecture for efficient sparse output stationary dataflow, temporal modeling capability achieving high accuracy on video understanding applications, and optimized data buffering to remove DRAM traffic overhead for temporal modeling and reduce 55%-79% input activation DRAM traffic in depth-wise layers. The chip consumes 40 uJ/frame with 38 frames/s at 0.6 V in 28-nm CMOS.
更多
查看译文
关键词
Random access memory,Convolution,Generators,Real-time systems,Streaming media,Solid state circuits,Redundancy,Activation sparsity handling,deep learning (DL),Index Terms,hardware accelerator,video understanding
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要