A 28nm 77.35TOPS/W Similar Vectors Traceable Transformer Processor with Principal-Component-Prior Speculating and Dynamic Bit-wise Stationary Computing.

VLSI Technology and Circuits(2023)

引用 0|浏览21
暂无评分
摘要
This paper proposes an energy-efficient Transformer processor exploiting dynamic similarity in global attention computing. It has three features: 1) A principal-component-prior speculation unit (PCSU) removes 28.4% of redundant computations. 2) A similar-vector tracked computing engine (STCE) saves 42.2% of multiplications. 3) A bit-wise stationary processing element (BSPE) reduces multiplication energy by $1.47\times$. The proposed processor achieves a peak energy efficiency of 77.35TOPS/W. It reduces energy by $2.81\times$ and offers $3.71\times$ speedup compared with the state-of-the-art Transformer processor.
更多
查看译文
关键词
bit-wise stationary processing element,dynamic bit-wise stationary computing,dynamic similarity,energy-efficient Transformer processor,global attention computing,multiplication energy,peak energy efficiency,principal-component-prior speculation unit,redundant computations,similar-vector tracked computing engine,size 28.0 nm,state-of-the-art Transformer processor
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要