Vitruvius plus : An Area-Efficient RISC-V Decoupled Vector Coprocessor for High Performance Computing Applications

ACM Transactions on Architecture and Code Optimization(2023)

引用 8|浏览52
暂无评分
摘要
The maturity level of RISC-V and the availability of domain-specific instruction set extensions, like vector processing, make RISC-V a good candidate for supporting the integration of specialized hardware in processor cores for the High Performance Computing (HPC) application domain. In this article,(1) we present Vitruvius+, the vector processing acceleration engine that represents the core of vector instruction execution in the HPC challenge that comes within the EuroHPC initiative. It implements the RISC-V vector extension (RVV) 0.7.1 and can be easily connected to a scalar core using the Open Vector Interface standard. Vitruvius+ natively supports long vectors: 256 double precision floating-point elements in a single vector register. It is composed of a set of identical vector pipelines (lanes), each containing a slice of the Vector Register File and functional units (one integer, one floating point). The vector instruction execution scheme is hybrid in-order/out-oforder and is supported by register renaming and arithmetic/memory instruction decoupling. On a stand-alone synthesis, Vitruvius+ reaches a maximum frequency of 1.4 GHz in typical conditions (TT/0.80V/25 degrees C) using GLOBALFOUNDRIES 22FDX FD-SOI. The silicon implementation has a total area of 1.3 mm(2) and maximum estimated power of similar to 920 mW for one instance of Vitruvius+ equipped with eight vector lanes.
更多
查看译文
关键词
RISC-V,vector accelerator,SIMD,HPC
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要