Architectural exploration of a fine-grained 3D cache for high performance in a manycore context

Very Large Scale Integration(2013)

引用 7|浏览3
暂无评分
摘要
New fine-grained 3D cache architectures have been recently proposed to embed more memory on-chip and thus reduce off-chip memory accesses. These 3D architectures provide a high access bandwidth thanks to wide vertical links. In this paper, we analyze the performances of such caches in a manycore context. We first propose to improve the microarchitecture of an existing 3D non uniform cache architecture. Then we evaluate the impact of the granularity (number of tiles) of this 3D cache on an existing multicore architecture executing high performance computing workloads. We show that the granularity of the 3D cache can affect the performances by a factor of 300%. We also evaluate the impact of the vertical links granularity (number of vertical 3D NoC links) on performances and show that a high number of these links is necessary to achieve the best performanes. Finally, we compare this fine-grained architecture to a memory using a Wide IO interface and show that the latter is less efficient in a manycore context.
更多
查看译文
关键词
cache storage,multiprocessing systems,network-on-chip,parallel architectures,performance evaluation,3D cache granularity,3D nonuniform cache architecture,access bandwidth,architectural exploration,cache performances,fine-grained 3D cache architectures,high performance computing workloads,manycore context,memory on-chip,microarchitecture,multicore architecture,off-chip memory accesses,vertical 3D NoC links,vertical links granularity,wide IO interface,wide vertical links
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要