Accelerating Dependent Cache Misses With An Enhanced Memory Controller

ACM SIGARCH Computer Architecture News(2016)

引用 127|浏览211
暂无评分
摘要
On-chip contention increases memory access latency for multi-core processors. We identify that this additional latency has a substantial effect on performance for an important class of latency-critical memory operations: those that result in a cache miss and are dependent on data from a prior cache miss. We observe that the number of instructions between the first cache miss and its dependent cache miss is usually small. To minimize dependent cache miss latency, we propose adding just enough functionality to dynamically identify these instructions at the core and migrate them to the memory controller for execution as soon as source data arrives from DRAM. This migration allows memory requests issued by our new Enhanced Memory Controller (EMC) to experience a 20% lower latency than if issued by the core. On a set of memory intensive quad-core workloads, the EMC results in a 13% improvement in system performance and a 5% reduction in energy consumption over a system with a Global History Buffer prefetcher, the highest performing prefetcher in our evaluation.
更多
查看译文
关键词
dependent cache misses,on-chip contention,memory access latency,multicore processors,latency-critical memory operations,cache miss latency,enhanced memory controller,DRAM,memory requests,EMC,memory intensive quad-core workloads,system performance,energy consumption,global history buffer prefetcher
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要