A compiler approach to managing storage and memory bandwidth in configurable architectures

ACM Trans. Design Autom. Electr. Syst.(2008)

引用 26|浏览17
暂无评分
摘要
Configurable architectures offer the unique opportunity of realizing hardware designs tailored to the specific data and computational patterns of an application code. Customizing the storage structures is becoming increasingly important in mitigating the continuing gap between memory latencies and internal computing speeds. In this article we describe and evaluate a compiler algorithm that maps the arrays of a loop-based computation to internal storage structures, either RAM blocks or discrete registers. Our objective is to minimize the overall execution time while considering the capacity and bandwidth constraints of the storage resources. The novelty of our approach lies in creating a single framework that combines high-level compiler techniques with lower-level scheduling information for mapping the data. We illustrate the benefits of our approach for a set of image/signal processing kernels using a Xilinx Virtex™ Field-Programmable Gate Array (FPGA). Our algorithm leads to faster designs compared to the state-of-the-art custom data layout mapping technique, in some instances using less storage. When compared to hand-coded designs, our results are comparable in terms of execution time and resources, but are derived in a minute fraction of the design time.
更多
查看译文
关键词
storage allocation and management,specific data,high-level hardware synthesis,execution time,configurable architectures,storage resource,design time,compiler approach,compiler analysis,configurable architecture,state-of-the-art custom data layout,memory bandwidth,overall execution time,high-level compiler technique,storage structure,compiler algorithm,internal storage structure,signal processing,field programmable gate array,memory latency
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要