Ultrafast CPU/GPU Kernels for Density Accumulation in Placement

2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC)(2021)

引用 7|浏览19
暂无评分
摘要
Density accumulation is a widely-used primitive operation in physical design, especially for placement. Iterative invocation in the optimization flow makes it one of the runtime bottlenecks. Accelerating density accumulation is challenging due to data dependency and workload imbalance. In this paper, we propose efficient CPU/GPU kernels for density accumulation by decomposing the problem into two phases: constant-time density collection for each instance and a linear-time prefix sum. We develop CPU and GPU dedicated implementations, and demonstrate promising efficiency benefits on tasks from large-scale placement problems.
更多
查看译文
关键词
density accumulation,constant-time density collection,ultrafast CPU/GPU kernels,iterative invocation,optimization flow,runtime bottlenecks,data dependency,workload imbalance,linear-time prefix sum,large-scale placement problems
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要