CoCoPeLia: Communication-Computation Overlap Prediction for Efficient Linear Algebra on GPUs
2021 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2021
Abstract
Graphics Processing Units (GPUs) are well established in HPC systems and frequently used to accelerate linear algebra routines. Since data transfers pose a severe bottleneck for GPU offloading, modern GPUs provide the ability to overlap communication with computation by splitting the problem into fine-grained sub-kernels that are executed in a pipelined manner. This optimization is currently underut...
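
To make the overlap idea in the abstract concrete, the following is a minimal sketch of a textbook three-stage pipeline estimate (host-to-device transfer, kernel execution, device-to-host transfer). It is not the prediction model proposed in the paper; the function names, the parameter `k` (number of equal sub-kernels), and the assumptions of perfect overlap, equal-sized chunks, and no per-transfer latency are all illustrative.

```python
# Idealized pipeline estimate; an illustrative assumption, not CoCoPeLia's model.
# All times are in the same arbitrary unit (e.g. milliseconds).

def serial_time(t_h2d: float, t_exec: float, t_d2h: float) -> float:
    """Time with no overlap: transfer in, compute, transfer out, one after another."""
    return t_h2d + t_exec + t_d2h


def pipelined_time(t_h2d: float, t_exec: float, t_d2h: float, k: int) -> float:
    """Time when the problem is split into k equal sub-kernels.

    Assumes each stage is an independent resource, so the pipeline fills once
    (the sum of the per-chunk stage times) and then advances at the pace of the
    slowest per-chunk stage for the remaining k - 1 chunks.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    stages = [t_h2d / k, t_exec / k, t_d2h / k]
    return sum(stages) + (k - 1) * max(stages)


if __name__ == "__main__":
    # Hypothetical stage times chosen only for illustration.
    print(serial_time(4.0, 8.0, 2.0))        # 14.0
    print(pipelined_time(4.0, 8.0, 2.0, 4))  # 9.5
```

Under these assumptions, the estimate approaches the duration of the slowest stage as `k` grows. Real transfers carry fixed per-chunk overheads that this sketch ignores, so the best split size on actual hardware is finite.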
Keywords
Concurrent computing, Runtime, Graphics processing units, Linear algebra, Predictive models, Libraries, Data models