Seamless GPU Evaluation of Smart Expression Templates

2018 International Conference on High Performance Computing & Simulation (HPCS)(2018)

引用 0|浏览8
暂无评分
摘要
Expression Templates is a technique allowing to write linear algebra code in C++ the same way it would be written on paper. It is also used extensively as a performance optimization technique, especially as the Smart Expression Templates form which allows for even higher performance. It has proved to be very efficient for computation on a Central Processing Unit (CPU). However, due to its design, it is not easily implemented on a Graphics Processing Unit (GPU). In this paper, we devise a set of techniques to allow the seamless evaluation of Smart Expression Templates on the GPU. The execution is transparent for the user of the library which still uses the matrices and vector as if it was on the CPU and profits from the performance and higher multi-processing capabilities of the GPU. We also show that the GPU version is significantly faster than the CPU version, without any change to the code of the user.
更多
查看译文
关键词
CPU,seamless evaluation,GPU version,linear algebra code,performance optimization technique,central processing unit,graphics processing unit,GPU evaluation,multiprocessing capabilities,smart expression templates form
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要