A Proposed API for Batched Basic Linear Algebra Subprograms

user-5bd69975530c70d56f390249(2016)

引用 29|浏览7
暂无评分
摘要
This paper proposes an API for Batched Basic Linear Algebra Subprograms (Batched BLAS). We focus on many independent BLAS operations on small matrices that are grouped together as a single routine, called Batched BLAS routine, with the aim of providing more efficient, but portable, implementations of algorithms on high-performance manycore architectures (like multi/manycore CPU processors, GPUs, and coprocessors).
更多
查看译文
关键词
Basic Linear Algebra Subprograms,Coprocessor,Parallel computing,Computer science,Matrix (mathematics),Implementation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要