A Proposed API for Batched Basic Linear Algebra Subprograms
user-5bd69975530c70d56f390249(2016)
摘要
This paper proposes an API for Batched Basic Linear Algebra Subprograms (Batched BLAS).
We focus on many independent BLAS operations on small matrices that are grouped together as a single routine, called Batched BLAS routine, with the aim of providing more efficient, but portable, implementations of algorithms on high-performance manycore architectures
(like multi/manycore CPU processors, GPUs, and coprocessors).
更多查看译文
关键词
Basic Linear Algebra Subprograms,Coprocessor,Parallel computing,Computer science,Matrix (mathematics),Implementation
AI 理解论文
溯源树
样例
![](https://originalfileserver.aminer.cn/sys/aminer/pubs/mrt_preview.jpeg)
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要