Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear Algebra

International Conference on Management of Data (2021)

Cited by 11 | Views 43
Abstract
Machine learning (ML) computations are often expressed using vectors, matrices, or higher-dimensional tensors. Such data structures can have many different implementations, especially in a distributed environment: a matrix could be stored as row or column vectors, tiles of different sizes, or relationally, as a set of (rowIndex, colIndex, value) triples. Many other storage formats are possible. The choice of format can have a profound impact on the performance of an ML computation. In this paper, we propose a framework for automatic optimization of the physical implementation of a complex ML or linear algebra (LA) computation in a distributed environment, develop algorithms for solving this problem, and show, through a prototype on top of a distributed relational database system, that our ideas can radically speed up common ML and LA computations.
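The abstract's point that one logical matrix admits several physical layouts can be illustrated with a small sketch. The function names and the tiling scheme below are illustrative assumptions, not the paper's actual implementation:

```python
import numpy as np

def to_triples(m):
    """Relational layout: the set of (rowIndex, colIndex, value)
    triples for the nonzero entries of m (illustrative)."""
    return {(i, j, m[i, j])
            for i in range(m.shape[0])
            for j in range(m.shape[1])
            if m[i, j] != 0}

def to_tiles(m, tile):
    """Tiled layout: a dict mapping tile coordinates to dense
    sub-blocks of size tile x tile (assumes shape divides evenly)."""
    return {(i // tile, j // tile): m[i:i + tile, j:j + tile]
            for i in range(0, m.shape[0], tile)
            for j in range(0, m.shape[1], tile)}

# The same 4x4 matrix under both layouts.
m = np.arange(16).reshape(4, 4)
triples = to_triples(m)   # relational representation
tiles = to_tiles(m, 2)    # four 2x2 tiles
```

Which layout is faster depends on the computation: the triple form maps directly onto relational operators, while tiling lets dense kernels run on contiguous blocks; choosing among such layouts automatically is the optimization problem the paper addresses.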
Keywords
Distributed systems and machine learning