Performance Evaluation Of Parallel Sparse Tensor Decomposition Implementations

SC(2016)

引用 10|浏览23
暂无评分
摘要
Tensor decomposition, the higher-order analogue to singular value decomposition, has emerged as a useful tool for finding relationships in large, sparse, multidimensional data sets. As this technique matures and is applied to increasingly larger data sets, the need for high performance implementations becomes critical. In this work, we perform an objective empirical evaluation of three popular parallel implementations of the Candecomp/Parafac Alternating Least Squares (CP-ALS) tensor decomposition algorithm, namely SPLATT, DFacTo, and ENSIGN. We conduct performance studies across a variety of data sets, comparing the total memory required, the runtime, and the parallel scalability of each implementation. We find that the approach taken by SPLATT results in the fastest runtimes across the data sets, performing 5 - 22.64 times faster than the other tools. Additionally, SPLATT consumes 1.16 - 8.62 times less memory than the other tools. When tested on up to 20 cores or nodes, SPLATT using distributed memory parallelism exhibits the best strong scaling.
更多
查看译文
关键词
parallel sparse tensor decomposition implementations,performance evaluation,singular value decomposition,multidimensional data sets,objective empirical evaluation,Candecomp/Parafac alternating least squares,CP-ALS tensor decomposition algorithm,data sets,parallel scalability,distributed memory parallelism,SPLATT
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要