Programming with BIG Data in R: Scaling Analytics from One to Thousands of Nodes.

Big Data Research(2017)

Abstract
We present a tutorial overview showing how one can achieve scalable performance with R. We do so by using several package extensions, including those from the pbdR project. These packages consist of high-performance, high-level interfaces to and extensions of MPI, PBLAS, ScaLAPACK, I/O libraries, profiling libraries, and more. While these libraries shine brightest on large distributed platforms, they also work rather well on small clusters and often, surprisingly, even on a laptop with only two cores.
Keywords
Scalable statistical computing, Principal components analysis, Distributed computing, SPMD