Distributed Discovery of Functional Dependencies

2019 IEEE 35th International Conference on Data Engineering (ICDE)(2019)

引用 20|浏览44
暂无评分
摘要
We address the problem of discovering functional dependencies from distributed big data. Existing (non-distributed) algorithms such as FastFDs focus on minimizing computation. However, distributed algorithms must also optimize data communication costs, especially in shared-nothing settings. We propose a distributed version of FastFDs that is communication-efficient and we experimentally show significant performance improvements over a straightforward distributed implementation.
更多
查看译文
关键词
Sparks,Data communication,Runtime,Random access memory,Partitioning algorithms,Big Data,Distributed algorithms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要