On exploring data lakes by finding compact, isolated clusters
Information Sciences(2022)
摘要
•Data lakes store unprocessed business data at large scale.•Clustering helps data engineers understand the structure of their data lakes.•RóMULO is a meta-heuristic multi-way clustering proposal to cluster data lakes.•The results confirm that RóMULO is a promising contribution to assist data engineers.
更多查看译文
关键词
Data lakes,Clustering,Meta-heuristics,Genetic algorithms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要