Tuning small analytics on Big Data: Data partitioning and secondary indexes in the Hadoop ecosystem

Oscar Romero,Victor Herrero,Alberto Abelló,Jaume Ferrarons

Information Systems（2015）

引用 21|浏览47

暂无评分

摘要

In the recent years the problems of using generic storage (i.e., relational) techniques for very specific applications have been detected and outlined and, as a consequence, some alternatives to Relational DBMSs (e.g., HBase) have bloomed. Most of these alternatives sit on the cloud and benefit from cloud computing, which is nowadays a reality that helps us to save money by eliminating the hardware as well as software fixed costs and just pay per use. On top of this, specific querying frameworks to exploit the brute force in the cloud (e.g., MapReduce) have also been devised. The question arising next tries to clear out if this (rather naive) exploitation of the cloud is an alternative to tuning DBMSs or it still makes sense to consider other options when retrieving data from these settings.

查看译文

关键词

Big Data,OLAP,Multidimensional model,Indexes,Partitioning,Cost estimation

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要