Cocktail: A hybrid system combining Hadoop and Storm

2015 IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), 2015

Abstract
Hadoop and Storm both play significant roles in cloud computing, and each has its own area of applicability. Cocktail is a new hybrid system that combines Hadoop and Storm into a single system, leveraging the capabilities of both computing frameworks. The design and implementation of Cocktail include a SQL-like query language that makes implementation details transparent to users, an intelligent framework selector that uses a cost model to choose the appropriate framework automatically, and an efficient resource scheduling and task execution framework. Cocktail covers a wide range of application scenarios, from batch processing to stream computing, using Storm to process real-time data and Hadoop to process large-scale data. We compare the performance, throughput, and scalability of Cocktail with Summingbird to demonstrate its practicality and capability. According to the benchmark, for small-scale data, Cocktail's performance is close to that of Summingbird running on Storm and 20% to 40% faster than Summingbird running on Hadoop. For large-scale data, Cocktail's throughput is 40% higher than that of Summingbird running on Storm.
Keywords
Hadoop,Storm,Hybrid System
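
Illustrative sketch
The abstract mentions a framework selector driven by a cost model but does not give the model itself. The following is only a minimal sketch of how such a selector might look, assuming a toy cost of fixed startup overhead plus a per-record cost. The names FrameworkCost, estimated_cost, and select_framework, as well as every numeric parameter, are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class FrameworkCost:
    """Hypothetical cost parameters for one backend (not from the paper)."""
    name: str
    startup_seconds: float      # assumed fixed overhead to launch a job
    seconds_per_record: float   # assumed marginal cost per input record


def estimated_cost(fw: FrameworkCost, num_records: int) -> float:
    """Toy linear cost model: startup overhead plus per-record work."""
    return fw.startup_seconds + fw.seconds_per_record * num_records


def select_framework(num_records: int, candidates: Iterable[FrameworkCost]) -> str:
    """Return the name of the candidate with the lowest estimated cost."""
    return min(candidates, key=lambda fw: estimated_cost(fw, num_records)).name


# Made-up parameters: low startup and higher per-record cost for the
# streaming backend, high startup and lower per-record cost for batch.
STORM = FrameworkCost("storm", startup_seconds=1.0, seconds_per_record=1e-4)
HADOOP = FrameworkCost("hadoop", startup_seconds=30.0, seconds_per_record=1e-5)

if __name__ == "__main__":
    for n in (1_000, 10_000_000):
        print(n, select_framework(n, [STORM, HADOOP]))
```

With these made-up parameters, small inputs are routed to the low-startup backend and large inputs to the backend with the lower per-record cost, which mirrors the small-scale versus large-scale split described in the abstract. A real selector would presumably estimate its costs from the query and cluster state rather than from constants.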