Elastic Allocator: An Adaptive Task Scheduler for Streaming Query in the Cloud

SoSE(2014)

引用 9|浏览19
暂无评分
摘要
Many big data applications receive and process data in real time. These data, also known as data streams, are generated continuously and processed online in a low latency manner. Data stream is prone to change dramatically in volume, since its workload may have a variation of several orders between peak and valley periods. Fully provisioning resources for stream processing to handle the peak load is costly, while over-provisioning is wasteful when to deal with lightweight workload. Cloud computing emphasizes that resource should be utilized economically and elastically. An open question is how to allocate query task adaptively to keeping up the input rate of the data stream. Previous work focuses on using either local or global capacity information to improve the cluster CPU resource utilization, while the bandwidth utilization which is also critical to the system throughput is ignored or simplified. In this paper, we formalize the operator placement problem considering both the CPU and bandwidth usage, and introduce the Elastic Allocator. The Elastic Allocator uses a quantitative method to evaluate a node's capacity and bandwidth usage, and exploit both the local and global resource information to allocate the query task in a graceful manner to achieve high resource utilization. The experimental results and a simple prototype built on top of Storm finally demonstrate that Elastic Allocator is adaptive and feasible in cloud computing environment, and has an advantage of improving and balancing system resource utilization.
更多
查看译文
关键词
task allocation,resource utilization,big data,node capacity,query task allocation,global resource information,operator placement problem,data streams,query streaming,resource allocation,elastic,stream query,big data applications,quantitative method,cpu,bandwidth usage,data stream, stream query, elastic, task allocation,cloud computing,adaptive task scheduler,local resource information,elastic allocator,data stream,query processing,resource management,linear programming,computer architecture,clustering algorithms,bandwidth,storms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要