A Performance Comparison of Open-Source Stream Processing Platforms.

IEEE Global Communications Conference(2016)

引用 110|浏览99
暂无评分
摘要
Distributed stream processing platforms are a new class of real-time monitoring systems that analyze and extract knowledge from large continuous streams of data. These type of systems are crucial for providing high throughput and low latency required by Big Data or Internet of Things monitoring applications. This paper describes and analyzes three main open-source distributed stream-processing platforms: Storm, Flink, and Spark Streaming. We analyze the system architectures and we compare their main features. We carry out two experiments concerning threats detection on network traffic to evaluate the throughput efficiency and the resilience to node failures. Results show that the performance of native stream processing systems, Storm and Flink, is up to 15 times higher than the micro-batch processing system, Spark Streaming. However, Spark Streaming is robust to node failures and provides recovery without losses.
更多
查看译文
关键词
open-source stream processing platform,distributed stream processing platform,data streaming,big data monitoring application,Internet of Things monitoring application,open source distributed stream-processing platform,storm streaming,flink streaming,spark streaming,network traffic detection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要