SciSpark: Applying In-Memory Distributed Computing to Weather Event Detection and Tracking

2015 IEEE International Conference on Big Data (Big Data), 2015

Abstract
In this paper we present SciSpark, a Big Data framework that extends Apache™ Spark for scaling scientific computations. The paper details the initial architecture and design of SciSpark. We demonstrate how SciSpark ingests and partitions earth science satellite and model datasets in parallel. We also illustrate the usability and extensibility of SciSpark by implementing aspects of the Grab 'em Tag 'em Graph 'em (GTG) algorithm using SciSpark and its MapReduce capabilities. GTG is a topical automated method for identifying and tracking Mesoscale Convective Complexes in satellite infrared datasets.
Keywords
Apache Spark,in-memory distributed computing,large scientific datasets,mesoscale convective complexes