谷歌Chrome浏览器插件
订阅小程序
在清言上使用

Scalable Run-time Data Indexing and Querying for Scientific Simulations

Big Data Analytics: Challenges and Opportunities (BDAC-14) Workshop at Supercomputing Conference(2014)

引用 1|浏览6
暂无评分
摘要
Scientific simulations running at scale on highend computing systems are generating tremendous amounts of raw data, which has to be carefully analyzed before scientists can derive insights from simulations and better understand the phenomena being modeled. Query-driven data analysis is an important technique used by scientists to gain insights from data, especially to capture intermittent transient information at simulation run-time. However, the increasing data volumes and associated I/O costs (latency and energy) are quickly making it infeasible to perform post-simulation query-driven data analysis in a cost-effective and timely manner. To address this challenge, in this paper, we present a scalable data indexing and querying framework using a data staging approach to support run-time query-driven data analysis for large scale scientific simulations. We achieve scalability, low overheads and minimal impact on the simulations by offloading computationally expensive indexing and querying to data staging servers. We also present an experimental evaluation, which demonstrates the performance and scalability of the framework.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要