What can real information content tell us about compressing climate model data?

2022 IEEE/ACM 8th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD)(2022)

引用 0|浏览10
暂无评分
摘要
The massive data volumes produced by climate simulation models create an urgent need for data reduction. Lossy compression in particular is a solution that can significantly reduce storage requirements, however, a tradeoff must be made between the amount of compression applied and the scientific integrity of the data. Determining how much compression can be applied is therefore vital for applying lossy compression. One particular metric for gauging the quality of compression is the percentage of real information present in the original data that is preserved in the compressed data. We compute bitwise real information content for several climate variables from the popular Community Earth System Model, and we investigate the amount of compression that can be applied to each of these climate variables using two popular compression algorithms designed for floating-point data while preserving 99% of the real information content. The analysis of the real information content of data after lossy compression has been applied shows a helpful visualization of how compression artifacts have been introduced to the data. Finally, we demonstrate how this real information content can be used in a straightforward manner to determine compressor settings for our data.
更多
查看译文
关键词
Real Information Content,Entropy,Bit Grooming,ZFP,Lossy Compression,Climate Model Data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要