Compression-Aware Algorithms for Massive Datasets

DCC '15 Proceedings of the 2015 Data Compression Conference(2015)

引用 3|浏览25
暂无评分
摘要
While massive datasets are often stored in compressed format, most algorithms are designed to operate on uncompressed data. We address this growing disconnect by developing a framework for compression-aware algorithms that operate directly on compressed datasets. Synergistically, we also propose new algorithmically-aware compression schemes that enable algorithms to efficiently process the compressed data. In particular, we apply this general methodology to geometric / CAD datasets that are ubiquitous in areas such as graphics, VLSI, and geographic information systems. We develop example algorithms and corresponding compression schemes that address different types of datasets, including point sets and graphs. Our methods are more efficient than their classical counterparts, and they extend to both lossless and lossy compression scenarios. This motivates further investigation of how this approach can enable algorithms to process ever-increasing big data volumes.
更多
查看译文
关键词
Big Data,data compression,Big Data volumes,CAD dataset,algorithmically-aware compression scheme,compression-aware algorithm,computer-aided dataset,geometric dataset,lossless compression,lossy compression,massive dataset,algorithmically-aware compressions,compression-aware algorithms,geometric algorithms,graph algorithms,graph compression,pointset compression
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要