Chunk Content is not Enough: Chunk-Context Aware Resemblance Detection for Deduplication Delta Compression

DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC)(2022)

引用 2|浏览28
暂无评分
摘要
In this paper, we propose a novel chunk-context-aware resemblance detection al-gorithm called CARD. By introducing machine learning into deduplication, the chunk feature will embed the chunk-context information after the N-sub-chunk shingles based initial feature extraction and BP-Neural network training. In the predicting process, each chunk's initial feature corresponds to a chunk-context feature. Finally, the cloud calculates the different part among resemblance chunks based on these feature by delta encoding. Only the different part is stored. The basic workflow corresponds to Figure 1. For more detailed illustrations, please see our full paper here
更多
查看译文
关键词
Cloud Storage,Resemblance Detection,Chunk Context,BP Neural network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要