Improved Compression-Based Pattern Recognition Exploiting New Useful Features

PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017)(2017)

引用 1|浏览9
暂无评分
摘要
Compression-based pattern recognition measures the similarity between objects with relying on data compression techniques. This paper improves the current compression-based pattern recognition by exploiting new useful features which are easy to obtain. In particular, we study the two known methods called PRDC (Pattern Representation on Data Compression) and NMD (Normalized Compression Distance). PRDC represents an object x with a feature vector that lines up the compression ratios derived by compressing x with multiple dictionaries. We smartly enhance PRDC by extracting new novel features from the compressed files. NMD measures the similarity between two objects by comparing their compression dictionaries. We extend NMD by incorporating the length of words in the dictionaries into the similarity measure.
更多
查看译文
关键词
Compression Ratio, Word Frequency, Word Length, Word Order, Reference Object
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要