Music identification via vocabulary tree with MFCC peaks.

MM '11: ACM Multimedia Conference Scottsdale Arizona USA November, 2011(2011)

引用 2|浏览26
暂无评分
摘要
In this paper, a Vocabulary Tree based framework is proposed for music identification whose target is to recognize a fragment from a song database. The key to a high recognition precision within this framework is a novel feature, namely MFCC Peaks, which is a combination of MFCC and Spectral Peaks features. Our approach consists of three stages. We first build the Vocabulary Tree with 2 million MFCC Peaks features extracted from hundreds of music. Then each song in the database is quantified into some words by traveling from root down to a certain leaf. Given a query input, we apply the same quantization procedure to this fragment, score the archive according to the TF-IDF scheme and return the best matches. The experimental results demonstrate that our proposed feature has strong identifying and generalization ability. Other trials show that our approach scales well with the size of database. Further comparison also demonstrates that while our algorithm achieves approximately the same retrieval precision as other state-of-the-art methods, it cost less time and memory.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要