An Implementation of Rational Wavelets and Filter Design for Phonetic Classification

Audio, Speech, and Language Processing, IEEE Transactions(2007)

引用 34|浏览3
暂无评分
摘要
Although wavelet analysis has been proposed for speech processing as an alternative to Fourier analysis, most approaches make use of off-the-shelf wavelets and dyadic tree-structured filter banks. In this paper, we extend previous wavelet-based frameworks in two ways. First, we increase the flexibility in wavelet selection by taking advantage of the relationship between wavelets and filter banks and by designing new wavelets using filter design methods. We adopt two filter design techniques that we refer to as filter matching and attenuation minimization. Second, we improve the flexibility in frequency partitioning by implementing rational as well as dyadic filter banks. Rational filter banks naturally incorporate the critical-band effect in the human auditory system. To test our extensions, we implement an energy-based measurement which we also compare in performance to the mel-frequency cepstral coefficients (MFCCs) in a phonetic classification task. We show that the designed wavelets outperform off-the-shelf wavelets as well as an MFCC baseline
更多
查看译文
关键词
new wavelet,wavelet selection,dyadic tree-structured filter bank,filter design method,phonetic classification,off-the-shelf wavelet,wavelet analysis,rational filter bank,rational wavelets,dyadic filter bank,fourier analysis,filter design,filter design technique,frequency,attenuation,speech processing,mel frequency cepstral coefficients,wavelet transforms,design methodology,speech recognition,tree structure,matched filters,mel frequency cepstral coefficient,filter bank
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要