Speech Emotion Recognition using non-linear Teager energy based features in noisy environments

Signal Processing Conference(2012)

引用 30|浏览16
暂无评分
摘要
In this study, Teager-energy based Mel-frequency cepstral coefficients (TEMFCCs) are proposed for Automatic Speech Emotion Recognition (ASER) in noisy environments. TEMFCCs are obtained by taking the absolute value of the Teager-energy operator (TEO) of the short-time Fourier transform of the signal (STFT), warping it to a Mel-frequency scale, and taking the discrete cosine transform (DCT) of the log-Mel Teager-energy spectrum. Experiments on classification of discrete emotion categories show that TEMFCCs are more robust than MFCCs in noisy conditions, while TEMFCCs and MFCCs perform similarly for clean conditions.
更多
查看译文
关键词
Fourier transforms,cepstral analysis,discrete cosine transforms,emotion recognition,signal classification,speech recognition,ASER,DCT,STFT,TEMFCC,TEO,Teager energy based Mel frequency cepstral coefficient,Teager energy operator,automatic speech emotion recognition,discrete cosine transform,discrete emotion category classification,log-Mel Teager energy spectrum,noisy environment,nonlinear Teager energy,short time Fourier transform,emotion recognition,nonlinear acoustics,speech analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要