Spectro-temporal Gabor features for speaker recognition

Acoustics, Speech and Signal Processing(2012)

引用 29|浏览11
暂无评分
摘要
In this work, we have investigated the performance of 2D Gabor features (known as spectro-temporal features) for speaker recognition. Gabor features have been used mainly for automatic speech recognition (ASR), where they have yielded improvements. We explored different Gabor feature implementations, along with different speaker recognition approaches, on ROSSI [1] and NIST SRE08 databases. Using the noisy ROSSI database, the Gabor features performed as well as the MFCC features standalone, and score-level combination of Gabor and MFCC features resulted in an 8% relative EER improvement over MFCC features standalone. These results demonstrated the value of both spectral and temporal information for feature extraction, and the complementarity of Gabor features to MFCC features.
更多
查看译文
关键词
Gabor filters,feature extraction,speech recognition,2D Gabor features,MFCC features standalone,NIST SRE08 database,automatic speech recognition,noisy ROSSI database,score-level combination,spectro-temporal Gabor features,Gabor features,ROSSI database,Speaker recognition,spectral and temporal modulation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要