Enhanced Multichannel Histogram Equalization for Speech Recognition in noisy acoustic conditions.

Frontiers in Artificial Intelligence and Applications (2011)

Abstract
Feature statistics normalization in the cepstral domain is one of the best-performing approaches to robust Automatic Speech Recognition (ASR) in noisy acoustic scenarios. In this approach, feature coefficients are normalized with suitable linear or nonlinear transformations so that the statistics of noisy speech match those of clean speech. Histogram Equalization (HEQ) is an effective algorithm in this category. Some of the authors recently proposed an extension of the original HEQ algorithm that handles the multichannel audio information produced by multi-microphone arrays in far-field acoustic scenarios. In this paper, the feature normalization capabilities of the multichannel HEQ technique are further enhanced by introducing kernel density estimation and by employing multi-condition training to parametrize the ASR system. Computer simulations based on the Aurora 2 database show a significant recognition improvement over the single-channel counterpart and over other multichannel techniques, confirming the effectiveness of the idea.
Keywords
Feature Statistics Normalization, Multi-channel Histogram Equalization, Speech Recognition, Kernel Density Estimation
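
The abstract describes HEQ only at a high level, so the following is a minimal single-channel sketch of the general idea, not the authors' multichannel method. It estimates the cumulative distribution of one cepstral coefficient track with Gaussian kernels and maps each value onto a reference distribution. The function names, the bandwidth value, the sample data, and the choice of a standard Gaussian as the reference (standing in for clean-speech statistics) are all assumptions made for illustration.

```python
from statistics import NormalDist


def kernel_cdf(samples, bandwidth):
    """Build a smooth CDF estimate from Gaussian kernels centred on each sample."""
    kernels = [NormalDist(mu=s, sigma=bandwidth) for s in samples]

    def cdf(x):
        # Average of the individual kernel CDFs evaluated at x.
        return sum(k.cdf(x) for k in kernels) / len(kernels)

    return cdf


def equalize(noisy, bandwidth=0.5, reference=NormalDist(0.0, 1.0)):
    """Map one coefficient track onto the reference distribution.

    Assumption: the reference is a standard Gaussian; in practice it would
    be estimated from clean training speech.
    """
    cdf = kernel_cdf(noisy, bandwidth)
    eps = 1e-6  # keep probabilities strictly inside (0, 1) for inv_cdf
    out = []
    for x in noisy:
        p = min(max(cdf(x), eps), 1.0 - eps)
        out.append(reference.inv_cdf(p))
    return out


# Hypothetical values of a single cepstral coefficient over eight frames.
noisy_track = [2.1, 3.4, 2.9, 5.0, 4.2, 3.8, 2.5, 4.7]
equalized = equalize(noisy_track)
```

Because both the kernel CDF and the inverse reference CDF are monotonically increasing, the transformation preserves the ordering of the frames while reshaping their distribution. A multichannel variant would need some rule for combining the statistics of several microphones, which the abstract does not specify.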