1-D Local binary patterns based VAD used INHMM-based improved speech recognition

EUSIPCO(2012)

Cited 31|Views22
No score
Abstract
In this paper, 1-D Local binary patterns (LBP) are proposed to be used in speech signal segmentation and voice activation detection (VAD)and combined with hidden Markov model (HMM) for advanced speech recognition. Speech is firstly de-noised by Adaptive Empirical Model Decomposition (AEMD), and then processed using LBP based VAD. The short-time energy of the speech activity detected from the VAD is finally smoothed and used as the input of the HMM recognition process. The enhanced performance of the proposed system for speech recognition is compared with other VAD techniques at different SNRs ranging from 15 dB to a robust noisy condition at -5 dB.
More
Translated text
Key words
aemd,signal denoising,noise figure 15 db,speech activity detection,speech recognition,1-d lbp,speech enhancement using adaptive empirical model decomposition (aemd),voice activity detection,voice activation detection,inhmm-based improved speech recognition,speech signal segmentation,vad technique,speech denoising,hidden markov model,adaptive empirical model decomposition,1d local binary pattern,local binary patterns,snr,noise reduction,noise figure -5 db,hidden markov models
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined