Discriminative template extraction for direct modeling

ICASSP(2010)

引用 0|浏览45
暂无评分
摘要
This paper addresses the problem of developing appropriate features for use in direct modeling approaches to speech recognition, such as those based on Maximum Entropy models or Segmental Conditional Random Fields. We propose a feature based on the detection of word-level templates which are discriminatively chosen based on a mutual information criterion. The templates for a word are derived directly from the MFCC feature vectors, based on self-similarity across examples. No pronunciation dictionary is used, and the resulting templates match closely to in-class examples and distantly to out-of-class examples. We utilize template detection events as input to a segmental CRF speech recognizer. We evaluate the entire scheme on a voice search task. The results show that the use of discriminative template based word detector streams improves the speech recognizer's performance over the baseline HMM results.
更多
查看译文
关键词
discriminative templates,segmental condi- tional random fields,speech recognition,random processes,discriminative template extraction,index terms— discriminative templates,feature detection,hmm,direct modeling approaches,cepstral analysis,maximum entropy models,voice search task,feature extraction,mfcc feature vectors,maximum entropy methods,segmental conditional random fields,word-level templates,hidden markov models,mel frequency cepstral coefficients,segmental crf speech recognizer,mutual information,maximum entropy model,speech,random field,indexing terms,detectors,entropy,feature vector,template matching,decoding,data mining,dictionaries,mel frequency cepstral coefficient
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要