Restoring-Aware Beam Search For Reduced Search Errors In Contextual Automatic Speech Recognition

18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION(2017)

引用 4|浏览37
暂无评分
摘要
Using context in automatic speech recognition allows the recognition system to dynamically task-adapt and bring gains to a broad variety of use-cases. An important mechanism of context inclusion is on-the-fly rescoring of hypotheses with contextual language model content available only in real-time.In systems where rescoring occurs on the lattice during its construction as part of beam search decoding. hypotheses eligible for rescoring may be missed due to pruning. This can happen for many reasons: the language model and rescoring model may assign significantly different scores, there may be a lot of noise in the utterance, or word prefixes with a high out degree may necessitate aggressive pruning to keep the search tractable. This results in misrecognitions when contextually relevant hypotheses are pruned before rescoring, even if a contextual rescoring model favors those hypotheses by a large margin.We present a technique to adapt the beam search algorithm to preserve hypotheses when they may benefit from rescoring. We show that this technique significantly reduces the number of search pruning errors on restorable hypotheses, without a significant increase in the search space size. This technique makes it feasible to use one base language model, but still achieve high-accuracy speech recognition results in all contexts.
更多
查看译文
关键词
speech recognition, decoding algorithms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要