System combination and score normalization for spoken term detection

ICASSP(2013)

引用 96|浏览153
暂无评分
摘要
Spoken content in languages of emerging importance needs to be searchable to provide access to the underlying information. In this paper, we investigate the problem of extending data fusion methodologies from Information Retrieval for Spoken Term Detection on low-resource languages in the framework of the IARPA Babel program. We describe a number of alternative methods improving keyword search performance. We apply these methods to Cantonese, a language that presents some new issues in terms of reduced resources and shorter query lengths. First, we show score normalization methodology that improves in average by 20% keyword search performance. Second, we show that properly combining the outputs of diverse ASR systems performs 14% better than the best normalized ASR system.
更多
查看译文
关键词
natural language processing,lattices,hidden markov models,sensor fusion,information retrieval,speech recognition,tuning,indexes,automatic speech recognition,speech,data fusion,data integration
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要