Novel approaches to Arabic speech recognition: report from the 2002 Johns-Hopkins Summer Workshop

ICASSP (1) (2003)

Abstract
Although Arabic is currently one of the most widely spoken languages in the world, there has been relatively little speech recognition research on Arabic compared to other languages. Moreover, most previous work has concentrated on the recognition of formal rather than dialectal Arabic. This paper reports on our project at the 2002 Johns Hopkins Summer Workshop, which focused on the recognition of dialectal Arabic. Three problems were addressed: (a) the lack of short vowels and other pronunciation information in Arabic texts; (b) the morphological complexity of Arabic; and (c) the discrepancies between dialectal and formal Arabic. We present novel approaches to automatic vowel restoration, morphology-based language modeling and the integration of out-of-corpus language model data, and report significant word error rate improvements on the LDC Arabic CallHome task.
Keywords
speech recognition, speech recognition research, natural languages, automatic vowel restoration, word error rate, dialectal Arabic recognition, morphological complexity, dialectal Arabic, morphology-based language modeling, Johns-Hopkins Summer Workshop, pronunciation information, Arabic speech recognition, vowels, out-of-corpus language model data, LDC Arabic CallHome task, formal Arabic, language model