BBN TransTalk: Robust multilingual two-way speech-to-speech translation for mobile platforms

Computer Speech & Language (2013)

Abstract
In this paper, we present BBN TransTalk, a speech-to-speech (S2S) translation system that enables two-way communication between English speakers and speakers who do not understand or speak English. BBN TransTalk has been configured for several languages, including Iraqi Arabic, Pashto, Dari, Farsi, Malay, Indonesian, and Levantine Arabic. We describe the key components of the system: automatic speech recognition (ASR), machine translation (MT), text-to-speech (TTS), the dialog manager, and the user interface (UI). In addition, we present novel techniques for overcoming specific challenges in developing high-performing S2S systems. For ASR, we present techniques for dealing with the lack of pronunciation and linguistic resources and for effectively modeling ambiguity in the pronunciations of words in these languages. For MT, we describe techniques for dealing with data sparsity as well as for modeling context. We also present and compare different user confirmation techniques for detecting errors that can cause the dialog to drift or stall.
Keywords
robust multilingual two-way speech-to-speech, different user confirmation technique, S2S system, Levantine Arabic, mobile platform, Iraqi Arabic, present technique, effective modeling, machine translation, present novel technique, BBN TransTalk, dialog manager, automatic speech recognition