Survey: Exploring Disfluencies for Speech To Text Machine Translation

semanticscholar(2021)

引用 0|浏览0
暂无评分
摘要
Spoken language is different from the written language in its style and structure. Disfluencies that appear in transcriptions from speech recognition systems generally hamper the performance of downstream NLP tasks. Thus, a disfluency correction system that converts disfluent to fluent text is of great value. This survey paper talks about disfluencies present in speech and its transcriptions. Later, we describe methodologies to correct disfluencies present in the transcriptions of a spoken utterance via various approaches viz, a) style transfer for disfluency correction b) transfer learning and language model pretraining. We observe that disfluency inherent speech phenomenon and its correction is crucial for downstream NLP tasks.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要