Seq-2-Seq based Refinement of ASR Output for Spoken Name Capture

Conference of the International Speech Communication Association (INTERSPEECH)(2022)

引用 2|浏览15
暂无评分
摘要
Person name capture from human speech is a difficult task in human-machine conversations. In this paper, we propose a novel approach to capture the person names from the caller utterances in response to the prompt "say and spell your first/last name". Inspired from work on spell correction, disfluency removal and text normalization, we propose a lightweight Seq-2-Seq system which generates a name spell from a varying user input. Our proposed method outperforms the strong baseline which is based on LM-driven rule-based approach.
更多
查看译文
关键词
spoken name capture,asr output
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要