Towards a visual speech learning system for the deaf by matching dynamic lip shapes

COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PT I(2012)

引用 2|浏览0
暂无评分
摘要
In this paper we propose a visual-based speech learning framework to assist deaf persons by comparing the lip movements between a student and an E-tutor in an intelligent tutoring system. The framework utilizes lip reading technologies to determine if a student learns the correct pronunciation. Different from conventional speech recognition systems, which usually recognize a speaker's utterance, our speech learning framework focuses on recognizing whether a student pronounces are correct according to an instructor's utterance by using visual information. We propose a method by extracting dynamic shape difference features (DSDF) based on lip shapes to recognize the pronunciation difference. The preliminary experimental results demonstrate the robustness and effectiveness of our approach on a database we collected, which contains multiple persons speaking a small number of selected words.
更多
查看译文
关键词
deaf person,correct pronunciation,dynamic shape difference feature,visual-based speech,pronunciation difference,dynamic lip shape,intelligent tutoring system,lip shape,visual speech,lip movement,conventional speech recognition system,lip reading technology
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要