Towards realizing gesture-to-speech conversion with a HMM-based bilingual speech synthesis system

Hongwu Yang,Xiaochun An, Dong Pei, Yitong Liu

Orange Technologies(2014)

引用 4|浏览7
暂无评分
摘要
This paper realizes a gesture-to-speech conversion system to solve the communication problem between healthy people and speech disorders. An improved speeded up robust features (SURF) algorithm is adopted for static gesture recognition by combining Kinect sensor. Meanwhile, a Hidden Markov Model (HMM) based Mandarin-Tibetan bilingual speech synthesis system is developed by using speaker adaptive training. A set of semantic rules is designed for the static gestures. Chinese or Tibetan context-dependent labels of recognized static gestures are generated according to the semantic rules. The recognized gestures are finally converted to the Mandarin or Tibetan by using the Mandarin-Tibetan bilingual speech synthesis system with the context-dependent labels. Tests show that the static gesture recognition rate of the designed system achieves 97.1%. Subjective evaluation demonstrates that synthesized speech can get 4.0 of the mean opinion score (MOS) on synthesized speech.
更多
查看译文
关键词
feature extraction,gesture recognition,handicapped aids,hidden markov models,natural language processing,sensors,speaker recognition,speech synthesis,chinese context-dependent labels,hmm based mandarin-tibetan bilingual speech synthesis system,kinect sensor,mos,surf algorithm,tibetan context-dependent labels,communication problem,gesture-to-speech conversion system,healthy people,hidden markov model,mean opinion score,semantic rules,speaker adaptive training,speech disorders,speeded up robust features algorithm,static gesture recognition,synthesized speech,mandarin-tibetan bilingual speech synthe-sis,context-dependent label,improved surf algorithm,speech recognition,semantics,speech
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要