A study on Tibetan prosodic model of speech and respiratory signals

2010 IEEE International Conference on Information and Automation, ICIA 2010(2010)

引用 0|浏览2
暂无评分
摘要
Prosodic model is an important component of the TTS system, and respiratory rhythm is an important factor affecting prosodic features. Based on the speech characteristics of Tibetan, the paper studies the correspondence between respiratory signals and Tibetan prosodic features, and has decided parameters of speech and respiratory signals that affect parameters of prosodic features. Combining the research experience of Chinese prosodic models, the paper established two Tibetan prosodic models with RBF neural network - speech prosodic model and prosodic model of speech and respiratory signals, so physiological signals has been introduced into the establishment of prosodic model. News corpus is used for training of these two kinds of prosodic models with a comparing test the output, the result of which shows that the prosodic model of speech and respiratory signals can generate fundamental frequency and duration parameters that is nearer to natural speech. The results of listening and phonetically identification test show that the MOS score of its synthesized speech is 3.37, with a high naturalness. ©2010 IEEE.
更多
查看译文
关键词
neural network,prosodic model,respiratory signal,speech signal,tibetan,automation,testing,speech,artificial neural networks,network synthesis,neural networks,signal generators,context modeling,natural languages,data models,fundamental frequency,speech synthesis,psychology
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要