Estimating continuous-valued emotion of real-life speech

Journal of Convergence Information Technology (2011)

Abstract
Emotion recognition from real-life speech is challenging and has attracted much attention. We study spontaneous speech emotion estimation at two levels. At the theoretical level, we adopt the two-dimensional valence-arousal emotion plane to describe real-life emotions, instead of the traditional discrete representation. This continuous perspective allows the rich variety of emotions in spontaneous speech to be represented tractably. At the implementation level, we first establish a small-scale spontaneous corpus of 777 utterances. Then, to estimate continuous-valued emotions from speech, we adopt three regression algorithms as estimators. Experimental results show that the Elman Recurrent Neural Network outperforms Fuzzy k-Nearest Neighbor and Support Vector Regression and is better suited to the emotion estimation task, yielding the smallest mean square errors and the highest R-Square, reaching 80.84% for valence and 85.64% for arousal.
Keywords
Emotion Recognition, Real-life Speech, Valence-arousal Emotion Space
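
The abstract compares estimators by mean square error and R-Square on each emotion dimension. As a minimal sketch of how such scores could be computed, assuming R-Square denotes the usual coefficient of determination (the abstract does not give its exact definition), the following Python uses hypothetical function names and made-up toy values, not data from the paper:

```python
def mean_squared_error(y_true, y_pred):
    """Average squared difference between annotated and estimated values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def r_square(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot (assumed definition)."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Toy valence annotations and estimates (illustrative only, not from the corpus).
valence_true = [0.0, 1.0, 2.0, 3.0]
valence_pred = [0.0, 1.0, 2.0, 2.0]

print("MSE:", mean_squared_error(valence_true, valence_pred))
print("R-Square:", r_square(valence_true, valence_pred))
```

The same two functions would be applied separately to the arousal dimension, which gives one pair of scores per estimator and per dimension.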