Discriminative Feature Construction Using Multi-labeling Approach for Automatic Speech Emotion Recognition

Md. Shah Fahad, Raushan Raj, Ashish Ranjan, A. Deepak

Lecture Notes in Electrical Engineering (2023)

Abstract
The paper introduces a framework for discriminative feature construction for speech emotion recognition that jointly learns discrete categorical and continuous emotion information. In discrete emotion labeling, each utterance is assigned a single label, whereas in continuous emotion labeling, each utterance is assigned values for three primary attributes (arousal, valence, and dominance). Each attribute is treated as an auxiliary task: its values are grouped into low, mid, and high categories and predicted simultaneously with the main task (discrete emotion prediction). A deep CNN architecture is proposed to optimize the multi-labeling objective and is later used to extract intermediate features. The extracted features are then used to train a deep neural network that classifies the discrete emotion class. The proposed network is evaluated on the IEMOCAP dataset using four emotions: angry, excitation, neutral, and sad. The proposed multi-label framework improves the unweighted accuracy (UWA) by 3.0% compared with the single-label framework (discrete emotion prediction only).
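The labeling scheme and the evaluation metric described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the 1 to 5 score range, and the bin thresholds (2.5 and 3.5) are assumptions chosen for illustration, since the abstract does not state how the low, mid, and high categories are delimited. UWA is computed here as the mean of per-class recalls, which is the usual definition of unweighted accuracy.

```python
from collections import defaultdict


def bin_attribute(score, low_max=2.5, high_min=3.5):
    """Map a continuous attribute score to 'low', 'mid', or 'high'.

    The thresholds are hypothetical; the abstract does not give them.
    """
    if score <= low_max:
        return "low"
    if score >= high_min:
        return "high"
    return "mid"


def make_labels(emotion, arousal, valence, dominance):
    """Build the four targets for one utterance: the main task
    (discrete emotion) plus three binned auxiliary tasks."""
    return {
        "emotion": emotion,
        "arousal": bin_attribute(arousal),
        "valence": bin_attribute(valence),
        "dominance": bin_attribute(dominance),
    }


def unweighted_accuracy(y_true, y_pred):
    """Mean of per-class recalls, so every class counts equally
    regardless of how many utterances it has."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)
```

For example, `make_labels("angry", 4.5, 1.5, 3.0)` yields one categorical target and three binned attribute targets for a single utterance, which a multi-output network could then predict jointly.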
Keywords
discriminative feature construction, emotion, speech, multi-labeling