Speech-Based Emotion Analysis Using Log-Mel Spectrograms and MFCC Features.

Ahmet Kemal Yetkin,Hatice Köse

SIU(2023)

引用 0|浏览1
暂无评分
摘要
This study proposes a method for recognizing emotions from speech using Mel spectrograms and MFCC features which capture the spectral features of speech signals. To predict emotions from the extracted features from the dataset, Convolutional Neural Networks (CNNs) and finetune pre-trained models are used. Pre-trained models are fine-tuned with some optimizations and one-dimensional convolutional neural network is constructed. The results demonstrate that the proposed method achieved an accuracy rate of over 80% in predicting emotions from speech and show the effectiveness of the approach in a comparative manner.
更多
查看译文
关键词
speech emotion recognition, machine learning, neural networks, log-Mel spectrogram, MFCC
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要