Hybrid Feature Extraction MFCC and Feature Selection CNN for Speaker Identification Using CNN: A Comparative Study

Aya Hasan Abdulqader, Syed AbdulRahman Al-Haddad, Salah Abdo, Amer Abdulghani, Sureshkumar Natarajan

2022 2nd International Conference on Emerging Smart Technologies and Applications (eSmarTA) (2022)

Abstract
Speaker identification is the task of determining which member of a known group of speakers is currently talking, for example so that a user can access a device by speaking into its microphone. Distortion of the incoming voice signal by external sounds greatly degrades the performance of speaker identification systems, and noise can reduce recognition accuracy or cause the system to malfunction. Speech enhancement is therefore critical for improving speaker recognition under challenging conditions. Spectral subtraction is among the most common approaches to audio enhancement because it is simple to apply and requires minimal computation. The proposed system combines three primary modules: background noise reduction, feature extraction, and sound classification. First, speech enhancement is applied to remove additive noise. Second, a combined feature extraction strategy is used in which Mel Frequency Cepstral Coefficients (MFCC) are integrated with a Mel filter bank as a single package. Third, a deep learning algorithm uses the extracted features to recognize the identity of the speaker: a convolutional neural network (CNN) is applied for speech modeling and demonstrates very positive results in classifying participants. The architecture is built in a text-independent configuration. The dataset contains 50 speakers, each with 20 voice samples. Accuracy and precision were used to check the feasibility of the model. The hybrid approach attained an accuracy and precision of 98.46%.
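The noise-reduction step named in the abstract can be illustrated with a minimal sketch of magnitude spectral subtraction. The abstract does not give the authors' parameters or implementation, so everything here is an assumption for illustration: the function names `estimate_noise` and `spectral_subtract`, the over-subtraction factor `alpha`, the spectral `floor`, and the premise that the first few frames contain noise only. The input is assumed to be a list of per-frame magnitude spectra already computed by a short-time Fourier transform.

```python
def estimate_noise(mag_frames, n_noise_frames):
    """Average the first n_noise_frames magnitude spectra, bin by bin.

    Assumption: those leading frames contain background noise only.
    """
    noise_only = mag_frames[:n_noise_frames]
    n_bins = len(noise_only[0])
    return [sum(frame[k] for frame in noise_only) / len(noise_only)
            for k in range(n_bins)]


def spectral_subtract(mag_frames, noise_mag, alpha=1.0, floor=0.02):
    """Subtract the noise estimate from every frame's magnitude spectrum.

    alpha scales the noise estimate (hypothetical default of 1.0).
    floor keeps each bin at no less than floor * original magnitude,
    so that no bin becomes negative after subtraction.
    """
    cleaned = []
    for frame in mag_frames:
        cleaned.append([max(m - alpha * n, floor * m)
                        for m, n in zip(frame, noise_mag)])
    return cleaned


# Toy input: two noise-only frames, then one frame with a peak in bin 1.
frames = [[1.0, 1.0, 1.0],
          [1.0, 1.0, 1.0],
          [1.0, 5.0, 1.0]]
noise = estimate_noise(frames, n_noise_frames=2)
cleaned = spectral_subtract(frames, noise, floor=0.0)
```

In a full pipeline the cleaned magnitudes would be recombined with the original phase and inverted back to a waveform before feature extraction; that step is omitted here.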
Key words
speaker identification, speech enhancement, feature extraction, MFCC, deep learning, CNN