A Comprehensive Review on Machine Learning Approaches for Enhancing Human Speech Recognition

Maha Adnan Shanshool, Husam Ali Abdulmohsin

Traitement du Signal (2023)

Abstract
As a fundamental element of human-computer interaction, speech recognition (the ability of software systems to identify and interpret human language) has attracted considerable attention in recent years. This review offers a rigorous examination of machine learning techniques used to improve speech recognition. It surveys the use of prominent datasets, such as LibriSpeech, TIMIT, and VoxForge, in speech recognition research and highlights their contribution to improving the accuracy of recognition systems. Furthermore, the efficacy of several classification techniques, including deep neural networks (DNN), convolutional neural networks (CNN), support vector machines (SVM), and random forests (RF), is evaluated in the context of voice recognition. Mel-Frequency Cepstral Coefficients (MFCC) are observed to often provide superior discriminative power in human voice recognition trials. This review aims to offer useful insights to researchers and practitioners in speech recognition and to support future advances in the field.
Keywords
human speech recognition, machine learning approaches, machine learning