谷歌Chrome浏览器插件
订阅小程序
在清言上使用

Residual Convolutional Neural Network-Based Dysarthric Speech Recognition

Arabian Journal for Science and Engineering(2024)

引用 0|浏览5
暂无评分
摘要
People with dysarthric speech face problems communicating with others and voice-based smart devices. This paper presents the development of a spatial residual convolutional neural network (RCNN)-based dysarthric speech recognition (DSR) system to improve communication for individuals with dysarthric speech. The RCNN model is simplified to an optimal number of layers. The system utilizes a speaker-adaptive approach, incorporating transfer learning to leverage knowledge learned from healthy individuals and a new data augmentation technique to address voice hoarseness in patients. The dysarthric speech is preprocessed using a novel voice cropping technique based on erosion and dilation methods to eliminate unnecessary pauses and hiccups in the time domain. The isolated word recognition accuracy improved by nearly 8.16
更多
查看译文
关键词
Dysarthric speech,UASpeech database,Residual convolutional neural network,Speech recognition,Deep neural network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要