Fine-Tuning the Wav2vec2 Model for Kazakh Speech: A Study on a Limited Corpus

2023 IEEE International Conference on Smart Information Systems and Technologies (SIST)(2023)

引用 0|浏览0
暂无评分
摘要
In this study, we developed a model for automatic recognition of Kazakh speech by fine-tuning the XLSR-Wav2Vec2 pre-trained model to a corpus of Kazakh speech. Our results show that fine-tuning the wav2vec2 model on a small corpus of Kazakh speech allows a significant increase in recognition accuracy. However, larger datasets are needed to further evaluate the effectiveness of this approach. The results of this study contribute to ongoing efforts to improve speech recognition technology for low-resource languages such as Kazakh.
更多
查看译文
关键词
automatic speech recognition,Kazakh language,Wav2Vec2
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要