Fine-Tuning the Wav2vec2 Model for Kazakh Speech: A Study on a Limited Corpus
2023 IEEE International Conference on Smart Information Systems and Technologies (SIST)(2023)
摘要
In this study, we developed a model for automatic recognition of Kazakh speech by fine-tuning the XLSR-Wav2Vec2 pre-trained model to a corpus of Kazakh speech. Our results show that fine-tuning the wav2vec2 model on a small corpus of Kazakh speech allows a significant increase in recognition accuracy. However, larger datasets are needed to further evaluate the effectiveness of this approach. The results of this study contribute to ongoing efforts to improve speech recognition technology for low-resource languages such as Kazakh.
更多查看译文
关键词
automatic speech recognition,Kazakh language,Wav2Vec2
AI 理解论文
溯源树
样例
![](https://originalfileserver.aminer.cn/sys/aminer/pubs/mrt_preview.jpeg)
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要