Speech Recognition for Turkic Languages Using Cross-Lingual Transfer Learning from Kazakh

BigComp (2023)

Abstract
This paper investigates the effectiveness of transfer learning for building automatic speech recognition models for nine Turkic languages (Azerbaijani, Bashkir, Chuvash, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek) by leveraging large-scale training data from Kazakh. The performance of these models was compared with that of models for three non-Turkic languages (Indonesian, Japanese, and Swedish) to which transfer learning from Kazakh was also applied. We also compared the models with models trained on English data. A total of 64 models were created. Most of the models built using transfer learning from Kazakh outperformed the monolingual baselines; the most notable improvement was observed for the Sakha model, which achieved reductions of 45.5% and 22.8% in word error rate and character error rate, respectively, on the test set. The datasets and code used to train the models are available for download from https://github.com/IS2AI/CLTL Turkic ASR.
Keywords
automatic speech recognition, cross-lingual transfer learning, deep learning, Turkic languages
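
The abstract reports its results as reductions in word error rate (WER) and character error rate (CER). As a reader aid, the following is a minimal sketch of how these metrics are commonly computed from a Levenshtein edit distance. It is not the authors' evaluation code. The function names, the choice to count spaces as characters in CER, and the example strings are assumptions made for illustration. The abstract does not state whether the reported reductions are relative or absolute, so `relative_reduction` shows only one possible reading.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance (substitutions, insertions, deletions) between token sequences."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting i reference tokens to match an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate; spaces are counted as characters (an assumption)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


def relative_reduction(baseline: float, improved: float) -> float:
    """Relative error-rate reduction of `improved` with respect to `baseline`."""
    if baseline <= 0:
        raise ValueError("baseline error rate must be positive")
    return (baseline - improved) / baseline


if __name__ == "__main__":
    # Illustrative strings only; they are not taken from the paper's datasets.
    print(wer("a b c d", "a x c d"))
    print(cer("abcd", "abxd"))
    print(relative_reduction(0.5, 0.25))
```

Under the relative reading, a monolingual baseline and a transfer-learned model would each be scored with `wer` or `cer` on the same test set, and the two scores passed to `relative_reduction`.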