Automatic Speech Recognition Framework for Indian Languages

semanticscholar(2018)

引用 0|浏览0
暂无评分
摘要
In this project, we worked on developing several automatic speech recognition models for Indian languages, namely Tamil, Telugu and Gujarati using the Kaldi Speech Recognition Toolkit. A HMM-GMM acoustic model in conjugation with a n-gram language model was initially built for converting speech in the above mentioned languages to text. To obtain improved Word Error Rates, a Time Delay Neural Network (TDNN) was run. A Recurrent Neural Network based Language Model (RNNLM) pipeline was then set up to improve the contextual information compared to the n-gram language model. To achieve End-To-End speech recognition, CTC (Connectionist Temporal Classification) was used in conjugation with an Encoder-Decoder framework. A detailed ananlysis of this framework was performed to obtain best results.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要