Acoustic modelling with CD-CTC-SMBR LSTM RNNS

Andrew W. Senior,Hasim Sak,Felix de Chaumont Quitry,Tara N. Sainath,Kanishka Rao

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)（2015）

引用 129|浏览101

暂无评分

摘要

This paper describes a series of experiments to extend the application of Context-Dependent (CD) long short-term memory (LSTM) recurrent neural networks (RNNs) trained with Connectionist Temporal Classification (CTC) and sMBR loss. Our experiments, on a noisy, reverberant voice search task, include training with alternative pronunciations and the application to child speech recognition; combination of multiple models, and convolutional input layers. We also investigate the latency of CTC models and show that constraining forward-backward alignment in training can reduce the delay for a real-time streaming speech recognition system. Finally we investigate transferring knowledge from one network to another through alignments.

查看译文

关键词

Long Short Term Memory,Recurrent Neural Networks,Connectionist Temporal Classification,sequence discriminative training,knowledge transfer

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要