Flat start training of CD-CTC-SMBR LSTM RNN acoustic models

Kanishka Rao,Andrew Senior,Hasim Sak

doi:10.1109/icassp.2016.7472710

Abstract

We present a recipe for training acoustic models with context dependent (CD) phones from scratch using recurrent neural networks (RNNs). First, we use the connectionist temporal classification (CTC) technique to train a model with context independent (CI) phones directly from the written-domain word transcripts by aligning with all possible phonetic verbalizations. Then, we devise a mechanism to generate a set of CD phones using the CTC CI phone model alignments and train a CD phone model to improve the accuracy. This end-to-end training recipe does not require any previously trained GMM-HMM or DNN model for CD phone generation or alignment, and thus drastically reduces the overall model building time. We show that using this procedure does not degrade the performance of models and allows us to improve models more quickly by updates to pronunciations or training data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Flat start training of CD-CTC-SMBR LSTM RNN acoustic models

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Fast and accurate recurrent neural network acoustic models for speech recognition
Haşim Sak ... Kanishka Rao
-
Haşim Sak, et. al.Haşim Sak ... Kanishka Rao
06 Sep 2015
06 Sep 2015

An empirical exploration of CTC acoustic models
Yajie Miao ... Tom Ko
-
Yajie Miao, et. al.Yajie Miao ... Tom Ko
01 Mar 2016
01 Mar 2016

Study of subword units for Spanish speech recognition
Antonio Bonafonte ... Eugenio Vives
-
Antonio Bonafonte, et. al.Antonio Bonafonte ... Eugenio Vives
18 Sep 1995
18 Sep 1995

A static lexicon network representation for cross-word context dependent phones
Kris Demuynck ... Jacques Duchateau
-
Kris Demuynck, et. al.Kris Demuynck ... Jacques Duchateau
22 Sep 1997
22 Sep 1997

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Flat start training of CD-CTC-SMBR LSTM RNN acoustic models

Abstract

Talk to us

Similar Papers