Semi-supervised maximum mutual information training of deep neural network acoustic models

Vimal Manohar,Daniel Povey,Sanjeev Khudanpur

doi:10.21437/interspeech.2015-561

Abstract

Maximum Mutual Information (MMI) is a popular discriminative criterion that has been used in supervised training of acoustic models for automatic speech recognition. However, standard discriminative training is very sensitive to the accuracy of the transcription and hence its implementation in a semisupervised setting requires extensive filtering of data. We will show that if the supervision transcripts are not known, the natural analogue of MMI is to minimize the conditional entropy of the lattice of possible transcripts of the data. This is equivalent to the weighted average of MMI criterion over different reference transcripts, taking those reference transcripts and their weighting from the lattice itself. In this paper we describe experiments where we applied this method to the semi-supervised training of Deep Neural Network acoustic models. In our experimental setup, the proposed method gives up to 0.5% absolute WER improvement over a DNN trained with sMBR only on the transcribed part of the data. This is 37% of the improvement that we would get from doing sMBR training if we had the transcripts for the untranscribed part of the data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Semi-supervised maximum mutual information training of deep neural network acoustic models

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies
Xiaodong Cui ... Wei Zhang
IEEE Signal Processing Magazine | VOL. 37
Xiaodong Cui, et. al.Xiaodong Cui ... Wei Zhang
01 May 2020
IEEE Signal Processing Magazine | VOL. 37

Gaussian mixture models for adaptation of deep neural network acoustic models in automatic speech recognition systems
N.A Tomashenko ... Yu.N Matveev
Scientific and Technical Journal of Information Technologies, Mechanics and Optics | VOL. 106
N.A Tomashenko, et. al.N.A Tomashenko ... Yu.N Matveev
15 Nov 2016
Scientific and Technical Journal of Information Technologies, Mechanics and Optics | VOL. 106

Large-Scale Semi-Supervised Training in Deep Learning Acoustic Model for ASR
Yanhua Long ... Yijie Li
IEEE Access | VOL. 7
Yanhua Long, et. al.Yanhua Long ... Yijie Li
01 Jan 2019
IEEE Access | VOL. 7

Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Xiaodong Cui ... Takashi Fukuda
-
Xiaodong Cui, et. al.Xiaodong Cui ... Takashi Fukuda
18 Sep 2022
18 Sep 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semi-supervised maximum mutual information training of deep neural network acoustic models

Abstract

Talk to us

Similar Papers