Speaker adaptive training for deep neural networks embedding linear transformation networks

Tsubasa Ochiai, Chiori Hori, Hideyuki Watanabe, Xugang Lu, Shigeru Katagiri, Shigeki Matsuda

doi:10.1109/icassp.2015.7178843

Abstract

Recently, a novel speaker adaptation method was proposed that applies the Speaker Adaptive Training (SAT) concept to a speech recognizer consisting of a Deep Neural Network (DNN) and a Hidden Markov Model (HMM), and its utility was demonstrated. This method implements the SAT scheme by allocating one Speaker Dependent (SD) module per training speaker to one of the intermediate layers of the front-end DNN. It then jointly optimizes the SD modules and the rest of the network, which is shared by all speakers. In this paper, we propose an improved version of this SAT-based adaptation scheme for a DNN-HMM recognizer. Our new training scheme adopts a Linear Transformation Network (LTN) as the SD module. Using an LTN leads to more appropriate regularization in both the SAT and adaptation stages, because it replaces the empirically selected network anchorage used for regularization in the preceding SAT-DNN-HMM with a SAT-optimized anchorage. We evaluate the effectiveness of our proposed method on TED Talks corpus data. Our experimental results show that a speaker-adapted recognizer using our method achieves a significant word error rate reduction of 9.2 points from a baseline SI-DNN recognizer and also consistently outperforms speaker-adapted recognizers that originate from the preceding SAT-based DNN-HMM.
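As a rough illustration of the structure the abstract describes, the following minimal NumPy sketch inserts one linear transformation per speaker after a chosen hidden layer of a shared feed-forward network. It is not the authors' implementation: the class name `SatLtnNetwork`, the helper `ltn_penalty`, the layer sizes, the sigmoid activations, the identity initialization of each LTN, and the squared-distance form of the regularizer are all assumptions made for illustration, and training (the joint optimization itself) is omitted.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class SatLtnNetwork:
    """Shared feed-forward network with one linear transform (LTN) per speaker.

    Hypothetical sketch: names, sizes, and initialization are assumptions.
    """

    def __init__(self, dims, ltn_layer, speakers, seed=0):
        rng = np.random.default_rng(seed)
        # Shared (speaker-independent) weights and biases.
        self.weights = [rng.normal(0.0, 0.1, (dims[i], dims[i + 1]))
                        for i in range(len(dims) - 1)]
        self.biases = [np.zeros(dims[i + 1]) for i in range(len(dims) - 1)]
        # Index of the hidden layer whose output is passed through the LTN.
        self.ltn_layer = ltn_layer
        width = dims[ltn_layer + 1]
        # One (matrix, bias) pair per speaker, assumed to start as identity.
        self.ltn = {s: (np.eye(width), np.zeros(width)) for s in speakers}

    def forward(self, x, speaker):
        h = x
        last = len(self.weights) - 1
        for i, (W, b) in enumerate(zip(self.weights, self.biases)):
            z = h @ W + b
            if i == last:
                # Numerically stable softmax over the output classes.
                z = z - z.max(axis=-1, keepdims=True)
                e = np.exp(z)
                return e / e.sum(axis=-1, keepdims=True)
            h = sigmoid(z)
            if i == self.ltn_layer:
                # Speaker-dependent linear transform of the hidden activations.
                A, c = self.ltn[speaker]
                h = h @ A + c
        return h


def ltn_penalty(A, c, anchor_A, anchor_c):
    """Squared distance of an LTN from an anchor (assumed regularizer form)."""
    return float(np.sum((A - anchor_A) ** 2) + np.sum((c - anchor_c) ** 2))


net = SatLtnNetwork([8, 16, 16, 5], ltn_layer=1, speakers=["spk_a", "spk_b"])
frames = np.ones((3, 8))
posteriors = net.forward(frames, "spk_a")  # shape (3, 5), rows sum to 1
```

With identity-initialized LTNs every speaker yields the same output, so the network starts out behaving like a speaker-independent model. Which anchor the penalty should pull toward is the design question the abstract raises, and this sketch leaves it as a caller-supplied argument rather than asserting a particular choice.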

Similar Papers

Speaker Adaptive Training using Deep Neural Networks
Tsubasa Ochiai ... Chiori Hori
01 May 2014

Ensemble speaker modeling using speaker adaptive training deep neural network for speaker adaptation
Sheng Li ... Tatsuya Kawahara
06 Sep 2015

Speaker Adaptation on Myanmar Spontaneous Speech Recognition
Hay Mar Soe Naing ... Win Pa Pa
01 Jan 2018

Towards speaker adaptive training of deep neural network acoustic models
Yajie Miao ... Hao Zhang
14 Sep 2014
