Abstract

Overfitting is a common issue in automatic speech recognition and is especially impactful when the amount of training data is limited. To address this problem, this article investigates acoustic modeling through Multi-Task Learning with two speaker-related auxiliary tasks. Multi-Task Learning is a regularization method that aims to improve the network's generalization ability by training a single model to solve several different, but related, tasks. In this article, two auxiliary tasks are examined jointly. On the one hand, we consider speaker classification as an auxiliary task by training the acoustic model to recognize the speaker, or to find the closest one within the training set. On the other hand, the acoustic model is also trained to extract i-vectors from the standard acoustic features. I-vectors are used effectively in the speaker identification community to characterize a speaker and their acoustic environment. The core idea of using these auxiliary tasks is to give the network additional inter-speaker awareness and thus reduce overfitting. We investigate this Multi-Task Learning setup on the TIMIT database, with acoustic modeling performed by a Recurrent Neural Network with Long Short-Term Memory cells.
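The setup described above can be sketched as a shared LSTM trunk with one main head (phone classification) and two auxiliary heads (speaker classification and i-vector regression). The following is a minimal illustrative sketch in PyTorch; all layer sizes, head names, and the mean-pooling choice are assumptions for illustration, not the paper's exact architecture (461 is the usual count of TIMIT training speakers and 61 its phone-set size, but the paper's precise configuration may differ).

```python
import torch
import torch.nn as nn

class MultiTaskAcousticModel(nn.Module):
    """Hypothetical multi-task LSTM acoustic model (sizes are illustrative)."""
    def __init__(self, n_features=40, n_hidden=128,
                 n_phones=61, n_speakers=461, ivector_dim=100):
        super().__init__()
        # Shared recurrent trunk over acoustic feature frames
        self.lstm = nn.LSTM(n_features, n_hidden, batch_first=True)
        # Main task: frame-level phone classification
        self.phone_head = nn.Linear(n_hidden, n_phones)
        # Auxiliary task 1: classify the speaker among training speakers
        self.speaker_head = nn.Linear(n_hidden, n_speakers)
        # Auxiliary task 2: regress an i-vector from the shared representation
        self.ivector_head = nn.Linear(n_hidden, ivector_dim)

    def forward(self, x):
        h, _ = self.lstm(x)          # h: (batch, time, hidden)
        pooled = h.mean(dim=1)       # utterance-level pooling for the aux heads
        return (self.phone_head(h),
                self.speaker_head(pooled),
                self.ivector_head(pooled))

# Forward pass on a toy batch: 2 utterances of 50 frames, 40 features each
model = MultiTaskAcousticModel()
x = torch.randn(2, 50, 40)
phones, speakers, ivecs = model(x)
print(phones.shape, speakers.shape, ivecs.shape)
```

Training would then minimize a weighted sum of the main cross-entropy loss and the two auxiliary losses (cross-entropy for speaker classification, e.g. mean squared error for i-vector regression), so that gradients from the speaker-related tasks regularize the shared trunk.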
