Speaker-aware long short-term memory multi-task learning for speech recognition

Gueorgui Pironkov,Stephane Dupont,Thierry Dutoit

doi:10.1109/eusipco.2016.7760581

Abstract

In order to address the commonly met issue of overfitting in speech recognition, this article investigates Multi-Task Learning, when the auxiliary task focuses on speaker classification. Overfitting occurs when the amount of training data is limited, leading to an over-sensible acoustic model. Multi-Task Learning is a method, among many other regularization methods, which decreases the overfitting impact by forcing the acoustic model to train jointly for multiple different, but related, tasks. In this paper, we consider speaker classification as an auxiliary task in order to improve the generalization abilities of the acoustic model, by training the model to recognize the speaker, or find the closest one inside the training set. We investigate this Multi-Task Learning setup on the TIMIT database, while the acoustic modeling is performed using a Recurrent Neural Network with Long Short-Term Memory cells.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speaker-aware long short-term memory multi-task learning for speech recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speaker-aware Multi-Task Learning for automatic speech recognition
Gueorgui Pironkov ... Stephane Dupont
-
Gueorgui Pironkov, et. al.Gueorgui Pironkov ... Stephane Dupont
01 Dec 2016
01 Dec 2016

I-Vector estimation as auxiliary task for Multi-Task Learning based acoustic modeling for automatic speech recognition
Gueorgui Pironkov ... Thierry Dutoit
-
Gueorgui Pironkov, et. al.Gueorgui Pironkov ... Thierry Dutoit
01 Dec 2016
01 Dec 2016

Investigating the impact of the training data volume for robust speech recognition using multi-task learning
Gueorgui Pironkov ... Thierry Dutoit
-
Gueorgui Pironkov, et. al.Gueorgui Pironkov ... Thierry Dutoit
01 Dec 2017
01 Dec 2017

Exploring recurrent neural network based acoustic and linguistic modeling for children's speech recognition
Sreeram Ganji ... Rohit Sinha
-
Sreeram Ganji, et. al.Sreeram Ganji ... Rohit Sinha
01 Nov 2017
01 Nov 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speaker-aware long short-term memory multi-task learning for speech recognition

Abstract

Talk to us

Similar Papers