Using out-of-language data to improve an under-resourced speech recognizer

David Imseng, Petr Motlicek, Hervé Bourlard, Philip N Garner

doi:10.1016/j.specom.2013.01.007

Abstract

Under-resourced speech recognizers may benefit from data in languages other than the target language. In this paper, we report how to boost the performance of an Afrikaans automatic speech recognition system by using already available Dutch data. We successfully exploit available multilingual resources through (1) posterior features, estimated by multilayer perceptrons (MLPs), and (2) subspace Gaussian mixture models (SGMMs). Both the MLPs and the SGMMs can be trained on out-of-language data. We use three different acoustic modeling techniques, namely Tandem, Kullback–Leibler divergence based HMMs (KL-HMMs), and SGMMs, and show that the proposed multilingual systems yield a 12% relative improvement over a conventional monolingual HMM/GMM system trained only on Afrikaans. We also show that KL-HMMs are extremely powerful for under-resourced languages: using only six minutes of Afrikaans data (in combination with out-of-language data), a KL-HMM yields about 30% relative improvement over conventional acoustic model adaptation based on maximum likelihood linear regression and maximum a posteriori estimation.
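
As an illustration of the KL-HMM idea mentioned in the abstract, the following minimal Python sketch scores an MLP posterior feature vector against per-state categorical distributions using the Kullback–Leibler divergence. It is not the authors' implementation: the function names, the direction of the divergence, and the toy numbers are assumptions made for illustration only.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) between two categorical distributions of equal dimension.

    eps is a small smoothing constant (an assumption of this sketch) that
    avoids division by zero when q has zero entries.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same dimension")
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0.0
    )


def score_frame(posterior, state_distributions):
    """Score one posterior feature vector against every state.

    Each state is represented by a categorical distribution over the MLP
    output classes. The local score is KL(state || posterior); other
    variants (reverse or symmetric divergence) are equally possible.
    Returns the index of the lowest-divergence state and all scores.
    """
    scores = [kl_divergence(y, posterior) for y in state_distributions]
    best = min(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Toy example: two hypothetical states over three phone classes.
states = [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1]]
frame = [0.7, 0.2, 0.1]  # hypothetical MLP posterior for one frame
best, scores = score_frame(frame, states)
```

In this toy case the frame is closest to the first state, because its posterior mass is concentrated on the same class. In a full system, such local scores would replace the Gaussian likelihoods in Viterbi decoding.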

Journal: Speech Communication
Publication Date: Feb 8, 2013
Citations: 46

Similar Papers

Feature and score level combination of subspace Gaussians in LVCSR task
Petr Motlicek ... Daniel Povey
01 May 2013

Development of Robust Automatic Speech Recognition System for Children's using Kaldi Toolkit
Vivek Bhardwaj ... Virender Kadyan
01 Jul 2020

Accent adaptation using Subspace Gaussian Mixture Models
Petr Motlicek ... Namhoon Kim
01 May 2013

Performance Analysis of various Front-end and Back End Amalgamations for Noise-robust DNN-based ASR
Mohit Dua ... Vinam Agrawal
Recent Advances in Computer Science and Communications | VOL. 14
01 Dec 2021
