Evolutionary eigenvoice MLLR speaker adaptation

Reza Sahraeian,Ahmad Akbari,Mehdi Mohammadi,Ahmad Ayatollahi

doi:10.1016/j.procs.2010.12.163

Reza Sahraeian, Ahmad Akbari + Show 2 more

Open Access

https://doi.org/10.1016/j.procs.2010.12.163

Copy DOI

Abstract

Abstract This paper considers the problem of rapid and robust speaker adaptation in automatic speech recognition (ASR) systems. We propose an approach using combination of eigenspace-based maximum likelihood linear regression (EMLLR) and evolutionary algorithms. To find the best solution for the coefficients estimation problem, we suggest using genetic algorithm (GA) for rapid speaker adaptation. This is due to the fact that genetic algorithms are not as sensitive as expectation maximization (EM) algorithm to the amount of adaptation data. Experimental results on TIMIT database illustrate that genetic algorithm, using random individuals in first population, leads to up to 1.03% improvement in phoneme recognition rate. Moreover, we show that if the first population contains coefficients initially estimated by maximum likelihood criterion, further improvement can be achieved as well. However, the amount of adaptation data does not have considerable effect on the proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evolutionary eigenvoice MLLR speaker adaptation

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science

Lead the way for us

Journal: Procedia Computer Science	Publication Date: Jan 1, 2011
License type: cc-by-nc-nd

Similar Papers

Rapid feature space MLLR speaker adaptation for deep neural network acoustic modeling
Shilei Zhang ... Yong Qin
-
Shilei Zhang, et. al. Shilei Zhang ... Yong Qin
01 Dec 2016
01 Dec 2016

Aggregate a Posteriori Linear Regression for Speaker Adaptation
Chih-Hsien Huang ... Jen-Tzung Chien
-
Chih-Hsien Huang, et. al. Chih-Hsien Huang ... Jen-Tzung Chien
18 Mar 2005
18 Mar 2005

Rapid speaker adaptation using compressive sensing
Wen-Lin Zhang ... Bi-Cheng Li
Speech Communication | VOL. 55
Wen-Lin Zhang, et. al.Wen-Lin Zhang ... Bi-Cheng Li
28 Jun 2013
Speech Communication | VOL. 55

Implementing PCA-based speaker adaptation methods in a Persian ASR system
Zohreh Ansari ... Farshad Almasganj
-
Zohreh Ansari, et. al.Zohreh Ansari ... Farshad Almasganj
01 Dec 2010
01 Dec 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evolutionary eigenvoice MLLR speaker adaptation

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science