Speaker adaptation using maximum likelihood model interpolation

Zuoying Wang Zuoying Wang,Feng Liu Feng Liu

doi:10.1109/icassp.1999.759777

Abstract

A speaker adaptation scheme named maximum likelihood model interpolation (MLMI) is proposed. The basic idea of MLMI is to compute the speaker adapted (SA) model of a test speaker by a linear convex combination of a set of speaker dependent (SD) models. Given a set of training speakers, we first calculate the corresponding SD models for each training speaker as well as the speaker-independent (SI) models. Then, the mean vector of the SA model is computed as the weighted sum of the set of the SD mean vectors, while the covariance matrix is the same as that of the SI model. An algorithm to estimate the weight parameters is given which maximizes the likelihood of the SA model given the adaptation data. Experiments show that 3 adaptation sentences can give a significant performance improvement. As the number of SD models increases, further improvement can be obtained.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speaker adaptation using maximum likelihood model interpolation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speaker normalization and adaptation based on linear transformation
J Ishii ... M Tonomura
-
J Ishii, et. al.J Ishii ... M Tonomura
21 Apr 1997
21 Apr 1997

Adversarial Speaker Adaptation
Zhong Meng ... Jinyu Li
-
Zhong Meng, et. al.Zhong Meng ... Jinyu Li
29 Apr 2019
29 Apr 2019

Neighbour selection and adaptation for rapid speaker-dependent ASR
Udhyakumar Nallasamy ... Tanja Schultz
-
Udhyakumar Nallasamy, et. al.Udhyakumar Nallasamy ... Tanja Schultz
01 Dec 2013
01 Dec 2013

Linear Networks Based Speaker Adaptation for Speech Synthesis
Zhiying Huang ... Heng Lu
-
Zhiying Huang, et. al.Zhiying Huang ... Heng Lu
01 Apr 2018
01 Apr 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speaker adaptation using maximum likelihood model interpolation

Abstract

Talk to us

Similar Papers