Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting

Brian Kan-Wing Mak Brian Kan-Wing Mak,J.T Kwok,Roger Wend-Huu Hsiao Roger Wend-Huu Hsiao,Simon Ka-Lung Ho Simon Ka-Lung Ho

doi:10.1109/tsa.2005.860836

Abstract

Recently, we proposed an improvement to the conventional eigenvoice (EV) speaker adaptation using kernel methods. In our novel kernel eigenvoice (KEV) speaker adaptation, speaker supervectors are mapped to a kernel-induced high dimensional feature space, where eigenvoices are computed using kernel principal component analysis. A new speaker model is then constructed as a linear combination of the leading eigenvoices in the kernel-induced feature space. KEV adaptation was shown to outperform EV, MAP, and MLLR adaptation in a TIDIGITS task with less than 10 s of adaptation speech. Nonetheless, due to many kernel evaluations, both adaptation and subsequent recognition in KEV adaptation are considerably slower than conventional EV adaptation. In this paper, we solve the efficiency problem and eliminate all kernel evaluations involving adaptation or testing observations by finding an approximate pre-image of the implicit adapted model found by KEV adaptation in the feature space; we call our new method embedded kernel eigenvoice (eKEV) adaptation. eKEV adaptation is faster than KEV adaptation, and subsequent recognition runs as fast as normal HMM decoding. eKEV adaptation makes use of multidimensional scaling technique so that the resulting adapted model lies in the span of a subset of carefully chosen training speakers. It is related to the reference speaker weighting (RSW) adaptation method that is based on speaker clustering. Our experimental results on Wall Street Journal show that eKEV adaptation continues to outperform EV, MAP, MLLR, and the original RSW method. However, by adopting the way we choose the subset of reference speakers for eKEV adaptation, we may also improve RSW adaptation so that it performs as well as our eKEV adaptation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech and Language Processing	Publication Date: Jul 1, 2006
Citations: 56

Similar Papers

Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA
Brian Mak ... Simon Ho
-
Brian Mak, et. al.Brian Mak ... Simon Ho
04 Oct 2004
04 Oct 2004

Using kernel PCA to improve eigenvoice speaker adaptation
B Mak ... S Ho
-
B Mak, et. al.B Mak ... S Ho
26 Aug 2004
26 Aug 2004

A study of various composite kernels for kernel eigenvoice speaker adaptation
B Mak ... J.T Kwok
-
B Mak, et. al.B Mak ... J.T Kwok
17 May 2004
17 May 2004

Various Reference Speakers Determination Methods for Embedded Kernel Eigenvoice Speaker Adaptation
B Mak ... S Ho
-
B Mak, et. al.B Mak ... S Ho
18 Mar 2005
18 Mar 2005

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing