Discriminative fuzzy clustering maximum a posterior linear regression for speaker adaptation

Ting-Yao Hu,Lin-Shan Lee,Yu Tsao

doi:10.21437/interspeech.2012-174

Abstract

We propose a discriminative fuzzy clustering maximum a posterior linear regression (DFCMAPLR) model adaptation approach to compensate the acoustic mismatch due to speaker variability. The DFCMAPLR approach adopts the MAP criterion and a discriminative objective function to estimate shared affine transform and fuzzy weight sets, respectively. Then, through a linear combination of the calculated fuzzy weights and shared affine transforms, more specific affine transforms are formed for model adaptation. By incorporating the MAP criterion and the discriminative information, DFCMAPLR can calculate shared affine transforms reliably and enhance the discriminative power of the adapted acoustic model. Based on the experimental results on the ASTTEL200 Mandarin corpus, we verified that DFCMAPLR outperforms not only the conventional maximum likelihood linear regression (MLLR) but also the fuzzy clustering MLLR(FCMLLR), which estimates the shared affine transform and fuzzy weight sets both based on the maximum likelihood criterion. Moreover, when compared to the baseline result, DFCMAPLR provides a clear improvement of 9.86% (24.04% to 21.67%) relative average phone error rate (PER) reduction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discriminative fuzzy clustering maximum a posterior linear regression for speaker adaptation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

SVM Based Speaker Recognition Using Maximum a posteriori Linear Regression
Xiang Zhang ... Qingwei Zhao
-
Xiang Zhang, et. al.Xiang Zhang ... Qingwei Zhao
01 Feb 2009
01 Feb 2009

Maximum a posteriori linear regression for speaker recognition
Xiang Zhang ... Xiang Xiao
-
Xiang Zhang, et. al.Xiang Zhang ... Xiang Xiao
01 Jan 2009
01 Jan 2009

Improved Semi-Parametric Mean Trajectory Model Using Discriminatively Trained Centroids
Ran Xu ... Jielin Pan
-
Ran Xu, et. al.Ran Xu ... Jielin Pan
01 Dec 2008
01 Dec 2008

An evaluation of posterior modeling techniques for phonetic recognition
Rohit Prabhavalkar ... Dimitri Kanevsky
-
Rohit Prabhavalkar, et. al.Rohit Prabhavalkar ... Dimitri Kanevsky
01 May 2013
01 May 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discriminative fuzzy clustering maximum a posterior linear regression for speaker adaptation

Abstract

Talk to us

Similar Papers