MPE-based discriminative linear transforms for speaker adaptation

Lan Wang,Philip C Woodland

doi:10.1016/j.csl.2007.09.001

Abstract

In this paper, the use of discriminative linear transforms (DLT) is investigated to construct speaker adaptive speech recognition systems, where a discriminative criterion rather than ML is used for transform parameter estimation. The minimum phone error (MPE) criterion is investigated for DLT estimation, by making use of a so-called weak-sense auxiliary function to derive the estimation formulae. An implementation based on lattices is used for DLT statistics accumulation, where the use of a weakened language model allows more confusion data to be included. To improve DLT estimation for unsupervised adaptation, a method of incorporating word correctness information of the supervision into transform estimation is developed. The confidence scores calculated by confusion network decoding are used to represent the word correctness and weight the numerator statistics during DLT estimation. This makes the DLT estimation less sensitive to errors in the supervision. Experiments on transcription of read newspaper data and on conversational telephone speech transcription have shown the improvements of DLT over MLLR for both supervised and unsupervised adaptation, and the effectiveness of confidence scores for improving both normal and DLT-based MLLR adaptation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MPE-based discriminative linear transforms for speaker adaptation

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Sep 29, 2007
Citations: 24

Similar Papers

Differenced maximum mutual information criterion for robust unsupervised acoustic model adaptation
Marc Delcroix ... Atsushi Nakamura
Computer Speech & Language | VOL. 36
Marc Delcroix, et. al.Marc Delcroix ... Atsushi Nakamura
19 Aug 2015
Computer Speech & Language | VOL. 36

Automatic transcription of conversational telephone speech
T Hain ... D Povey
IEEE Transactions on Speech and Audio Processing | VOL. 13
T Hain, et. al.T Hain ... D Povey
01 Nov 2005
IEEE Transactions on Speech and Audio Processing | VOL. 13

Discriminative speaker adaptation in Persian continuous speech recognition systems
Shadi Pirhosseinloo ... Farshad Almas Ganj
Procedia - Social and Behavioral Sciences | VOL. 32
Shadi Pirhosseinloo, et. al.Shadi Pirhosseinloo ... Farshad Almas Ganj
01 Jan 2012
Procedia - Social and Behavioral Sciences | VOL. 32

Discriminative adaptive training using the MPE criterion
L Wang ... P.C Woodland
-
L Wang, et. al.L Wang ... P.C Woodland
16 Sep 2003
16 Sep 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MPE-based discriminative linear transforms for speaker adaptation

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language