Language model cross adaptation for LVCSR system combination

X Liu,M.J.F Gales,P.C Woodland

doi:10.1016/j.csl.2012.07.010

Abstract

State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often combine outputs from multiple sub-systems that may even be developed at different sites. Cross system adaptation, in which model adaptation is performed using the outputs from another sub-system, can be used as an alternative to hypothesis level combination schemes such as ROVER. Normally cross adaptation is only performed on the acoustic models. However, there are many other levels in LVCSR systems’ modelling hierarchy where complimentary features may be exploited, for example, the sub-word and the word level, to further improve cross adaptation based system combination. It is thus interesting to also cross adapt language models (LMs) to capture these additional useful features. In this paper cross adaptation is applied to three forms of language models, a multi-level LM that models both syllable and word sequences, a word level neural network LM, and the linear combination of the two. Significant error rate reductions of 4.0–7.1% relative were obtained over ROVER and acoustic model only cross adaptation when combining a range of Chinese LVCSR sub-systems used in the 2010 and 2011 DARPA GALE evaluations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Language model cross adaptation for LVCSR system combination

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Jul 27, 2012
Citations: 17

Similar Papers

Language model cross adaptation for LVCSR system combination
Xunying Liu ... Mark J F Gales
-
Xunying Liu, et. al.Xunying Liu ... Mark J F Gales
26 Sep 2010
26 Sep 2010

Quantifying the value of pronunciation lexicons for keyword search in lowresource languages
Guoguo Chen ... Oguz Yilmaz
-
Guoguo Chen, et. al.Guoguo Chen ... Oguz Yilmaz
01 May 2013
01 May 2013

Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR
Tara N Sainath ... David Nahamoo
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 19
Tara N Sainath, et. al.Tara N Sainath ... David Nahamoo
01 Nov 2011
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 19

Acoustic models of the elderly for large‐vocabulary continuous speech recognition
Akira Baba ... Shinichi Yoshizawa
Electronics and Communications in Japan (Part II: Electronics) | VOL. 87
Akira Baba, et. al.Akira Baba ... Shinichi Yoshizawa
09 Jun 2004
Electronics and Communications in Japan (Part II: Electronics) | VOL. 87

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Language model cross adaptation for LVCSR system combination

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language