Automatic Model Complexity Control Using Marginalized Discriminative Growth Functions

Xunying Liu, Mark Gales

doi:10.1109/tasl.2006.889804

Abstract

Selecting the model structure with the "appropriate" complexity is a standard problem for training large-vocabulary continuous-speech recognition (LVCSR) systems. State-of-the-art LVCSR systems are highly complex, and a wide variety of techniques may be used that alter both the system complexity and the word error rate (WER). Explicitly evaluating systems for all possible configurations is infeasible; hence, an automatic model complexity control criterion is highly desirable. Most existing complexity control schemes can be classified into two types: Bayesian learning techniques and information-theoretic approaches. Both make the implicit assumption that increasing the likelihood on held-out data decreases the WER. However, this correlation has been found to be quite weak for current speech recognition systems. This paper presents a novel discriminative complexity control technique, the marginalization of a discriminative growth function, which is a closer approximation to the true WER than standard approaches. Experimental results on a standard LVCSR Switchboard task showed that marginalized discriminative growth functions outperform manually tuned systems and conventional complexity control techniques, such as the Bayesian information criterion (BIC), in terms of WER.
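
For readers unfamiliar with the conventional baseline named in the abstract, the following is a minimal sketch of BIC-style model selection in its penalized log-likelihood form. It is not the paper's method: the function names, the `penalty` scaling factor, and all numbers are hypothetical and chosen only for illustration.

```python
import math


def bic_score(log_likelihood, num_params, num_samples, penalty=1.0):
    """Penalized log-likelihood form of BIC (higher is better).

    score = log L - penalty * (k / 2) * log N
    `penalty` is an illustrative tuning factor; penalty=1.0 gives standard BIC.
    """
    return log_likelihood - penalty * 0.5 * num_params * math.log(num_samples)


def select_by_bic(candidates, num_samples, penalty=1.0):
    """Return the name of the candidate with the highest BIC score.

    candidates: list of (name, log_likelihood, num_params) tuples.
    """
    best = max(
        candidates,
        key=lambda c: bic_score(c[1], c[2], num_samples, penalty),
    )
    return best[0]


# Hypothetical candidate systems: (name, training log-likelihood, parameter count).
candidates = [
    ("small", -10500.0, 100),
    ("medium", -10000.0, 200),
    ("large", -9950.0, 400),
]

best = select_by_bic(candidates, num_samples=5000)
print(best)
```

Note that the score depends only on likelihood and parameter count. It never looks at recognition errors, which is the weak link to WER that the abstract points out.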

Journal: IEEE Transactions on Audio, Speech and Language Processing
Publication Date: May 1, 2007
Citations: 41

Similar Papers

Quantifying the value of pronunciation lexicons for keyword search in low resource languages
Guoguo Chen ... Oguz Yilmaz
01 May 2013

Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR
Tara N Sainath ... David Nahamoo
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 19
01 Nov 2011

Automatic generation of subword units for speech recognition systems
R Singh ... R.M Stern
IEEE Transactions on Speech and Audio Processing | VOL. 10
01 Jan 2002

Acoustic models of the elderly for large‐vocabulary continuous speech recognition
Akira Baba ... Shinichi Yoshizawa
Electronics and Communications in Japan (Part II: Electronics) | VOL. 87
09 Jun 2004
