Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system

S Matsoukas,L Lamel,G Adda,L Nguyen,J.-L Gauvain,R Schwartz,O Kimball,F Lefevre,R Prasad,T Colthurst,J Makhoul,Chia-Lin Kao Chia-Lin Kao,J.Z Ma,H Schwenk,Bing Xiang Bing Xiang

doi:10.1109/tasl.2006.878257

Abstract

This paper describes the progress made in the transcription of broadcast news (BN) and conversational telephone speech (CTS) within the combined BBN/LIMSI system from May 2002 to September 2004. During that period, BBN and LIMSI collaborated in an effort to produce significant reductions in the word error rate (WER), as directed by the aggressive goals of the Effective, Affordable, Reusable, Speech-to-text [Defense Advanced Research Projects Agency (DARPA) EARS] program. The paper focuses on general modeling techniques that led to recognition accuracy improvements, as well as engineering approaches that enabled efficient use of large amounts of training data and fast decoding architectures. Special attention is given on efforts to integrate components of the BBN and LIMSI systems, discussing the tradeoff between speed and accuracy for various system combination strategies. Results on the EARS progress test sets show that the combined BBN/LIMSI system achieved relative reductions of 47% and 51% on the BN and CTS domains, respectively

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech and Language Processing	Publication Date: Sep 1, 2006
Citations: 79

Similar Papers

Advances in automatic transcription of Italian broadcast news
Fabio Brugnara ... Mauro Cettolo
-
Fabio Brugnara, et. al.Fabio Brugnara ... Mauro Cettolo
16 Oct 2000
16 Oct 2000

Multimedia interaction for the new millennium
Mark T Maybury
-
Mark T MayburyMark T Maybury
05 Sep 1999
05 Sep 1999

Large scale discriminative training of hidden Markov models for speech recognition
P.C Woodland ... D Povey
Computer Speech & Language | VOL. 16
P.C Woodland, et. al.P.C Woodland ... D Povey
01 Jan 2002
Computer Speech & Language | VOL. 16

Sentence boundary detection of spontaneous Japanese using statistical language model and support vector machines
Yuya Akita ... Tatsuya Kawahara
-
Yuya Akita, et. al.Yuya Akita ... Tatsuya Kawahara
17 Sep 2006
17 Sep 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing