Developing speech recognition systems for corpus indexing under the IARPA Babel program

Jia Cui, Jonathan Mamou, Xiaodong Cui, Michael Picheny, Janice Kim, Abhinav Sethy, Bhuvana Ramabhadran, Lidia Mangu, Brian Kingsbury, Tara N Sainath

doi:10.1109/icassp.2013.6638969

Abstract

Automatic speech recognition is a core component of many applications, including keyword search. In this paper we describe experiments on acoustic modeling, language modeling, and decoding for keyword search on a Cantonese conversational telephony corpus collected as part of the IARPA Babel program. We show that acoustic modeling techniques such as the bootstrapped-and-restructured model and deep neural network acoustic model significantly outperform a state-of-the-art baseline GMM/HMM model, in terms of both recognition performance and keyword search performance, with improvements of up to 11% relative character error rate reduction and 31% relative maximum term weighted value improvement. We show that while an interpolated Model M and neural network LM improve recognition performance, they do not improve keyword search results; however, the advanced LM does reduce the size of the keyword search index. Finally, we show that a simple form of automatically adapted keyword search performs 16% better than a preindexed search system, indicating that out-of-vocabulary search is still a challenge.
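
The recognition gains above are quoted as relative reductions in character error rate (CER). As a rough illustration of that arithmetic, the sketch below computes CER as character-level edit distance divided by reference length, then the relative reduction between two systems. The function names and the toy strings are assumptions made for this example; they are not taken from the paper, and the paper's own scoring setup may differ.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two character sequences."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # cost of deleting the first i reference characters
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(ref, hyp):
    """CER = (substitutions + deletions + insertions) / reference length."""
    if not ref:
        raise ValueError("reference must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


def relative_reduction(baseline, improved):
    """Fraction by which `improved` lowers the `baseline` error rate."""
    return (baseline - improved) / baseline


# Toy strings for illustration only (hypothetical, not results from the paper).
reference = "abcdefghij"
baseline_hyp = "abxdefgyij"   # two substitutions
improved_hyp = "abxdefghij"   # one substitution

baseline_cer = character_error_rate(reference, baseline_hyp)
improved_cer = character_error_rate(reference, improved_hyp)
print(baseline_cer, improved_cer, relative_reduction(baseline_cer, improved_cer))
```

A relative reduction is always measured against the baseline error rate, so the same absolute gain yields a larger relative figure when the baseline error is lower.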

Similar Papers

Geo-location dependent deep neural network acoustic model for speech recognition
Guoli Ye ... Yifan Gong
01 Mar 2016

Speech recognition system based on deep neural network acoustic modeling for low resourced language-Amharic
Eshete Derb Emiru ... Awet Fesseha
09 Nov 2019

Cross-Lingual Language Modeling for Low-Resource Speech Recognition
Ping Xu ... P Fung
IEEE Transactions on Audio, Speech, and Language Processing | Vol. 21
01 Jun 2013

Multi-domain adversarial training of neural network acoustic models for distant speech recognition
Seyedmahdad Mirsamadi ... John H.L Hansen
Speech Communication | Vol. 106
03 Nov 2018
