Evaluation of SPLICE on the Aurora 2 and 3 tasks

Jasha Droppo, Li Deng, Alex Acero

doi:10.21437/icslp.2002-6

Abstract

Stereo-based Piecewise Linear Compensation for Environments (SPLICE) is a general framework for removing distortions from noisy speech cepstra. It contains a non-parametric model for cepstral corruption, which is learned from two channels of training data. We evaluate SPLICE on both the Aurora 2 and 3 tasks. These tasks consist of digit sequences in five European languages. Noise corruption is both synthetic (Aurora 2) and realistic (Aurora 3). For both the Aurora 2 and 3 tasks, we use the same training and testing procedure provided with the corpora. By holding the back-end constant, we ensure that any increase in word accuracy is due to our front-end processing techniques. In the Aurora 2 task, we achieve a 76.86% average decrease in word error rate with clean acoustic models, and an overall improvement of 62.63%. For the Aurora 3 task, we achieve a 75.06% average decrease in word error rate for the high-mismatch experiment, and an overall improvement of 47.19%.
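The abstract does not give the compensation equations, so the following is only a minimal sketch of the bias-only form in which SPLICE is commonly described: a Gaussian mixture over noisy cepstra selects regions, and each region gets a correction vector learned from stereo (clean, noisy) frame pairs. The mixture parameters are assumed to be already trained, the function names (`posteriors`, `train_biases`, `enhance`) are hypothetical, and the toy data is synthetic rather than Aurora data.

```python
import numpy as np

def posteriors(y, weights, means, variances):
    """Posterior p(s | y) of each diagonal-Gaussian component, shape (N, S)."""
    diff = y[:, None, :] - means[None, :, :]                     # (N, S, D)
    log_lik = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                      + np.sum(np.log(2 * np.pi * variances), axis=1))
    log_p = np.log(weights) + log_lik
    log_p -= log_p.max(axis=1, keepdims=True)                    # numerical stability
    p = np.exp(log_p)
    return p / p.sum(axis=1, keepdims=True)

def train_biases(clean, noisy, weights, means, variances):
    """Per-component correction: posterior-weighted mean of (clean - noisy)."""
    post = posteriors(noisy, weights, means, variances)          # (N, S)
    num = post.T @ (clean - noisy)                               # (S, D)
    den = post.sum(axis=0)[:, None]                              # (S, 1)
    return num / np.maximum(den, 1e-12)

def enhance(noisy, biases, weights, means, variances):
    """Estimate clean cepstra as noisy + sum_s p(s | noisy) * bias_s."""
    post = posteriors(noisy, weights, means, variances)
    return noisy + post @ biases

# Toy stereo data (assumption): two well-separated regions, each with its own
# constant distortion between the clean and noisy channels.
rng = np.random.default_rng(0)
weights = np.array([0.5, 0.5])
means = np.array([[0.0, 0.0], [10.0, 10.0]])
variances = np.ones((2, 2))
noisy = np.vstack([rng.normal(0.0, 0.1, (50, 2)),
                   10.0 + rng.normal(0.0, 0.1, (50, 2))])
offsets = np.vstack([np.tile([1.0, 1.0], (50, 1)),
                     np.tile([-2.0, -2.0], (50, 1))])
clean = noisy - offsets

biases = train_biases(clean, noisy, weights, means, variances)
enhanced = enhance(noisy, biases, weights, means, variances)
```

Because the correction is applied to the features before recognition, the back-end can stay fixed, which is the experimental setup the abstract describes.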

