Development of a spontaneous large vocabulary speech recognition system for Qatari Arabic

Mohamed Elmahdy

doi:10.5339/qfarf.2013.ictp-053

Abstract

In this work, we develop a spontaneous large vocabulary speech recognition system for Qatari Arabic (QA). A major problem with dialectal Arabic speech recognition is due to the sparsity of speech resources. So, we propose an Automatic Speech Recognition (ASR) framework to jointly use Modern Standard Arabic (MSA) data and QA data to improve acoustic and language modeling by orthographic normalization, cross-dialectal phone mapping, data sharing, and acoustic model adaptation. A wide-band speech corpus has been developed for QA. The corpus consists of 15 hours speech data collected from different TV series and talk-show programs. The corpus was manually segmented and transcribed. A QA tri-gram language model (LM) was linearly interpolated with a large MSA LM in order to decrease Out-Of-Vocabulary (OOV) rate and to improve perplexity. The vocabulary consists of 21K words extracted from the QA training set with additional 256K MSA vocabulary. The acoustic model (AM) was trained with a data pool of QA data and additional 60 hours of MSA data. In order to boost the contribution of QA data, Maximum-A-Posteriori (MAP) adaptation was applied on the resulted AM using only the QA data, effectively increasing the weight of dialectal acoustic features in the final cross-lingual model. All trainings were performed with Maximum Mutual Information Estimation (MMIE) and with Speaker Adaptive Training (SAT) applied on top of MMIE. Our proposed approach achieves more than 16% relative reduction in WER on QA testing set compared to a baseline system trained with only QA data. This work was funded by a grant from the Qatar National Research Fund under its National Priorities Research Program (NPRP) award number NPRP 09-410-1-069. Reported experimental work was performed at Qatar University in collaboration with University of Illinois.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Development of a spontaneous large vocabulary speech recognition system for Qatari Arabic

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Hybrid pronunciation modeling for Arabic large vocabulary speech recognition
Mohamed Elmahdy ... Mark Hasegawa-Johnson
-
Mohamed Elmahdy, et. al.Mohamed Elmahdy ... Mark Hasegawa-Johnson
01 Jan 2012
01 Jan 2012

Automatic long audio alignment for conversational Arabic speech
Mohamed Elmahdy
-
Mohamed ElmahdyMohamed Elmahdy
01 Jan 2013
01 Jan 2013

Challenges and Techniques for Dialectal Arabic Speech Recognition and Machine Translation
Mohamed Elmahdy
Qatar Foundation Annual Research Forum Proceedings | VOL. 2011
Mohamed ElmahdyMohamed Elmahdy
01 Nov 2011
Qatar Foundation Annual Research Forum Proceedings | VOL. 2011

Capitalising on North American speech resources for the development of a South African English large vocabulary speech recognition system
Herman Kamper ... Thomas Niesler
Computer Speech & Language | VOL. 28
Herman Kamper, et. al.Herman Kamper ... Thomas Niesler
06 May 2014
Computer Speech & Language | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Development of a spontaneous large vocabulary speech recognition system for Qatari Arabic

Abstract

Talk to us

Similar Papers