Challenges and Techniques for Dialectal Arabic Speech Recognition and Machine Translation

Mohamed Elmahdy

doi:10.5339/qfarf.2011.cso5

Abstract

Abstract In this research, we propose novel techniques to improve automatic speech recognition (ASR) and statistical machine translation (SMT) for dialectal Arabic. Since dialectal Arabic speech resources are very sparse, we describe how existing Modern Standard Arabic (MSA) speech data can be applied to dialectal Arabic acoustic modeling. Our assumption is that MSA is always a second language for all Arabic speakers, and in most cases we can identify the original dialect of a speaker even though he is speaking MSA. Hence, an acoustic model trained with sufficient number of MSA speakers will implicitly model the acoustic features for the different Arabic dialects. Since, MSA and dialectal Arabic do not share the same phoneme set, we propose phoneme sets normalization in order to crosslingually use MSA in dialectal Arabic ASR. After normalization, we applied state-of-the-art acoustic model adaptation techniques to adapt MSA acoustic models with little amount of dialectal speech. Results indicate significant decrease in word error rate (WER). Since it is hard to phonetically transcribe large amounts of dialectal Arabic speech, we studied the use of graphemic acoustic models where phonetic transcription is approximated to be word letters instead of phonemes. A large number of Gaussians in the Gaussian mixture model is used to model missing vowels. In the case of graphemic adaptation, significant decrease in WER was also observed. The approaches were applied with Egyptian Arabic and Levantine Arabic. The reported experimental work was performed while the first author was at the German University in Cairo in collaboration with Ulm University. This work will be extended at Qatar University in collaboration with the University of Illinois to cover ASR and SMT for Qatari broadcast TV. We propose novel algorithms for learning the similarities and differences between Qatari Arabic (QA) and MSA, for purposes of automatic speech translation and speech-to-text machine translation, building on our own definitive research in the relative phonological, morphological, and syntactic systems of QA and MSA, and in the application of translation to interlingual semantic parse. Furthermore, we propose a novel efficient and accurate speech-to-text translation system, building on our research in landmark-based and segment-based ASR.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Challenges and Techniques for Dialectal Arabic Speech Recognition and Machine Translation

Abstract

Talk to us

Similar Papers

More From: Qatar Foundation Annual Research Forum Proceedings

Lead the way for us

Journal: Qatar Foundation Annual Research Forum Proceedings	Publication Date: Nov 1, 2011
Citations: 2

Similar Papers

Development of a spontaneous large vocabulary speech recognition system for Qatari Arabic
Mohamed Elmahdy
-
Mohamed ElmahdyMohamed Elmahdy
01 Jan 2013
01 Jan 2013

Cross-lingual acoustic modeling for dialectal Arabic speech recognition
Mohamed Elmahdy ... Slim Abdennadher
-
Mohamed Elmahdy, et. al.Mohamed Elmahdy ... Slim Abdennadher
26 Sep 2010
26 Sep 2010

Diacritics restoration for Arabic dialect texts
S Harrat ... K Meftouh
-
S Harrat, et. al.S Harrat ... K Meftouh
25 Aug 2013
25 Aug 2013

Analysis of Dialectal Influence in Pan-Arabic ASR
...
-
, et. al. ...
01 Aug 2011
01 Aug 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Challenges and Techniques for Dialectal Arabic Speech Recognition and Machine Translation

Abstract

Talk to us

Similar Papers

More From: Qatar Foundation Annual Research Forum Proceedings