Automatic Detection of Shadda in Modern Standard Arabic Continuous Speech

Ammar Al-Sabri,Fadhilah Rosdi,Afzan Adam

doi:10.18517/ijaseit.8.4-2.6813

Abstract

The presence of diacritics Shadda in Arabic continuous speech may lead to the reduction of the accuracy of automatic Word Boundary Detection (WBD), which caused one word will be wrongly detected as two words. Therefore, this will affect the accuracy of Automatic Speech Recognition (ASR), if it is based on WBD. Shadda is one of the essential characteristics of the Arabic language which represents a consonant doubling. In this paper, a proposed method of automatic detection of Shadda in Modern Standard Arabic (MSA) continuous speech was introduced to improve the accuracy of WBD in MSA continuous speech. The prosodic features namely Short Time Energy (STE), Fundamental Frequency and Intensity were investigated for its ability as Shadda pattern detection in continuous MSA speech. We have analyzed the proposed features by implementing a separated algorithm for each feature to detect Shadda pattern automatically. In addition, a new proposed method which is a combination of STE and Intensity were introduced. The dataset in this work is a collection of 1-hour TV broadcast news from Aljazeera Arabic TV channel for 2018 - broadcasters. We found that the Shadda pattern is very similar to unvoiced regions of speech, and this represents a big challenge for the improvement of WDB using Shadda. Results showed that the detection of Shadda using Short Time Energy and Intensity outperforms the Fundamental frequency with 55% of accuracy. Intensity achieved 71.5% in accuracy. In addition, a combination between Intensity & STE features was performed and achieved good results with 67.15% in accuracy. The number of false positive too has been reduced compared to Intensity alone.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal on Advanced Science, Engineering and Information Technology	Publication Date: Sep 30, 2018
Citations: 1	License type: cc-by-sa

R Discovery Prime

R Discovery Prime

Automatic Detection of Shadda in Modern Standard Arabic Continuous Speech

Abstract

Talk to us

Similar Papers

More From: International Journal on Advanced Science, Engineering and Information Technology

Lead the way for us

Similar Papers

Word boundary estimation for continuous speech using higher order statistical features
Vijayakrishna Naganoor ... Krishnan Chemmangat
-
Vijayakrishna Naganoor, et. al.Vijayakrishna Naganoor ... Krishnan Chemmangat
01 Nov 2016
01 Nov 2016

Graphical models for the recognition of Arabic continuous speech based triphones modeling
Elyes Zarrouk ... Yassine Benayed
-
Elyes Zarrouk, et. al.Elyes Zarrouk ... Yassine Benayed
01 Jun 2015
01 Jun 2015

TAMEEM V1.0: speakers and text independent Arabic automatic continuous speech recognizer
Mohammad A M Abushariah
International Journal of Speech Technology | VOL. 20
Mohammad A M AbushariahMohammad A M Abushariah
24 Feb 2017
International Journal of Speech Technology | VOL. 20

Improving the Arabic pronunciation dictionary for phone and word recognition with linguistically-based pronunciation rules
Fadi Biadsy ... Julia Hirschberg
-
Fadi Biadsy, et. al.Fadi Biadsy ... Julia Hirschberg
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic Detection of Shadda in Modern Standard Arabic Continuous Speech

Abstract

Talk to us

Similar Papers

More From: International Journal on Advanced Science, Engineering and Information Technology