Abstract
In recent years, the use of rhythm-based features in speech processing systems has received growing interest. This approach draws on a wide array of rhythm metrics developed to capture speech timing differences between and within languages. However, the reliability of rhythm metrics is increasingly being called into question. In this paper, we propose two modifications to this approach. First, we describe a model based on auditory cues that simulate the external, middle and inner parts of the ear. We evaluate this model in experiments discriminating between native and non-native Arabic speech. Data are from the West Point Arabic Speech Corpus; testing is done on standard classifiers based on Gaussian Mixture Models (GMMs), Support Vector Machines (SVMs) and a hybrid GMM/SVM. Results show that the auditory-based model consistently outperforms a traditional rhythm-metric approach that includes both duration- and intensity-based metrics. Second, we propose a framework that combines the rhythm metrics and the auditory-based cues using a Logistic Regression (LR) method that can optimize feature combination. Further results show that the proposed LR-based method improves performance over the standard classifiers in discriminating between native and non-native Arabic speech.
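The classifier setup named in the abstract can be sketched in outline. The following is a minimal illustration, not the paper's implementation: it trains per-class GMMs and an SVM on synthetic two-dimensional features standing in for the rhythm-metric and auditory cues, then fuses the two decision scores with logistic regression. All data, parameters, and variable names are assumptions for demonstration only.

```python
# Illustrative sketch (assumed setup, not the paper's code): per-class
# GMMs and an SVM as base classifiers, with logistic regression fusing
# their scores. Synthetic 2-D features stand in for the real cues.
import numpy as np
from sklearn.mixture import GaussianMixture
from sklearn.svm import SVC
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
# Synthetic "native" (class 0) vs "non-native" (class 1) feature vectors.
X0 = rng.normal(0.0, 1.0, size=(200, 2))
X1 = rng.normal(1.5, 1.0, size=(200, 2))
X = np.vstack([X0, X1])
y = np.array([0] * 200 + [1] * 200)

# One GMM per class; the decision score is the log-likelihood ratio.
gmm0 = GaussianMixture(n_components=2, random_state=0).fit(X0)
gmm1 = GaussianMixture(n_components=2, random_state=0).fit(X1)
llr = gmm1.score_samples(X) - gmm0.score_samples(X)

# A discriminative SVM trained on the same features.
svm = SVC(random_state=0).fit(X, y)
svm_score = svm.decision_function(X)

# Logistic regression fuses the two scores into a single decision,
# learning an optimal weighting of the base-classifier outputs.
fused = np.column_stack([llr, svm_score])
lr = LogisticRegression().fit(fused, y)
acc = lr.score(fused, y)
print(f"fused training accuracy: {acc:.2f}")
```

In practice the fusion stage would combine scores from the rhythm-metric stream and the auditory-cue stream, with the logistic regression weights estimated on held-out data rather than the training set as in this toy example.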