Detecting glides and their place of articulation using speech-related measurements in a feature-cue-based model

Adrian Y Cho,Stefanie Shattuck-Hufnagel,Jeung-Yoon Choi,Anita Y Liu

doi:10.1121/1.4987646

Abstract

An algorithm was developed for detecting glides (/w/, /j/, /r/, /l/, or /h/) in spoken English and detecting their place of articulation using an analysis of acoustic landmarks [Stevens 2002]. The system uses Gaussian mixture models (GMMs) trained on a subset of the TIMIT speech database annotated with acoustic landmarks. To characterize the glide tokens extracted from the speech samples, the following speech-related measurements were calculated: energy in four spectral bands (E1-E4), formant frequencies (F1-F4), and the time derivatives of E1-E4 (E1’-E4’); the fundamental frequency (F0) and magnitude difference of harmonics (H1-H2, H1-H4) were also included. GMMs were then trained on a subset of the tokens to learn the characteristics of each category for two distinct tasks: distinguishing glide landmarks from the set of all landmark types (identification task), and determining the place of articulation given a glide landmark (categorization task). The classifier used the maximum posterior probability of a speech sample conditioned on each of the trained GMMs. The performance of the algorithm was evaluated with median F-scores, and results suggest that the measurements at acoustic landmarks provide salient cues to glide detection and categorization.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Detecting glides and their place of articulation using speech-related measurements in a feature-cue-based model

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

PENGUCAPAN MAKHRAJ DARI UNIT BUNYI TERKECIL HURUF HIJAIYAH BERDASARKAN FREKUENSI DASAR DAN FREKUENSI FORMANT UNTUK MEDIA PEMBELAJARAN MEMBACA ALQURAN
Christanto Sinambela ... Muhammad Subali
ALQALAM | VOL. 32
Christanto Sinambela, et. al.Christanto Sinambela ... Muhammad Subali
31 Dec 2015
ALQALAM | VOL. 32

On the relation between locus equations and subglottal resonances.
Steven M Lulich
The Journal of the Acoustical Society of America | VOL. 124
Steven M LulichSteven M Lulich
01 Oct 2008
The Journal of the Acoustical Society of America | VOL. 124

Relative spectral change and formant transitions as cues to labial and alveolar place of articulation.
Michael F Dorman ... Philipos C Loizou
The Journal of the Acoustical Society of America | VOL. 100
Michael F Dorman, et. al.Michael F Dorman ... Philipos C Loizou
01 Dec 1996
The Journal of the Acoustical Society of America | VOL. 100

Formant frequency prediction from MFCC vectors in noisy environments
Jonathan Darch ... Ben Milner
-
Jonathan Darch, et. al.Jonathan Darch ... Ben Milner
04 Sep 2005
04 Sep 2005

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detecting glides and their place of articulation using speech-related measurements in a feature-cue-based model

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America