Modelling of Amplitude Modulated Vocal Fry Glottal Area Waveforms Using an Analysis-by-Synthesis Approach

Vinod Devaraj, Philipp Aichinger

doi:10.3390/app11051990

Open Access

Abstract

The characterization of voice quality is important for the diagnosis of voice disorders. Vocal fry is a voice quality which is traditionally characterized by a low frequency and a long closed phase of the glottis. However, we also observed amplitude modulated vocal fry glottal area waveforms (GAWs) without long closed phases. These form the positive group, which we modelled using an analysis-by-synthesis approach. Both natural and synthetic GAWs are modelled. The negative group consists of euphonic, i.e., normophonic, GAWs. The analysis-by-synthesis approach fits two modelled GAWs to each input GAW. One modelled GAW is modulated to replicate the amplitude and frequency modulations of the input GAW, and the other modelled GAW is unmodulated. The modelling errors of the two modelled GAWs are used to classify the input GAWs into the positive and negative groups with a simple support vector machine (SVM) classifier with a linear kernel. For all vocal fry GAWs, the modelling error obtained using the modulated model is smaller than the modelling error obtained using the unmodulated model. Using the two modelling errors as predictors for classification, no false positives or false negatives are obtained. To further distinguish the subtypes of amplitude modulated vocal fry GAWs, the entropy of the modulator's power spectral density and the modulator-to-carrier frequency ratio are obtained.
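The two subtype descriptors named in the abstract can be illustrated with a minimal sketch. It assumes the modulator and carrier have already been separated, and it uses a plain periodogram and a peak-picking frequency estimate; this excerpt does not state the estimators the authors used, so the function names `spectral_entropy` and `dominant_frequency`, the sampling rate, and the test signals are all hypothetical.

```python
import numpy as np

def spectral_entropy(x):
    """Shannon entropy (in bits) of the normalized periodogram of x."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()                       # remove the DC offset
    psd = np.abs(np.fft.rfft(x)) ** 2      # raw periodogram (assumed PSD estimate)
    total = psd.sum()
    if total == 0.0:
        return 0.0                         # a constant signal has no spectral spread
    p = psd / total                        # treat the PSD as a probability mass
    p = p[p > 0]                           # skip empty bins to avoid log2(0)
    return float(-np.sum(p * np.log2(p)))

def dominant_frequency(x, fs):
    """Frequency (Hz) of the largest non-DC spectral peak of x."""
    x = np.asarray(x, dtype=float)
    spectrum = np.abs(np.fft.rfft(x - x.mean()))
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    return float(freqs[np.argmax(spectrum)])

# Hypothetical signals: a 100 Hz carrier, a regular 50 Hz modulator,
# and an irregular (white-noise) modulator for comparison.
fs = 4000.0
t = np.arange(4000) / fs
carrier = 0.5 * (1.0 - np.cos(2 * np.pi * 100.0 * t))
regular_mod = 1.0 + 0.5 * np.cos(2 * np.pi * 50.0 * t)
rng = np.random.default_rng(0)
irregular_mod = 1.0 + 0.2 * rng.standard_normal(t.size)

ratio = dominant_frequency(regular_mod, fs) / dominant_frequency(carrier, fs)
h_regular = spectral_entropy(regular_mod)
h_irregular = spectral_entropy(irregular_mod)
print(f"modulator-to-carrier frequency ratio: {ratio:.2f}")
print(f"entropy, regular modulator:   {h_regular:.3f} bits")
print(f"entropy, irregular modulator: {h_irregular:.3f} bits")
```

In this toy setup a single-tone modulator concentrates its power in one frequency bin and so has an entropy close to zero, while an irregular modulator spreads its power across many bins and has a much higher entropy.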

Highlights

We identify vocal fry based on the impulsivity of voice samples, i.e., the auditory attribute associated with the separate perception of the glottal cycles.
This paper investigates different types of amplitude modulated vocal fry glottal area waveforms (GAWs).
These GAWs are modelled using an analysis-by-synthesis approach and distinguished automatically from euphonic GAWs based on their modelling errors.
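The idea of comparing two modelling errors per input GAW can be sketched as follows. This is not the authors' model: as a stand-in, the unmodulated model is a least-squares fit of carrier harmonics and the modulated model adds sidebands at the modulator frequency. The carrier and modulator frequencies are assumed to be known, and the names `fourier_basis`, `relative_rms_error` and `modelling_errors` are hypothetical.

```python
import numpy as np

def fourier_basis(t, freqs):
    """Design matrix with a constant column plus cos/sin columns per frequency."""
    cols = [np.ones_like(t)]
    for f in freqs:
        cols.append(np.cos(2 * np.pi * f * t))
        cols.append(np.sin(2 * np.pi * f * t))
    return np.column_stack(cols)

def relative_rms_error(x, basis):
    """RMS of the least-squares residual divided by the RMS of x."""
    coef, *_ = np.linalg.lstsq(basis, x, rcond=None)
    resid = x - basis @ coef
    return float(np.sqrt(np.mean(resid ** 2)) / np.sqrt(np.mean(x ** 2)))

def modelling_errors(gaw, fs, f_carrier, f_mod, n_harm=3):
    """Return (unmodulated error, modulated error) for one input waveform."""
    t = np.arange(len(gaw)) / fs
    harmonics = [k * f_carrier for k in range(1, n_harm + 1)]
    # Amplitude modulation places sidebands at +/- f_mod around DC and each harmonic.
    sidebands = {abs(f + s * f_mod) for f in [0.0] + harmonics for s in (-1, 1)}
    sidebands = sorted(sidebands - set(harmonics) - {0.0})
    err_unmod = relative_rms_error(gaw, fourier_basis(t, harmonics))
    err_mod = relative_rms_error(gaw, fourier_basis(t, harmonics + sidebands))
    return err_unmod, err_mod

# Hypothetical inputs: a periodic pulse train and an amplitude modulated copy.
fs = 4000.0
t = np.arange(4000) / fs
euphonic = 0.5 * (1.0 - np.cos(2 * np.pi * 100.0 * t))
modulated = (1.0 + 0.5 * np.cos(2 * np.pi * 50.0 * t)) * euphonic

eu_unmod, eu_mod = modelling_errors(euphonic, fs, 100.0, 50.0)
am_unmod, am_mod = modelling_errors(modulated, fs, 100.0, 50.0)
print(f"euphonic:  unmodulated={eu_unmod:.3f}  modulated={eu_mod:.3f}")
print(f"modulated: unmodulated={am_unmod:.3f}  modulated={am_mod:.3f}")
```

For the periodic input both errors are near zero, whereas for the amplitude modulated input only the modulated model fits well. A pair of errors like this is the kind of two-dimensional predictor that a linear classifier could separate; the classifier itself is not shown here.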

Summary

Introduction

Vocal fry is a voice quality which is synonymously referred to as creaky voice, pulse register, glottal fry or creak [1,2,3]. The term vocal fry is also used more narrowly to designate a subtype of creaky voice [4]; the remaining subtypes are multipulsed voice, aperiodic voice, nonconstricted creak and tense/pressed voice. Vocal fry is mainly characterized by a low fundamental frequency, which gives an auditory impression of “a stick being run along a railing”, “popping of corn” or “cooking of food on a pan” [1,2,5]. Subglottal air pressure and air flow were found to be lower in vocal fry than in modal registers [2].

Methods

Results

Discussion

Conclusion

Journal: Applied Sciences
Publication Date: Feb 24, 2021
Citations: 1
License type: CC BY 4.0

Similar Papers

Vocal Fold Vibrations: High‐Speed Imaging, Kymography, and Acoustic Analysis: A Preliminary Report
Hans Larsson ... Britta Hammarberg
The Laryngoscope | VOL. 110
01 Dec 2000

Vibratory onset and offset times in children: A laryngeal imaging study
Rita R Patel
International Journal of Pediatric Otorhinolaryngology | VOL. 87
20 May 2016

Influence of spatial camera resolution in high-speed videoendoscopy on laryngeal parameters.
Patrick Schlegel ... Michael Döllinger
PLOS ONE | VOL. 14
22 Apr 2019

Intersegmenter Variability in High-Speed Laryngoscopy-Based Glottal Area Waveform Measures.
Youri Maryn ... Pablo Gomez
The Laryngoscope | VOL. 130
16 Dec 2019
