Accurate glottal model parametrization by integrating audio and high-speed endoscopic video data

Carlo Drioli,Gian Luca Foresti

doi:10.1007/s11760-013-0597-0

Abstract

The aim of this paper is to evaluate the effectiveness of using video data for voice source parametrization in the representation of voice production through physical modeling. Laryngeal imaging techniques can be effectively used to obtain vocal fold video sequences and to derive time patterns of relevant glottal cues, such as folds edge position or glottal area. In many physically based numerical models of the vocal folds, these parameters are estimated from the inverse filtered glottal flow waveform, obtained from audio recordings of the sound pressure at lips. However, this model inversion process is often problematic and affected by accuracy and robustness issues. It is here discussed how video analysis of the fold vibration might be effectively coupled to the parametric estimation algorithms based on voice recordings, to improve accuracy and robustness of model inversion.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Accurate glottal model parametrization by integrating audio and high-speed endoscopic video data

Abstract

Talk to us

Similar Papers

More From: Signal, Image and Video Processing

Lead the way for us

Journal: Signal, Image and Video Processing	Publication Date: Jan 7, 2014
Citations: 23

Similar Papers

A Deep Learning Enhanced Novel Software Tool for Laryngeal Dynamics Analysis.
Andreas M Kist ... Michael Döllinger
Journal of Speech, Language, and Hearing Research | VOL. 64
Andreas M Kist, et. al.Andreas M Kist ... Michael Döllinger
17 May 2021
Journal of Speech, Language, and Hearing Research | VOL. 64

Glottis Analysis Tools (Kist et al., 2021)
...
-
, et. al. ...
17 May 2021
Glottis Analysis Tools (Kist et al., 2021)
...

The effect of high-speed videoendoscopy configuration on reduced-order model parameter estimates by Bayesian inference.
Jonathan J Deng ... Sean D Peterson
The Journal of the Acoustical Society of America | VOL. 146
Jonathan J Deng, et. al.Jonathan J Deng ... Sean D Peterson
01 Aug 2019
The Journal of the Acoustical Society of America | VOL. 146

Glottal Area Waveform Analysis of Benign Vocal Fold Lesions before and after Surgery
J Pieter Noordzij ... Peak Woo
Annals of Otology, Rhinology & Laryngology | VOL. 109
J Pieter Noordzij, et. al.J Pieter Noordzij ... Peak Woo
01 May 2000
Annals of Otology, Rhinology & Laryngology | VOL. 109

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Accurate glottal model parametrization by integrating audio and high-speed endoscopic video data

Abstract

Talk to us

Similar Papers

More From: Signal, Image and Video Processing