Estimating vocal tract length by minimizing non-uniformity of cross-sectional area

Stefon Flego

doi:10.1121/2.0001000

Abstract

Previous approaches to estimation of vocal tract length (VTL) differ in what information about the speaker is assumed to be known and which formants are treated as better predictors of VTL. However, they are alike in modeling formant frequencies as deviations from the resonances of a uniform tube, and in allocating equal credibility to all vowel spectra as predictors of length. The latter may be problematic, as phonotactic asymmetries in vowel quality in a data set can draw formants’ mean frequencies away from their underlying resonances, skewing VTL estimates. Herein, an additional parameter is proposed privileging vowel spectra that approximate the resonance characteristics of a uniform tube. The metric for this proximity is standard variance in Phi (SigmaPhi) across a vowel spectrum. In this study, formant data from 32 participants were analyzed using the estimators compared in Lammert & Narayanan (2015). Each estimator was run using frequencies from all vowel spectra, as well as from spectra with low values of SigmaPhi. As the threshold for maximum SigmaPhi was lowered, VTL estimates became tighter and the estimators typically converged on a length. This approach requires no labeling or identification of vowel type, and is therefore easily replicable across languages and speakers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Estimating vocal tract length by minimizing non-uniformity of cross-sectional area

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Towards higher precision in vocal tract length estimation
Stefon M Flego
The Journal of the Acoustical Society of America | VOL. 144
Stefon M FlegoStefon M Flego
01 Sep 2018
The Journal of the Acoustical Society of America | VOL. 144

Vocal tract length estimation for voiced and whispered speech using gammachirp filterbank
Toshio Irino ... Erika Okamoto
-
Toshio Irino, et. al.Toshio Irino ... Erika Okamoto
01 Oct 2013
01 Oct 2013

Vocal Tract Length Estimation Using Accumulated Means of Formants and Its Effects on Speaker-Normalization
Tadashi Sakata ... Naomitsu Ikeda
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29
Tadashi Sakata, et. al.Tadashi Sakata ... Naomitsu Ikeda
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29

Continuous estimation of VTL from vowels using a linearly VTL-covariant speech feature
Christian Feldbauer ... Jessica J Monaghan
The Journal of the Acoustical Society of America | VOL. 123
Christian Feldbauer, et. al.Christian Feldbauer ... Jessica J Monaghan
01 May 2008
The Journal of the Acoustical Society of America | VOL. 123

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Estimating vocal tract length by minimizing non-uniformity of cross-sectional area

Abstract

Talk to us

Similar Papers