Vocal tract length estimation for voiced and whispered speech using gammachirp filterbank

Toshio Irino,Hideki Kawahara,Ryuichi Nisimura,Erika Okamoto

doi:10.1109/apsipa.2013.6694131

Vocal tract length estimation for voiced and whispered speech using gammachirp filterbank

Toshio Irino, Hideki Kawahara + Show 2 more

https://doi.org/10.1109/apsipa.2013.6694131

Copy DOI

Publication Date: Oct 1, 2013

Citations: 10

Affiliation: Wakayama University

#Vocal Tract Length #Estimation Of Vocal Tract Length + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In this paper, we demonstrate an auditory spectrogram based on a dynamic compressive gammachirp filterbank (GCFB) that enables accurate and robust estimation of vocal tract length (VTL) for both voiced and whispered speech. Normalized VTLs of 21 speakers were derived by using the least squared analysis of their VTL ratios (for all permutations, 420 = 21P20) which were estimated by minimizing spectral distances in the auditory spectrograms. The frequency range was selected in the calculation and the range between 500 and 5000 (Hz) was most reasonable for both speech mode. The method based on GCFB was better than that based on the mel-frequency filterbank (MFFB). The estimated VTLs were compared with the VTL data measured in MRI to confirm the reliability.

Full Text