A reliable technique for detecting the second subglottal resonance and its use in cross-language speaker adaptation

Shizhen Wang,Steven M Lulich,Abeer Alwan

doi:10.21437/interspeech.2008-383

Abstract

In previous work [1], we proposed a speaker adaptation technique based on the second subglottal resonance (Sg2), which showed good performance relative to vocal tract length normalization (VTLN). In this paper, we propose a more reliable algorithm for automatically estimating Sg2 from speech signals. The algorithm is calibrated on children’s speech data collected simultaneously with accelerometer recordings from which Sg2 frequencies can be directly measured. To investigate whether Sg2 frequencies are independent of speech content and language, we perform a cross-language study with bilingual Spanish-English children. The study verifies that Sg2 is approximately constant for a given speaker and thus can be a good candidate for limited data speaker normalization and cross-language adaptation. We then present a cross-language speaker normalization method based on Sg2, which is computationally more efficient than maximum-likelihood based VTLN, and performs more robustly than VTLN.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A reliable technique for detecting the second subglottal resonance and its use in cross-language speaker adaptation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Automatic detection of the second subglottal resonance and its application to speaker normalization
Shizhen Wang ... Steven M Lulich
The Journal of the Acoustical Society of America | VOL. 126
Shizhen Wang, et. al.Shizhen Wang ... Steven M Lulich
01 Dec 2009
The Journal of the Acoustical Society of America | VOL. 126

Speaker normalization in noisy environments using subglottal resonances
Harish Arsikere ... Abeer Alwan
The Journal of the Acoustical Society of America | VOL. 134
Harish Arsikere, et. al.Harish Arsikere ... Abeer Alwan
01 Nov 2013
The Journal of the Acoustical Society of America | VOL. 134

Relations among subglottal resonances, vowel formants, and speaker height, gender, and native language.
Harish Arsikere ... John R Morton
The Journal of the Acoustical Society of America | VOL. 128
Harish Arsikere, et. al.Harish Arsikere ... John R Morton
01 Oct 2010
The Journal of the Acoustical Society of America | VOL. 128

Speaker adaptation with all-pass transforms
John Mcdonough ... Alex Waibel
Speech Communication | VOL. 42
John Mcdonough, et. al.John Mcdonough ... Alex Waibel
14 Oct 2003
Speech Communication | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A reliable technique for detecting the second subglottal resonance and its use in cross-language speaker adaptation

Abstract

Talk to us

Similar Papers