A novel formant tracker is proposed using the mixture models oft densities (tMMs) for vocal tract resonance frequencies estimated with a hybrid linear prediction (HLP) method. The hybrid integercycle pitch-synchronous linear prediction (LP) analysis improves the frequency resolution over voiced segments, leading to closer formant estimates than those provided by other LP methods. In conjunction with HLP, formant trajectories are shown to be more nearly tracked by tMMs than by Gaussian density models. Tests with synthetic voiced and whispered speech as well as with an annotated database confirm better performance than either tMM clustering after formant estimation based on different time-frequency representations or tracking after different LP methods.
Read full abstract