Initialization, Training, and Context-Dependency in HMM-Based Formant Tracking

D.T Toledano,J.G Villardebo,L.H Gomez

doi:10.1109/tsa.2005.857805

Abstract

This paper presents an algorithm for formant tracking using HMMs and analyzes the influence of HMM initialization, training and context-dependency on the accuracy of the formant tracks obtained with the HMMs. Formant trackers usually include two different phases: one in which the speech is analyzed and formant candidates are obtained, and another in which, by imposing different constraints, the most likely formants are chosen. While the first stage usually relies on standard spectrum estimation techniques, the second stage has evolved notably in the recent years. Traditionally the second phase tries to impose continuity constraints on the formant selection process. Lately there has been ongoing research to include phonemic knowledge in the second stage to make formant tracking more reliable. In order to incorporate phonemic knowledge newer approaches make use of the orthographic transcription of the speech utterance. From the orthographic transcription, the phonemic transcription is obtained, and from this and the speech itself a phonemic segmentation can be obtained. This phonemic segmentation, along with the phonemic transcription and some knowledge of the nominal formant positions for the different phonemes provides extra information that can be used to obtain more accurate formant tracks. This paper presents a complete HMM-based data-driven algorithm for formant tracking suitable to combine different levels of acoustic and phonemic information. A detailed analysis on the performance of this algorithm is discussed for: different initialization strategies using different levels of knowledge, different degrees of training, and context-independent and dependent HMMs. Experimental speaker-dependent results show that the efficient use of phonemic information in HMM training and context-dependent modeling significantly reduces the formant tracking error rate especially for formants $ F_2$ and $ F_3$ .

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Initialization, Training, and Context-Dependency in HMM-Based Formant Tracking

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech and Language Processing	Publication Date: Mar 1, 2006
Citations: 43

Similar Papers

Formant tracking using segmental phonemic information
Minkyu Lee ... Joseph Olive
-
Minkyu Lee, et. al.Minkyu Lee ... Joseph Olive
05 Sep 1999
05 Sep 1999

Formant tracking using context-dependent phonemic information
Minkyu Lee ... J Van Santen
IEEE Transactions on Speech and Audio Processing | VOL. 13
Minkyu Lee, et. al. Minkyu Lee ... J Van Santen
01 Sep 2005
IEEE Transactions on Speech and Audio Processing | VOL. 13

Allpass Modeling of Phase Spectrum of Speech Signals for Formant Tracking
Karthika Vijayan ... K Sri Rama Murty
-
Karthika Vijayan, et. al.Karthika Vijayan ... K Sri Rama Murty
01 Nov 2019
01 Nov 2019

Adaptive Starting Points in Video Learning Environments for New Learners Based on Video and Topic Tree Relations
Alexander Lehmann
-
Alexander LehmannAlexander Lehmann
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Initialization, Training, and Context-Dependency in HMM-Based Formant Tracking

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing