Source and system features for phone recognition

K E Manjunath,K Sreenivasa Rao

doi:10.1007/s10772-014-9266-0

Abstract

In this work, we explore excitation source features in addition to vocal tract system features to improve the performance of phone recognition systems (PRSs). The excitation source information is derived by processing the linear prediction residual of the speech signal. The vocal tract information is captured using Mel-frequency cepstral coefficient (MFCC) features. The PRSs are developed using hidden Markov models. The robustness of the proposed excitation source features is demonstrated using speech samples corrupted by white and babble noise. The TIMIT and Bengali speech databases are used for developing the PRSs. Tandem PRSs are developed using the phone posteriors obtained from feedforward neural networks. The results show that tandem PRSs developed using the combination of excitation source and vocal tract system features outperform conventional tandem systems developed using system features alone. PRSs developed using the combination of excitation source and vocal tract features are also more robust to noise than PRSs developed using vocal tract features alone.
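
The abstract states that the excitation source information comes from the linear prediction (LP) residual. As a rough illustration of that signal only, the sketch below estimates LP coefficients for one frame with the autocorrelation method and inverse-filters the frame to get the residual. The function name `lp_residual`, the default order of 10, and the small regularization term are assumptions made for this example; the abstract does not say how the authors compute or further process the residual.

```python
import numpy as np

def lp_residual(frame, order=10):
    """Return (residual, lp_coeffs) for one frame (illustrative sketch).

    Uses the autocorrelation method; `order` is an assumed default,
    not a value taken from the paper.
    """
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation values for lags 0..order
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    # Toeplitz normal equations R a = r[1:]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    # Tiny diagonal loading so silent frames do not give a singular matrix
    R += (1e-9 * r[0] + 1e-12) * np.eye(order)
    a = np.linalg.solve(R, r[1:])
    # Predictor: s_hat[n] = sum_k a[k] * s[n-1-k]
    # Inverse filter A(z) = 1 - sum_k a[k] z^-(k+1); residual = s - s_hat
    inverse_filter = np.concatenate(([1.0], -a))
    residual = np.convolve(frame, inverse_filter)[:n]
    return residual, a

# Usage on a synthetic second-order autoregressive signal (not speech)
rng = np.random.default_rng(0)
excitation = rng.standard_normal(4000)
signal = np.zeros(4000)
for i in range(2, 4000):
    signal[i] = 1.3 * signal[i - 1] - 0.6 * signal[i - 2] + excitation[i]
residual, coeffs = lp_residual(signal, order=2)
```

For real speech the frame would typically be a short windowed segment, and the residual would then be processed further to obtain features; those steps are outside what the abstract describes.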

Journal: International Journal of Speech Technology
Publication Date: Dec 9, 2014
Citations: 13

Similar Papers

Articulatory and excitation source features for speech recognition in read, extempore and conversation modes
K E Manjunath ... K Sreenivasa Rao
International Journal of Speech Technology | VOL. 19
11 Dec 2015

Improvement of phone recognition accuracy using source and system features
K E Manjunath ... K Sreenivasa Rao
01 Jan 2015

Multilingual speech mode classification model for Indian languages
Kumud Tripathi ... K Sreenivasa Rao
01 Feb 2020

Improvement of phone recognition accuracy using speech mode classification
Kumud Tripathi ... K Sreenivasa Rao
International Journal of Speech Technology | VOL. 21
07 Dec 2017
