Time scale modification and vocal tract length normalization for improving the performance of Tamil speech recognition system implemented using language independent segmentation algorithm

S Saraswathi,T V Geetha

doi:10.1007/s10772-007-9004-y

Abstract

This paper describes the work done in improving the performance of Tamil speech recognition system by using Time Scale Modification (TSM) and Vocal Tract Length Normalization (VTLN) techniques. The speech recognition system for Tamil language was developed using a new approach of text independent speech segmentation, with a phoneme based language model for recognition. There is degradation in the performance of speech recognition due to variations in the speaking rate and vocal tract shape among different speakers. In order to improve the performance of speech recognition system, both TSM and VTLN normalization techniques were used in this work. The TSM was implemented using the Phase vocoder approach and the VTLN was implemented using speaker specific bark/mel scale in bark/mel domain. The performance of Tamil speech recognition system was improved by performing both TSM and VTLN normalization techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Time scale modification and vocal tract length normalization for improving the performance of Tamil speech recognition system implemented using language independent segmentation algorithm

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: Dec 1, 2006
Citations: 18

Similar Papers

Feature compensation based on the normalization of vocal tract length for the improvement of emotion-affected speech recognition
Masoud Geravanchizadeh ... Meysam Bashirpour
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2021
Masoud Geravanchizadeh, et. al.Masoud Geravanchizadeh ... Meysam Bashirpour
04 Aug 2021
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2021

Feature Level Solution to Noise Robust Speech Recognition in the context of Tonal Languages
Utpal Bhattacharjee ... Jyoti Mannala
International Journal of Engineering and Advanced Technology | VOL. 9
Utpal Bhattacharjee, et. al.Utpal Bhattacharjee ... Jyoti Mannala
30 Dec 2020
International Journal of Engineering and Advanced Technology | VOL. 9

Speaker Adaptation With Limited Data Using Regression-Tree-Based Spectral Peak Alignment
Shizhen Wang ... Abeer Alwan
IEEE Transactions on Audio, Speech and Language Processing | VOL. 15
Shizhen Wang, et. al.Shizhen Wang ... Abeer Alwan
01 Nov 2007
IEEE Transactions on Audio, Speech and Language Processing | VOL. 15

Data-pooling and multi-task learning for enhanced performance of speech recognition systems in multiple low resourced languages
A Madhavaraj ... A G Ramakrishnan
-
A Madhavaraj, et. al.A Madhavaraj ... A G Ramakrishnan
01 Feb 2019
01 Feb 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Time scale modification and vocal tract length normalization for improving the performance of Tamil speech recognition system implemented using language independent segmentation algorithm

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology