Abstract

Tamil Language is one of the ancient Dravidian languages spoken in south India. Most of the Indian languages are syllabic in nature and syllables are in the form of Consonant-Vowel (CV) units. In Tamil language, CV pattern occurs in the beginning, middle and end of a word. In this work, CV Units formed with Stop Consonant - Short Vowel (SCSV) were considered for classification task. The work carried out in three stages, Vowel Onset Point (VOP) detection, CV segmentation and classification. VOP is an event at which the consonant part ends and vowel part begins. VOPs are identified using linear prediction residuals which provide significant characteristics of the excitation source. To segment the CV units, fixed length spectral frames before and after VOPs are considered. In this work, production based features, Linear Predictive Cepstral Coefficients (LPCC) and perception based features, Perceptual Linear Predictive Cepstral Coefficients (PLP) and Mel Frequency Cepstral Coefficients (MFCC) are extracted which are used to build the SCSV classifier using multilayer perceptron and support vector machine. A speech corpus of 200 Tamil words uttered by 15 native speakers was used, which covers all SCSV units formed with Tamil stop consonants (/k/, /ch/, /d/, /t/, /p/) and short vowels (/a/, /i/, /u/, /e/, /o/). The classifiers are trained and tested for its performance using predictive accuracy measure. The results indicate that perception based features, MFCC and PLP provides better results than production based features, LPCC and the model built using support vector machine outperforms.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call