Abstract

An effective way of communication between human is now becoming an alternative way to communicate between human and machine. This alternative way is now-a-days used in many real time systems for faster, easier and comfortable response and communication. Speech segmentation and labelling are the process that lay as a key to decide the accuracy of several speech related research. A tool AAYUDHA is proposed that enables automatic segmentation and labelling of continuous speech in Tamil. Two different segmentation algorithms, one based on Fast Fourier Transform (FFT) feature set and 2D filtering and other based on Discrete Wavelet Transform (DWT) feature set and its energy variation in different sub-bands are implemented. The segmentation accuracy of those algorithms is analyzed. Further the segmented speech is labelled using a baseline Hidden Markov Model (HMM) based acoustic model. A speech corpus named KAZHANGIYAM is created which includes the recorded Tamil speech of various speakers. The database also includes the information of manually segmented data of those speech data. This speech corpus is used to analyze the accuracy of the algorithms used in the proposed tool. This tool concentrates on the phonetic level segmentation of Tamil speech. The tool shows an acceptable segmentation and labelling accuracy.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call