Abstract

This work presents a stressed (emotional) speech recognition system based on a Vector Quantization (VQ) approach using the Generalized Lloyd algorithm. Frequency, amplitude, and phase features are extracted from the sinusoidal model of speech and used to characterize stressed speech. The conditions considered in this study are anger, happiness, compassion, and neutral. Data for analysis and classification were recorded from thirty speakers; each speaker uttered five statements for each emotion in two languages, English and Telugu (an Indian language). The features of stressed speech signals are compared with those of neutral speech signals. The results show that sinusoidal model features can be used successfully to classify stressed speech. A total of 320 stressed speech data files are used for training and 280 for testing. The performance of the VQ classifier with sinusoidal model features has been tested under normal and noisy conditions. A maximum recognition rate of 92.8% is achieved with frequency features as the input to the VQ classifier.
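As an illustration of the classification idea only, the following minimal sketch trains one VQ codebook per emotion with a generalized Lloyd iteration and labels an utterance by the codebook that gives the lowest average distortion. It is not the authors' implementation: the function names, the codebook size, the iteration count, the random initialization, and the squared Euclidean distortion measure are all assumptions, and the feature vectors are taken as given rather than extracted from a sinusoidal model.

```python
import numpy as np

def train_codebook(features, codebook_size=4, iterations=20, seed=0):
    """Generalized Lloyd iteration (hypothetical settings): alternate
    nearest-codeword assignment and centroid update."""
    rng = np.random.default_rng(seed)
    features = np.asarray(features, dtype=float)
    # Initialise codewords from randomly chosen training vectors (assumption).
    idx = rng.choice(len(features), size=codebook_size, replace=False)
    codebook = features[idx].copy()
    for _ in range(iterations):
        # Squared Euclidean distance from every vector to every codeword.
        dists = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        nearest = dists.argmin(axis=1)
        for k in range(codebook_size):
            members = features[nearest == k]
            if len(members) > 0:  # an empty cell keeps its previous codeword
                codebook[k] = members.mean(axis=0)
    return codebook

def distortion(features, codebook):
    """Average squared distance from each vector to its nearest codeword."""
    features = np.asarray(features, dtype=float)
    dists = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return dists.min(axis=1).mean()

def classify(features, codebooks):
    """Return the label whose codebook yields the lowest average distortion."""
    return min(codebooks, key=lambda label: distortion(features, codebooks[label]))
```

In use, `codebooks` would be a dictionary mapping each emotion label to the codebook trained on that emotion's feature vectors, and `classify` would be called with the feature vectors of one test utterance.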
