Abstract

An accuracy of classifying human emotional states from speech has been dependent on emotional speech database, speech signal processing algorithms, and classification methods. Numerous static and dynamic classification techniques are used for emotional state recognition. The paper considers a novel method for classifying natural positive, negative, and neutral human emotional states based on a five-layer Recurrent Neural Network (RNN). The method novelty is due to various neuron activation functions employed for each network layer stemming from the peculiarities of natural emotional speech informative parameters used as input data for the neural network. Local and global informative speech parameters relevant to human emotional states are outlined, and the known classification methods are surveyed. The proposed classification method is described, and the developed Russian database of natural emotional speech is presented. The research results compared with the widely used classification methods have evidenced 95 % accuracy of the developed method, which can be effectively tested in systems for detecting and classifying natural human emotional states from speech.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.