Abstract

Recognition of human emotions is a basic requirement in many real-time applications, and detecting the exact emotion conveyed in the voice provides relevant information for various purposes. Several computational methods have been employed to analyze human emotions, but most previous approaches suffer from drawbacks such as degraded signal quality, high storage requirements, increased computational complexity, and poor classification accuracy. The proposed work classifies embedded emotions accurately while minimizing computational complexity by means of a modified deep duck and traveler recurrent neural network (MDDTRNN). The proposed pipeline comprises four steps: preprocessing, feature extraction, feature selection, and classification. In feature extraction, spectral and frequency features are extracted using a boosted Mel frequency cepstral coefficients (MFCC) method to improve training speed. In feature selection, the best features are chosen with an adaptive African vulture optimization algorithm (AAVOA). Classification is then performed with the MDDTRNN to produce the final emotion labels. On the IEMOCAP dataset, the proposed work outperforms existing approaches with an accuracy of 95.86%, precision of 93.79%, specificity of 94.28%, sensitivity of 92.89%, and an error rate of 5.266. On the EMODB dataset, it achieves an accuracy of 96.27%, precision of 94.83%, specificity of 93.16%, sensitivity of 94%, and an error rate of 4.982.
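To make the feature-extraction step concrete, the sketch below shows standard MFCC extraction, the baseline the paper's boosted variant builds on. It is a minimal illustration, not the authors' method: the boosting step is not specified in the abstract, and the file path, sample rate, and coefficient count are assumptions for demonstration.

```python
# Minimal sketch of plain MFCC extraction with librosa; the paper's
# "boosted MFCC" refinement is not described here, so only the standard
# baseline step is shown.
import librosa
import numpy as np

def extract_mfcc(path: str, n_mfcc: int = 13) -> np.ndarray:
    """Load an audio file and return its MFCC matrix (n_mfcc x frames)."""
    y, sr = librosa.load(path, sr=16000)  # resample to 16 kHz (assumed rate)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)

# Example usage: average each coefficient over time to obtain a
# fixed-length vector suitable for a downstream classifier.
# features = extract_mfcc("speech.wav").mean(axis=1)
```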
