Stressed Speech Emotion Recognition Using Teager Energy and Spectral Feature Fusion with Feature Optimization.

Surekha Reddy Bandela,S Siva Priyanka,K Sunil Kumar,Y Vijay Bhaskar Reddy,Afework Aemro Berhanu

doi:10.1155/2023/5765760

Abstract

The objective of speech emotion recognition (SER) is to enhance man-machine interface. It can also be used to cover the physiological state of a person in critical situations. In recent time, speech emotion recognition also finds its operations in medicine and forensics. A new feature extraction technique using Teager energy operator (TEO) is proposed for the detection of stressed emotions as Teager energy-autocorrelation envelope (TEO-Auto-Env). TEO is basically designed for increasing the energies of the stressed speech signals whose energies are reduced during the speech production process and hence used in this analysis. A stressed speech emotion recognition (SSER) system is developed using TEO-Auto-Env and spectral feature combination for detecting the emotions. The spectral features considered are Mel-frequency cepstral coefficients (MFCC), linear prediction cepstral coefficients (LPCC), and relative spectra-perceptual linear prediction (RASTA-PLP). EMO-DB (German), EMOVO (Italian), IITKGP (Telugu), and EMA (English) databases are used in this analysis. The classification of the emotions is carried out using the k-nearest neighborhood (k-NN) classifier for gender-dependent (GD) and speaker-independent (SI) cases. The proposed SSER system provides improved accuracy compared to the existing ones. Average recall is used for performance evaluation. The highest classification accuracy is achieved using the feature combination of TEO-Auto-Env, MFCC, and LPCC features with 91.4% (SI), 91.4% (GD-male), and 93.1%(GD-female) for EMO-DB; 68.5% (SI), 68.5% (GD-male), and 74.6% (GD-female) for EMOVO; 90.6%(SI), 91% (GD-male), and 92.3% (GD-female) for EMA; and 95.1% (GD-female) for IITKGP female database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stressed Speech Emotion Recognition Using Teager Energy and Spectral Feature Fusion with Feature Optimization.

Abstract

Talk to us

Similar Papers

More From: Computational Intelligence and Neuroscience

Lead the way for us

Journal: Computational Intelligence and Neuroscience	Publication Date: Jan 1, 2023
License type: CC BY 4.0

Similar Papers

Speech Emotion Recognition Using Feature Fusion of TEO and MFCC on Multilingual Databases
Syed Asif Ahmad Qadri ... Taiba Majid Wani
-
Syed Asif Ahmad Qadri, et. al.Syed Asif Ahmad Qadri ... Taiba Majid Wani
16 Jul 2021
16 Jul 2021

Speech Emotion Recognition Using Deep Neural Networks on Multilingual Databases
Syed Asif Ahmad Qadri ... Mira Kartiwi
-
Syed Asif Ahmad Qadri, et. al.Syed Asif Ahmad Qadri ... Mira Kartiwi
01 Jan 2020
01 Jan 2020

Cochannel speaker count labelling based on the use of cepstral and pitch prediction derived features
Michael A Lewis ... Ravi P Ramachandran
Pattern Recognition | VOL. 34
Michael A Lewis, et. al.Michael A Lewis ... Ravi P Ramachandran
01 Feb 2001
Pattern Recognition | VOL. 34

A Comparison of MFCC and LPCC with Deep Learning for Speaker Recognition
Haiyan Yang ... Yanrong Deng
-
Haiyan Yang, et. al.Haiyan Yang ... Yanrong Deng
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stressed Speech Emotion Recognition Using Teager Energy and Spectral Feature Fusion with Feature Optimization.

Abstract

Talk to us

Similar Papers

More From: Computational Intelligence and Neuroscience