Abstract

Speaker recognition in an emotive environment is a challenging task because of the influence of emotion on speech. A speaker can be identified from speech by analysing the features of the speech signal. Under normal conditions, identifying a speaker is not difficult; under emotional conditions such as happiness, sadness, anger, surprise, sarcasm or fear, however, it becomes genuinely challenging, since speech is altered by emotion and noise. The spectral features of a speech signal include Mel Frequency Cepstral Coefficients (MFCC), Shifted Delta Cepstral Coefficients (SDCC), spectral centroid, spectral roll-off, spectral flatness, spectral contrast, spectral bandwidth, chroma-STFT, zero crossing rate, root mean square energy, Linear Prediction Cepstral Coefficients (LPCC), spectral subband centroid, Teager-energy-based MFCC, line spectral frequencies, single frequency cepstral coefficients, formant frequencies, Power Normalized Cepstral Coefficients (PNCC), etc. The features extracted from the speech signal are classified using classifiers: Support Vector Machine (SVM), Gaussian Mixture Model, Gaussian Naive Bayes, K-Nearest Neighbour, Random Forest and a simple neural network built with Keras are used for classification. An important application is security systems, in which a person is identified by a voice biometric. This work aims to identify the speaker in an emotional environment using spectral features, to classify the extracted features with any of these classification techniques, and to achieve a high speaker recognition rate. Feature combinations can also be used to improve accuracy. The proposed model performed better than most state-of-the-art methods.
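The pipeline described above (extract spectral features from each utterance, then classify them with a standard classifier) can be illustrated with a minimal sketch. The sketch assumes the librosa and scikit-learn libraries and uses only a subset of the listed features (MFCC, spectral centroid, roll-off, flatness, bandwidth, chroma-STFT, zero crossing rate and RMS energy) with an SVM; file names and speaker labels are hypothetical placeholders, not the paper's actual data.

```python
# Minimal sketch of spectral-feature extraction + SVM speaker classification.
# Assumes librosa and scikit-learn; paths and labels below are placeholders.
import numpy as np
import librosa
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

def extract_features(path, sr=16000):
    """Return a fixed-length spectral feature vector for one utterance."""
    y, sr = librosa.load(path, sr=sr)
    frame_feats = [
        librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13),     # MFCC
        librosa.feature.spectral_centroid(y=y, sr=sr),   # spectral centroid
        librosa.feature.spectral_rolloff(y=y, sr=sr),    # spectral roll-off
        librosa.feature.spectral_flatness(y=y),          # spectral flatness
        librosa.feature.spectral_bandwidth(y=y, sr=sr),  # spectral bandwidth
        librosa.feature.chroma_stft(y=y, sr=sr),         # chroma-STFT
        librosa.feature.zero_crossing_rate(y),           # zero crossing rate
        librosa.feature.rms(y=y),                        # RMS energy
    ]
    # Summarise each frame-level feature by its mean over time.
    return np.concatenate([f.mean(axis=1) for f in frame_feats])

# Hypothetical emotional utterances labelled with speaker identities.
train_files = ["spk01_happy_001.wav", "spk02_angry_003.wav"]  # placeholder paths
train_speakers = ["spk01", "spk02"]                           # placeholder labels

X = np.stack([extract_features(f) for f in train_files])
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
clf.fit(X, train_speakers)

# Identify the speaker of an unseen emotional utterance.
test_vec = extract_features("unknown_sad_007.wav").reshape(1, -1)
print(clf.predict(test_vec))
```

The SVM could be swapped for any of the other classifiers mentioned (Gaussian Mixture Model, Gaussian Naive Bayes, K-Nearest Neighbour, Random Forest or a Keras neural network), and additional features could be concatenated into the same vector to form the feature combinations the abstract refers to.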
