Speaker Recognition Using Constrained Convolutional Neural Networks in Emotional Speech

Nikola Simić, Zoran Perić, Siniša Suzić, Vlado Delić, Mia Vujović, Tijana Nosek, Milan Savić

doi:10.3390/e24030414


Open Access


Abstract

Speaker recognition is an important classification task that can be solved using several approaches. Building a speaker recognition model on a closed set of speakers under neutral speaking conditions is a well-researched task, and existing solutions provide excellent performance. However, the classification accuracy of such models decreases significantly when they are applied to emotional speech or used in the presence of interference. Furthermore, deep models may require a large number of parameters, so constrained solutions are desirable for implementation on edge devices in Internet of Things systems for real-time detection. The aim of this paper is to propose a simple, constrained convolutional neural network for speaker recognition and to examine its robustness in emotional speech conditions. We examine three quantization methods for developing a constrained network: the floating-point eight format, ternary scalar quantization, and binary scalar quantization. The results are demonstrated on the recently recorded SEAC dataset.
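To make the last two quantization methods named in the abstract concrete, here is a minimal sketch of binary and ternary scalar quantization applied to a short list of weights. It is not the authors' implementation: the function names, the scaling factor (the mean absolute weight), and the ternary threshold ratio of 0.7 are all assumptions chosen for illustration, and the paper may define its quantizers differently.

```python
def binary_quantize(weights):
    """Map every weight to one of two levels, -alpha or +alpha.

    Assumption: alpha is the mean absolute weight (an illustrative choice).
    """
    alpha = sum(abs(w) for w in weights) / len(weights)
    return [alpha if w >= 0 else -alpha for w in weights]


def ternary_quantize(weights, threshold_ratio=0.7):
    """Map every weight to one of three levels: -alpha, 0, or +alpha.

    Assumption: weights whose magnitude is at most
    threshold_ratio * mean(|w|) are set to zero, and alpha is the mean
    magnitude of the remaining weights (an illustrative heuristic).
    """
    delta = threshold_ratio * sum(abs(w) for w in weights) / len(weights)
    kept = [abs(w) for w in weights if abs(w) > delta]
    alpha = sum(kept) / len(kept) if kept else 0.0
    return [0.0 if abs(w) <= delta else (alpha if w > 0 else -alpha)
            for w in weights]


# Hypothetical example weights, not taken from the paper.
weights = [0.9, -0.05, 0.4, -0.7]
binary = binary_quantize(weights)
ternary = ternary_quantize(weights)
print(binary)
print(ternary)
```

In both cases each weight can be stored as a one- or two-bit code plus a single shared scale, which is the kind of memory saving that motivates constrained networks for edge devices.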


Journal: Entropy
Publication Date: Mar 16, 2022
Citations: 14
License type: CC BY 4.0

Similar Papers

SEC-GAN for robust speaker recognition with emotional state dismatch
Dongdong Li ... Ming Hua
Biomedical Signal Processing and Control | VOL. 85
20 May 2023

Analysis of source and system features for speaker recognition in emotional conditions
K N R K Raju Alluri ... Anil Kumar Vuppala
01 Nov 2016

Analyzing Noise Robustness of Cochleogram and Mel Spectrogram Features in Deep Learning Based Speaker Recognition
Wondimu Lambamo ... Ramasamy Srinivasagan
Applied Sciences | VOL. 13
31 Dec 2022

Employing Emotion Cues to Verify Speakers in Emotional Talking Environments
Ismail Shahin
Journal of Intelligent Systems | VOL. 25
24 Feb 2015
