Performance Evaluation of Speaker Identification in Language and Emotion Mismatch Conditions on Eastern and North Eastern Low Resource Languages of India

Joyanta Basu, Swanirbhar Majumder, Tapan Kumar Basu

doi:10.1007/978-981-16-2641-8_49

Abstract

This paper describes the impact of spoken language and emotional variation on a multilingual speaker identification (SID) system. Developing speech technology applications for low resource languages (LRL) is challenging because suitable speech corpora are unavailable. The paper analyzes SID performance on six Eastern and North Eastern (E&NE) Indian languages and on an emotional corpus covering six basic emotions. For this purpose, six experiments are carried out using the collected E&NE LRL data to build speaker identification models. Speaker-specific acoustic characteristics are extracted from the speech segments as short-term spectral features, namely shifted delta cepstral (SDC) and partial correlation (PARCOR) coefficients. Gaussian mixture model (GMM) and support vector machine (SVM)-based models are developed to represent the speaker-specific information captured by these spectral features. In addition, to build modern SID systems, i-vectors, time delay neural networks (TDNN), and recurrent neural networks with long short-term memory (LSTM-RNN) are considered. Equal error rate (EER) is used as the performance metric of the SID system. The developed systems are analyzed on emotional native and non-native language corpora in terms of SID accuracy across the six experiments.

Keywords: Low resource language (LRL); Speaker identification (SID); Shifted delta cepstral (SDC); Partial correlation (PARCOR) coefficients; i-vectors; Linear discriminant analysis (LDA); Probabilistic linear discriminant analysis (PLDA); Deep neural network (DNN); Time delay neural networks (TDNN); Recurrent neural network (RNN); Long short-term memory (LSTM)
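Since the abstract names equal error rate (EER) as the evaluation metric, the following minimal sketch illustrates how an EER can be computed from trial scores. It is not the authors' implementation: the function name `equal_error_rate`, the threshold sweep over observed scores, and the toy score values are all assumptions made for illustration, and NumPy is assumed to be available.

```python
import numpy as np

def equal_error_rate(target_scores, nontarget_scores):
    """Return (eer, threshold) for two sets of trial scores.

    Hypothetical helper: higher scores are assumed to mean
    "more likely the claimed speaker".
    """
    target_scores = np.asarray(target_scores, dtype=float)
    nontarget_scores = np.asarray(nontarget_scores, dtype=float)

    # Candidate thresholds: every distinct score observed in either set.
    thresholds = np.unique(np.concatenate([target_scores, nontarget_scores]))

    # False rejection rate: fraction of target trials scoring below the threshold.
    frr = np.array([(target_scores < t).mean() for t in thresholds])
    # False acceptance rate: fraction of non-target trials scoring at or above it.
    far = np.array([(nontarget_scores >= t).mean() for t in thresholds])

    # The EER is taken at the threshold where the two error rates are closest.
    idx = int(np.argmin(np.abs(far - frr)))
    return (far[idx] + frr[idx]) / 2.0, thresholds[idx]

# Toy scores (illustrative only, not data from the paper).
eer, threshold = equal_error_rate([0.9, 0.8, 0.7, 0.4], [0.6, 0.3, 0.2, 0.1])
print(f"EER = {eer:.2%} at threshold {threshold}")
```

With real systems the scores would come from the GMM, SVM, i-vector, or neural back-ends, and a finer threshold sweep or interpolation between the two error curves may be preferable when there are few trials.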

