Text-independent speaker recognition using LSTM-RNN and speech enhancement

Samia Abd El-Moneim,Adel S El-Fishawy,M A Nassar,Moawad I Dessouky,Fathi E Abd El-Samie,Nabil A Ismail

doi:10.1007/s11042-019-08293-7

Abstract

Speaker recognition revolution has lead to the inclusion of speaker recognition modules in several commercial products. Most published algorithms for speaker recognition focus on text-dependent speaker recognition. In contrast, text-independent speaker recognition is more advantageous as the client can talk freely to the system. In this paper, text-independent speaker recognition is considered in the presence of some degradation effects such as noise and reverberation. Mel-Frequency Cepstral Coefficients (MFCCs), spectrum and log-spectrum are used for feature extraction from the speech signals. These features are processed with the Long-Short Term Memory Recurrent Neural Network (LSTM-RNN) as a classification tool to complete the speaker recognition task. The network learns to recognize the speakers efficiently in a text-independent manner, when the recording circumstances are the same. The recognition rate reaches 95.33% using MFCCs, while it is increased to 98.7% when using spectrum or log-spectrum. However, the system has some challenges to recognize speakers from different recording environments. Hence, different speech enhancement techniques, such as spectral subtraction and wavelet denoising, are used to improve the recognition performance to some extent. The proposed approach shows superiority, when compared to the algorithm of R. Togneri and D. Pullella (2011).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Text-independent speaker recognition using LSTM-RNN and speech enhancement

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Journal: Multimedia Tools and Applications	Publication Date: Jun 17, 2020
Citations: 41

Similar Papers

Comparative Analysis of Speaker Identification Performance Using Deep Learning, Machine Learning, and Novel Subspace Classifiers with Multiple Feature Extraction Techniques
Serkan Keser ... Esra Gezer
Digital Signal Processing | VOL. 156
Serkan Keser, et. al.Serkan Keser ... Esra Gezer
01 Jan 2025
Digital Signal Processing | VOL. 156

Effect of Reverberation Phenomena on Text- independent Speaker Recognition Based Deep Learning
Samia Abd El-Moneim ... Moawd I Dessouky
Menoufia Journal of Electronic Engineering Research | VOL. 28
Samia Abd El-Moneim, et. al.Samia Abd El-Moneim ... Moawd I Dessouky
01 Dec 2019
Menoufia Journal of Electronic Engineering Research | VOL. 28

A Comparison Between MFCC and MSE Features for Text-Independent Speaker Recognition Using Machine Learning Algorithms
Joseph Isaac Ramírez-Hernández ... Luis C González-Gurrola
-
Joseph Isaac Ramírez-Hernández, et. al.Joseph Isaac Ramírez-Hernández ... Luis C González-Gurrola
01 Jan 2023
01 Jan 2023

Bottleneck and Embedding Representation of Speech for DNN-based Language and Speaker Recognition
Alicia Lozano-Diez ... Joaquin Gonzalez-Rodriguez
-
Alicia Lozano-Diez, et. al.Alicia Lozano-Diez ... Joaquin Gonzalez-Rodriguez
21 Nov 2018
21 Nov 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Text-independent speaker recognition using LSTM-RNN and speech enhancement

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications