A Methodology for Speaker Diazaration System Based on LSTM and MFCC Coefficients

Indu D Indu D

doi:10.52783/jes.3299

Abstract

Research on Speaker Identification is always difficult. A speaker may be automatically identified using by comparing their voice sample with their previously recorded voice, the machine learning strategy has grown in favor in recent years. Convolutional neural networks (CNN) , deep neural networks (DNN) are some of the machine learning techniques that has employed recently. The article will discuss a successful speaker verification system based on the d-vector to construct a new approach based on speaker diarization. In particular, in this article, we use the concept of LSTM to cluster the speech segments using MFCC coefficients and identify the speakers in the diarization system. The proposed system will be evaluated using benchmark performance metrics, and a comparative study will be made with other models. The need to consider the LSTM neural network using acoustic data and linguistic dialect is considered. LSTM networks could produce reliable speaker segmentation outputs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Methodology for Speaker Diazaration System Based on LSTM and MFCC Coefficients

Abstract

Talk to us

Similar Papers

More From: Journal of Electrical Systems

Lead the way for us

Journal: Journal of Electrical Systems	Publication Date: May 2, 2024
License type: CC BY-ND 4.0

Similar Papers

Bottleneck and Embedding Representation of Speech for DNN-based Language and Speaker Recognition
Alicia Lozano-Diez ... Joaquin Gonzalez-Rodriguez
-
Alicia Lozano-Diez, et. al.Alicia Lozano-Diez ... Joaquin Gonzalez-Rodriguez
21 Nov 2018
21 Nov 2018

Streamflow prediction using an integrated methodology based on convolutional neural network and long short-term memory networks
Sujan Ghimire ... Zaher Mundher Yaseen
Scientific Reports | VOL. 11
Sujan Ghimire, et. al.Sujan Ghimire ... Zaher Mundher Yaseen
01 Sep 2021
Scientific Reports | VOL. 11

Deep distributed convolutional neural networks: Universality
Ding-Xuan Zhou
Analysis and Applications | VOL. 16
Ding-Xuan ZhouDing-Xuan Zhou
01 Nov 2018
Analysis and Applications | VOL. 16

Comprehensive Study for Breast Cancer Using Deep Learning and Traditional Machine Learning
-
ZANCO JOURNAL OF PURE AND APPLIED SCIENCES | VOL. 34
--
12 Apr 2022
ZANCO JOURNAL OF PURE AND APPLIED SCIENCES | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Methodology for Speaker Diazaration System Based on LSTM and MFCC Coefficients

Abstract

Talk to us

Similar Papers

More From: Journal of Electrical Systems