Automatic Speaker Recognition using Deep Neural Network Classifiers

Abdikarim Ali Moumin, Smitha S Kumar

doi:10.1109/iccakm50778.2021.9357699

Abstract

Advances in modern computing technologies have driven breakthroughs in artificial intelligence (AI) and the Internet of Things (IoT). One major recent achievement is the ability of software to classify and recognize objects or sounds by learning from data. In this paper, we train a system to recognize people from their voice utterances using the TIMIT Acoustic-Phonetic Continuous Speech Corpus. A speaker's identity is enrolled by acquiring voice samples of the speaker. Relevant features are extracted, and a model is built from the extracted feature vectors. Pattern-matching classification is then applied to the model using artificial neural network techniques. The speaker verification system is built using Kaldi libraries to analyze acoustic features, while x-vector training is implemented using TensorFlow. To achieve better performance, we combine multiple layers of TDNN (Time Delay Neural Network) and LSTM (Long Short-Term Memory) deep neural networks.
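The enrollment and verification flow described above can be illustrated with a minimal sketch. It is not the authors' Kaldi/TensorFlow implementation: it assumes acoustic feature frames are already available as lists of floats, replaces the trained TDNN and LSTM layers with untrained TDNN-style context splicing followed by x-vector-style statistics pooling, and scores trials by cosine similarity. All function names, the context offsets, and the 0.9 threshold are hypothetical choices made for illustration.

```python
import math
from typing import List, Sequence

Vector = List[float]
Frames = Sequence[Sequence[float]]  # one feature vector per time step


def splice_context(frames: Frames, offsets: Sequence[int] = (-1, 0, 1)) -> List[Vector]:
    """TDNN-style input: concatenate each frame with its neighbours at the
    given time offsets. Edge frames are repeated (clamped indexing).
    A real TDNN layer would then apply learned weights and a nonlinearity;
    that step is omitted here (assumption: no trained model available)."""
    last = len(frames) - 1
    spliced = []
    for t in range(len(frames)):
        row: Vector = []
        for off in offsets:
            idx = min(max(t + off, 0), last)
            row.extend(frames[idx])
        spliced.append(row)
    return spliced


def stats_pool(frames: Frames) -> Vector:
    """Statistics pooling: per-dimension mean and standard deviation over
    time, giving one fixed-length vector per utterance."""
    n = len(frames)
    dims = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dims)]
    stds = [
        math.sqrt(sum((f[d] - means[d]) ** 2 for f in frames) / n)
        for d in range(dims)
    ]
    return means + stds


def embed(frames: Frames) -> Vector:
    """Utterance-level embedding (stand-in for a trained x-vector)."""
    return stats_pool(splice_context(frames))


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def enroll(utterances: Sequence[Frames]) -> Vector:
    """Build a speaker model by averaging the embeddings of the
    enrollment utterances."""
    embeddings = [embed(u) for u in utterances]
    count = len(embeddings)
    return [sum(e[d] for e in embeddings) / count for d in range(len(embeddings[0]))]


def verify(model: Vector, frames: Frames, threshold: float = 0.9) -> bool:
    """Accept the identity claim if the test utterance is close enough
    to the enrolled model (hypothetical threshold)."""
    return cosine(model, embed(frames)) >= threshold
```

In use, `enroll` would be called once per speaker with a few enrollment utterances, and `verify` would be called for each identity claim. In a trained system the decision threshold would normally be tuned on held-out trials instead of being fixed in advance.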
