An Approach for Identification of Speaker using Deep Learning

Abdul Basit, Syed Mujtaba Haider, Syeda Rabia

doi:10.58921/ijaims.v1i2.36

Open Access

Journal: International Journal of Artificial Intelligence & Mathematical Sciences

Publication Date: Jan 31, 2023

#Librispeech Dataset #Audio File

Abstract

The volume of audio data is increasing daily across the world with the growth of telephone conversations, video conferences, podcasts, and voice notes. This study presents a mechanism for identifying a speaker in an audio file based on biometric features of the human voice such as frequency, amplitude, and pitch. We propose an unsupervised learning model that uses wav2vec 2.0, in which the model learns speech representations from the dataset provided. We used the Librispeech dataset in our research and achieved an error rate of 1.8.
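
The abstract does not say how a learned speech representation is turned into a speaker decision. As a minimal sketch of one common approach, and not necessarily the authors' method, the example below assumes that frame-level feature vectors (such as those a wav2vec 2.0 encoder could produce) are averaged into a single utterance embedding and matched to enrolled speakers by cosine similarity. The function names `mean_pool`, `cosine`, and `identify`, the speaker labels, and the toy 3-dimensional vectors are all hypothetical stand-ins for illustration.

```python
import math


def mean_pool(frames):
    """Average frame-level feature vectors into one utterance-level embedding."""
    dim = len(frames[0])
    return [sum(frame[i] for frame in frames) / len(frames) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identify(frames, enrolled):
    """Return the enrolled speaker whose embedding is most similar to the utterance."""
    query = mean_pool(frames)
    return max(enrolled, key=lambda name: cosine(query, enrolled[name]))


# Toy vectors standing in for encoder outputs (assumption: real features
# would have hundreds of dimensions and come from a trained model).
enrolled = {
    "speaker_a": mean_pool([[0.9, 0.1, 0.0], [1.0, 0.2, 0.1]]),
    "speaker_b": mean_pool([[0.0, 0.8, 0.9], [0.1, 1.0, 0.7]]),
}
unknown = [[0.8, 0.2, 0.1], [0.9, 0.0, 0.0]]

print(identify(unknown, enrolled))  # prints "speaker_a"
```

In practice the toy vectors would be replaced by features extracted from audio, and the fraction of misidentified utterances over a held-out set would give an identification error rate.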

Full Text