Wavelet Scattering Transform and CNN for Closed Set Speaker Identification

Wajdi Ghezaiel,Luc Brun,Olivier Lezoray

doi:10.1109/mmsp48831.2020.9287061

Abstract

In real world applications, the performances of speaker identification systems degrade due to the reduction of both the amount and the quality of speech utterance. For that particular purpose, we propose a speaker identification system where short utterances with few training examples are used for person identification. Therefore, only a very small amount of data involving a sentence of 2-4 seconds is used. To achieve this, we propose a novel raw waveform end-to-end convolutional neural network (CNN) for text-independent speaker identification. We use wavelet scattering transform as a fixed initialization of the first layers of a CNN network, and learn the remaining layers in a supervised manner. The conducted experiments show that our hybrid architecture combining wavelet scattering transform and CNN can successfully perform efficient feature extraction for a speaker identification, even with a small number of short duration training samples.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Wavelet Scattering Transform and CNN for Closed Set Speaker Identification

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Sep 21, 2020
Citations: 34	License type: other-oa

Similar Papers

DNN Based Speaker Identification System Under Multi-Variability Speech Conditions
Banala Saritha ... Madhuchhanda Choudhury
-
Banala Saritha, et. al.Banala Saritha ... Madhuchhanda Choudhury
30 Dec 2022
30 Dec 2022

Bottleneck and Embedding Representation of Speech for DNN-based Language and Speaker Recognition
Alicia Lozano-Diez ... Joaquin Gonzalez-Rodriguez
-
Alicia Lozano-Diez, et. al.Alicia Lozano-Diez ... Joaquin Gonzalez-Rodriguez
21 Nov 2018
21 Nov 2018

Ship Classification in SAR Imagery by Shallow CNN Pre-Trained on Task-Specific Dataset with Feature Refinement
Haitao Lang ... Ruifu Wang
Remote Sensing | VOL. 14
Haitao Lang, et. al.Haitao Lang ... Ruifu Wang
25 Nov 2022
Remote Sensing | VOL. 14

Hybrid Network For End-To-End Text-Independent Speaker Identification

-

29 Dec 2020
29 Dec 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Wavelet Scattering Transform and CNN for Closed Set Speaker Identification

Abstract

Talk to us

Similar Papers