Few-shot short utterance speaker verification using meta-learning.

Weijie Wang,Hong Zhao,Youkang Chang,Haojie You,Yikun Yang

doi:10.7717/peerj-cs.1276

Weijie Wang, Hong Zhao + Show 3 more

Open Access

https://doi.org/10.7717/peerj-cs.1276

Copy DOI

Abstract

Short utterance speaker verification (SV) in the actual application is the task of accepting or rejecting the identity claim of a speaker based on a few enrollment utterances. Traditional methods have used deep neural networks to extract speaker representations for verification. Recently, several meta-learning approaches have learned a deep distance metric to distinguish speakers within meta-tasks. Among them, a prototypical network learns a metric space that may be used to compute the distance to the prototype center of speakers, in order to classify speaker identity. We use emphasized channel attention, propagation and aggregation in TDNN (ECAPA-TDNN) to implement the necessary function for the prototypical network, which is a nonlinear mapping from the input space to the metric space for either few-shot SV task. In addition, optimizing only for speakers in given meta-tasks cannot be sufficient to learn distinctive speaker features. Thus, we used an episodic training strategy, in which the classes of the support and query sets correspond to the classes of the entire training set, further improving the model performance. The proposed model outperforms comparison models on the VoxCeleb1 dataset and has a wide range of practical applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PeerJ. Computer science	Publication Date: Apr 21, 2023
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Few-shot short utterance speaker verification using meta-learning.

Abstract

Talk to us

Similar Papers

More From: PeerJ. Computer science

Lead the way for us

Similar Papers

Normal-to-shouted speech spectral mapping for speaker recognition under vocal effort mismatch
Ana Ramirez Lopez ... Paavo Alku
-
Ana Ramirez Lopez, et. al.Ana Ramirez Lopez ... Paavo Alku
01 Mar 2017
01 Mar 2017

Prototypical Networks for Small Footprint Text-Independent Speaker Verification
Tom Ko ... Yangbin Chen
-
Tom Ko, et. al.Tom Ko ... Yangbin Chen
01 May 2020
01 May 2020

Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs
Seong Min Kye ... Hoirin Kim
-
Seong Min Kye, et. al.Seong Min Kye ... Hoirin Kim
25 Oct 2020
25 Oct 2020

Speaker recognition using PCA-based feature transformation
Ahmed Isam Ahmed ... Victor M Becerra
Speech Communication | VOL. 110
Ahmed Isam Ahmed, et. al.Ahmed Isam Ahmed ... Victor M Becerra
02 Apr 2019
Speech Communication | VOL. 110

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Few-shot short utterance speaker verification using meta-learning.

Abstract

Talk to us

Similar Papers

More From: PeerJ. Computer science