A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification

Juan M Coria,Sahar Ghannay,Sophie Rosset,Hervé Bredin

doi:10.1007/978-3-030-59430-5_11

Abstract

Despite the growing popularity of metric learningapproaches, very little work has attempted to perform a fair comparison of these techniques for speaker verification. We try to fill this gap and compare several metric learning loss functions in a systematic manner on the VoxCeleb dataset. The first family of loss functions is derived from the cross entropy loss (usually used for supervised classification) and includes the congenerous cosine loss, the additive angular margin loss, and the center loss. The second family of loss functions focuses on the similarity between training samples and includes the contrastive loss and the triplet loss. We show that the additive angular margin loss function outperforms all other loss functions in the study, while learning more robust representations. Based on a combination of SincNet trainable features and the x-vector architecture, the network used in this paper brings us a step closer to a truly end-to-end speaker verification system, when combined with the additive angular margin loss, while still being competitive with the x-vector baseline. In the spirit of reproducible research, we also release open source Python code for reproducing our results, and share pretrained PyTorch models on torch.hub that can be used either directly or after fine-tuning.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

ADCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems
Victoria Mingote ... Antonio Miguel
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
Victoria Mingote, et. al.Victoria Mingote ... Antonio Miguel
01 Jan 2021
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30

Angular Softmax Loss for End-to-end Speaker Verification
Yutian Li ... Jiasong Sun
-
Yutian Li, et. al.Yutian Li ... Jiasong Sun
01 Nov 2018
01 Nov 2018

Additive Angular Margin Loss in Deep Graph Neural Network Classifier for Learning Graph Edit Distance
Nadeem Iqbal Kajla ... Muhammad Muzzamil Luqman
IEEE Access | VOL. 8
Nadeem Iqbal Kajla, et. al.Nadeem Iqbal Kajla ... Muhammad Muzzamil Luqman
01 Jan 2020
IEEE Access | VOL. 8

Pseudo-Phoneme Label Loss for Text-Independent Speaker Verification
Mengqi Niu ... Zhihua Fang
Applied Sciences | VOL. 12
Mengqi Niu, et. al.Mengqi Niu ... Zhihua Fang
25 Jul 2022
Applied Sciences | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification

Abstract

Talk to us

Similar Papers