Abstract

<abstract> <p>Text-independent speaker verification aims to determine whether two utterances in an open-set task originate from the same speaker. In this paper, several methods are explored to enhance the discrimination of speaker embeddings. First, differential operations are introduced into the encoding layer to form the DeltaVLAD layer: frame-level speaker representations extracted by a deep neural network are differenced to capture the dynamic changes between frames, which helps capture subtle variations in the voiceprint. Meanwhile, NeXtVLAD is adopted to split the frame-level features into multiple subspaces before aggregation and then perform VLAD operations within each subspace, which significantly reduces the number of parameters while improving performance. Second, a margin-based softmax loss function is combined with a few-shot-learning-based loss function to obtain more discriminative speaker embeddings. Finally, for a fair comparison, experiments are conducted on VoxCeleb1; the results show that the proposed speaker verification system achieves new state-of-the-art performance.</p> </abstract>
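The two coding-layer ideas in the abstract — frame-to-frame differencing (DeltaVLAD) and splitting features into subspaces before aggregation (NeXtVLAD) — can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names, shapes, and padding choice are assumptions, and the per-subspace VLAD step itself is omitted.

```python
import numpy as np

def delta_features(frames):
    """First-order differences between consecutive frames, approximating
    the dynamic (delta) component described in the abstract.
    frames: (T, D) frame-level features; the first frame is repeated so
    the output keeps T rows (padding choice is an assumption)."""
    padded = np.vstack([frames[:1], frames])
    return padded[1:] - padded[:-1]

def split_into_subspaces(frames, num_groups):
    """NeXtVLAD-style split of D-dimensional features into num_groups
    subspaces of D // num_groups dimensions each, prior to performing
    VLAD aggregation independently in each subspace."""
    T, D = frames.shape
    assert D % num_groups == 0, "feature dim must divide evenly"
    return frames.reshape(T, num_groups, D // num_groups)

# Toy example: 5 frames of 8-dim features, split into 4 subspaces.
T, D, G = 5, 8, 4
x = np.arange(T * D, dtype=float).reshape(T, D)
deltas = delta_features(x)            # shape (5, 8); first row is zeros
groups = split_into_subspaces(x, G)   # shape (5, 4, 2)
```

Splitting before aggregation is what reduces the parameter count: each VLAD codebook then operates on `D // num_groups`-dimensional vectors rather than full `D`-dimensional ones.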

