Self-Supervised Open-Set Speaker Recognition with Laguerre-Voronoi Descriptors.

Abu Quwsar Ohi,Marina L Gavrilova

doi:10.3390/s24061996

Abu Quwsar Ohi, Marina L Gavrilova

Open Access

PDF Available

https://doi.org/10.3390/s24061996

Copy DOI

Export

Save

Cite

Journal: Sensors	Publication Date: Mar 21, 2024
License type: CC BY 4.0

Affiliation: University of Calgary

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Speaker recognition is a challenging problem in behavioral biometrics that has been rigorously investigated over the last decade. Although numerous supervised closed-set systems inherit the power of deep neural networks, limited studies have been made on open-set speaker recognition. This paper proposes a self-supervised open-set speaker recognition that leverages the geometric properties of speaker distribution for accurate and robust speaker verification. The proposed framework consists of a deep neural network incorporating a wider viewpoint of temporal speech features and Laguerre-Voronoi diagram-based speech feature extraction. The deep neural network is trained with a specialized clustering criterion that only requires positive pairs during training. The experiments validated that the proposed system outperformed current state-of-the-art methods in open-set speaker recognition and cluster representation.

Full Text