Making Speaker Diarization System Noise Tolerant

Davit S Karamyan,Grigor A Kirakosyan,Saten A Harutyunyan

doi:10.51408/1963-0102

Abstract

The goal of speaker diarization is to identify and separate different speakers in a multi-speaker audio recording. However, noise in the recording can interfere with the accuracy of these systems. In this paper, we explore methods such as multi-condition training, consistency regularization, and teacher-student techniques to improve the resilience of speaker embedding extractors to noise. We test the effectiveness of these methods on speaker verification and speaker diarization tasks and demonstrate that they lead to improved performance in the presence of noise and reverberation. To test the speaker verification and diarization system under noisy and reverberant conditions, we created augmented versions of the VoxCeleb1 cleaned test and Voxconverse dev datasets by adding noise and echo with different SNR values. Our results show that, on average, we can achieve a 19.1% relative improvement in speaker recognition using the teacher-student method and a 17% relative improvement in speaker diarization using consistency regularization compared to a multi-condition trained baseline.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Making Speaker Diarization System Noise Tolerant

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems of Computer Science

Lead the way for us

Journal: Mathematical Problems of Computer Science	Publication Date: May 31, 2023
License type: cc-by-nc

Similar Papers

Speaker diarization and detection system using a priori speaker information
Ouassila Kenai ... Salim Djeghiour
-
Ouassila Kenai, et. al.Ouassila Kenai ... Salim Djeghiour
01 Apr 2018
01 Apr 2018

Speaker Diarization For Vietnamese Conversations Using Deep Neural Network Embeddings
Tung Lam Nguyen ... Nhat Minh Le
-
Tung Lam Nguyen, et. al.Tung Lam Nguyen ... Nhat Minh Le
27 Jul 2022
27 Jul 2022

Speaker Diarization with LSTM
Quan Wang ... Ignacio Lopz Moreno
-
Quan Wang, et. al.Quan Wang ... Ignacio Lopz Moreno
01 Apr 2018
01 Apr 2018

Community Detection Graph Convolutional Network for Overlap-Aware Speaker Diarization
Jie Wang ... Haodong Zhou
-
Jie Wang, et. al.Jie Wang ... Haodong Zhou
04 Jun 2023
04 Jun 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Making Speaker Diarization System Noise Tolerant

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems of Computer Science