Sequence-Level Speaker Change Detection With Difference-Based Continuous Integrate-and-Fire

Zhiyun Fan,Linhao Dong,Zejun Ma,Meng Cai,Bo Xu

doi:10.1109/lsp.2022.3185955

Abstract

Speaker change detection is an important task in multi-party interactions such as meetings and conversations. In this paper, we address the speaker change detection task from the perspective of sequence transduction. Specifically, we propose a novel encoder-decoder framework that directly converts the input feature sequence to the speaker identity sequence. The difference-based continuous integrate-and-fire mechanism is designed to support this framework. It detects speaker changes by integrating the speaker difference between the encoder outputs frame-by-frame and transfers encoder outputs to segment-level speaker embeddings according to the detected speaker changes. The whole framework is supervised by the speaker identity sequence, a weaker label than the precise speaker change points. The experiments on the AMI and DIHARD-I corpora show that our sequence-level method consistently outperforms a strong frame-level baseline that uses the precise speaker change labels.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Sequence-Level Speaker Change Detection With Difference-Based Continuous Integrate-and-Fire

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters

Lead the way for us

Journal: IEEE Signal Processing Letters	Publication Date: Jan 1, 2022
Citations: 1

Similar Papers

Speaker Change Detection For Transformer Transducer ASR
Jian Wu ... Zhuo Chen
-
Jian Wu, et. al.Jian Wu ... Zhuo Chen
04 Jun 2023
04 Jun 2023

Speaker Change Detection- an Comparative Study using Support Vector Machines
J.G.M Britto ... S.S Kumar
-
J.G.M Britto, et. al.J.G.M Britto ... S.S Kumar
01 Jan 2012
01 Jan 2012

Speaker Change Detection Using Fundamental Frequency with Application to Multi-talker Segmentation
Aidan O T Hogg ... Christine Evers
-
Aidan O T Hogg, et. al.Aidan O T Hogg ... Christine Evers
01 May 2019
01 May 2019

A Multitask Learning Framework for Speaker Change Detection with Content Information from Unsupervised Speech Decomposition
Hang Su ... Xixin Wu
-
Hang Su, et. al.Hang Su ... Xixin Wu
23 May 2022
23 May 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sequence-Level Speaker Change Detection With Difference-Based Continuous Integrate-and-Fire

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters