Speaker Change Detection Using Binary Key Modelling with Contextual Information

Jose Patino,Nicholas Evans,Héctor Delgado

doi:10.1007/978-3-319-68456-7_21

Abstract

Speaker change detection can be of benefit to a number of different speech processing tasks such as speaker diarization, recognition and detection. Current solutions rely either on highly localized data or on training with large quantities of background data. While efficient, the former tend to over-segment. While more stable, the latter are less efficient and need adaptation to mis-matching data. Building on previous work in speaker recognition and diarization, this paper reports a new binary key (BK) modelling approach to speaker change detection which aims to strike a balance between efficiency and segmentation accuracy. The BK approach benefits from training using a controllable degree of contextual data, rather than relying on external background data, and is efficient in terms of computation and speaker discrimination. Experiments on a subset of the standard ETAPE database show that the new approach outperforms the current state-of-the-art methods for speaker change detection and gives an average relative improvement in segment coverage and purity of 18.71% and 4.51% respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speaker Change Detection Using Binary Key Modelling with Contextual Information

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speech and multilingual natural language framework for speaker change detection and diarization
Or Haim Anidjar ... Itshak Lapidot
Expert Systems With Applications | VOL. 213
Or Haim Anidjar, et. al.Or Haim Anidjar ... Itshak Lapidot
11 Nov 2022
Expert Systems With Applications | VOL. 213

How Different Types of Linguistic Information Impact Voice Perception: Evidence From the Language-Familiarity Effect.
Keke Yu ... Linjun Zhang
Language and Speech | VOL. 66
Keke Yu, et. al.Keke Yu ... Linjun Zhang
21 Jan 2023
Language and Speech | VOL. 66

Compensation for inter-frame correlations in speaker diarization and recognition
Themos Stafylakis ... Patrick Kenny
-
Themos Stafylakis, et. al.Themos Stafylakis ... Patrick Kenny
01 May 2013
01 May 2013

Speaker Spotting: Automatic Telephony Surveillance for Homeland Security
V. Ramasubramanian
-
V. RamasubramanianV. Ramasubramanian
04 Oct 2011
04 Oct 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speaker Change Detection Using Binary Key Modelling with Contextual Information

Abstract

Talk to us

Similar Papers