Online speaker diarization using adapted i-vector transforms

Weizhong Zhu,Jason Pelecanos

doi:10.1109/icassp.2016.7472638

Abstract

Many speaker diarization systems operate in an off-line mode. Such systems typically find homogeneous segments and then cluster these segments according to speaker. Such algorithms, like bottom-up clustering, k-means or spectral clustering, generally require the registration of all segments before clustering can begin. However, for real-time applications such as with multi-person voice interactive systems, there is a need to perform online speaker assignment in a strict left-to-right fashion. In this paper we propose a novel Maximum a Posteriori (MAP) adapted transform within an i-vector speaker diarization framework, that operates in a strict left-to-right fashion. Previous work by the community has shown that the principal components of variation of fixed dimensional i-vectors learned across segments tend to indicate a strong basis by which to separate speakers. However, determining this basis can be problematic when there are few segments or when operating in an online manner. The proposed method blends the prior with the estimated subspace as more i-vectors are observed. Given oracle SAD segments, with adaptation we achieve 3.2% speaker diarization error for a strict left-to-right constraint on the LDC Callhome English Corpus compared to 4.8% without adaptation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Online speaker diarization using adapted i-vector transforms

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A review on speaker diarization systems and approaches
M.H Moattar ... M.M Homayounpour
Speech Communication | VOL. 54
M.H Moattar, et. al.M.H Moattar ... M.M Homayounpour
05 Jun 2012
Speech Communication | VOL. 54

Speaker diarization and detection system using a priori speaker information
Ouassila Kenai ... Salim Djeghiour
-
Ouassila Kenai, et. al.Ouassila Kenai ... Salim Djeghiour
01 Apr 2018
01 Apr 2018

A hybrid approach to online speaker diarization
Carlos Vaquero ... Oriol Vinyals
-
Carlos Vaquero, et. al.Carlos Vaquero ... Oriol Vinyals
26 Sep 2010
26 Sep 2010

Making Speaker Diarization System Noise Tolerant
Davit S Karamyan ... Saten A Harutyunyan
Mathematical Problems of Computer Science | VOL. 59
Davit S Karamyan, et. al.Davit S Karamyan ... Saten A Harutyunyan
31 May 2023
Mathematical Problems of Computer Science | VOL. 59

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Online speaker diarization using adapted i-vector transforms

Abstract

Talk to us

Similar Papers