A unified framework for domain independent online speaker indexing in eigen-voice space using an index tree of reference models

M H Moattar, M M Homayounpour

doi:10.1007/s10772-013-9190-8

Abstract

Speaker indexing, referred to in the literature as speaker diarization, is an important task in audio indexing and retrieval. It comprises two important and usually separate stages, namely speaker segmentation and speaker clustering, and can be divided into online and offline categories. This paper focuses mainly on domain-independent online speaker indexing. For this purpose, the proposed framework should be parameter-free, requiring no application-specific parameters such as utterance duration or threshold settings. To reduce dependency on parameters, traditional speaker segmentation is reformulated as a voting-based homogeneous speech segmentation, in which several approaches are applied in parallel to decide whether a change point exists. In online indexing, data insufficiency is encountered at each time slice. In the proposed framework, a set of reference speaker models is used as side information to facilitate online tracking. To improve indexing accuracy, adaptation approaches in the eigen-voice decomposition space are proposed. To reduce the computational cost of tracking, an index structure over the reference models is proposed to speed up the search in the model space. The proposed framework is evaluated on the 2002 Rich Transcription Broadcast News and Conversational Telephone Speech Corpus (Garofolo, NIST Rich Transcription, 2002) as well as a synthetic dataset. The indexing errors of the proposed framework on telephone conversations, broadcast news, and the synthetic dataset are 7.51 %, 6.36 %, and 9.34 %, respectively. In addition, the index tree structure improves the tracking run time of the proposed framework by 32 %.
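
The voting idea in the abstract, where several approaches run in parallel and jointly decide whether a change point exists, can be illustrated with a minimal sketch. The abstract does not say which detectors the authors combine or how votes are counted, so everything below is an assumption: the three toy detectors, their fixed illustrative constants, the simple majority rule, and all function names are hypothetical stand-ins, not the paper's method.

```python
from typing import Callable, Sequence

# Hypothetical interface: a detector compares the feature values on each side
# of a candidate boundary and returns True if it believes the speaker changed.
Detector = Callable[[Sequence[float], Sequence[float]], bool]


def _mean(xs: Sequence[float]) -> float:
    return sum(xs) / len(xs)


def _variance(xs: Sequence[float]) -> float:
    m = _mean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)


def mean_shift_detector(left: Sequence[float], right: Sequence[float]) -> bool:
    """Toy detector: the window means differ by more than an illustrative constant."""
    return abs(_mean(left) - _mean(right)) > 1.0


def variance_ratio_detector(left: Sequence[float], right: Sequence[float]) -> bool:
    """Toy detector: one window is much more spread out than the other."""
    var_l = max(_variance(left), 1e-12)  # guard against division by zero
    var_r = max(_variance(right), 1e-12)
    return max(var_l / var_r, var_r / var_l) > 4.0


def disjoint_range_detector(left: Sequence[float], right: Sequence[float]) -> bool:
    """Toy detector: the value ranges of the two windows do not overlap."""
    return max(left) < min(right) or max(right) < min(left)


def vote_change_point(
    left: Sequence[float],
    right: Sequence[float],
    detectors: Sequence[Detector],
) -> bool:
    """Declare a change point when a strict majority of detectors agree."""
    votes = sum(1 for detect in detectors if detect(left, right))
    return 2 * votes > len(detectors)


DETECTORS = [mean_shift_detector, variance_ratio_detector, disjoint_range_detector]

# Two windows with clearly different statistics, and two that look alike.
different = vote_change_point([0.0, 0.1, -0.1, 0.0], [5.0, 5.1, 4.9, 5.0], DETECTORS)
similar = vote_change_point([0.0, 0.1, -0.1, 0.0], [0.05, -0.05, 0.1, 0.0], DETECTORS)
```

Combining detectors this way means no single detector's setting decides the outcome alone, which is one plausible reading of how voting reduces dependence on parameters. The toy detectors here still carry fixed constants and are only placeholders for whatever parameter-free tests the framework actually uses.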

More From: International Journal of Speech Technology

Similar Papers

A review on speaker diarization systems and approaches
M.H Moattar ... M.M Homayounpour
Speech Communication | VOL. 54
05 Jun 2012

The ELISA consortium approaches in broadcast news speaker segmentation during the NIST 2003 rich transcription evaluation
D Moraru ... L Besacier
13 Jan 2017

Robust Unsupervised Speaker Segmentation for Audio Diarization
Hachem Kadri ... Manuel Davy
01 Mar 2010

Novel Approaches to Speaker Clustering for Speaker Diarization in Audio Broadcast News Data
Janez Žibert ... France Mihelič
01 Nov 2008
