Speaker model selection based on the Bayesian information criterion applied to unsupervised speaker indexing

M Nishida,T Kawahara

doi:10.1109/tsa.2005.848890

M Nishida, T Kawahara

Open Access

https://doi.org/10.1109/tsa.2005.848890

Copy DOI

Abstract

In conventional speaker recognition tasks, the amount of training data is almost the same for each speaker, and the speaker model structure is uniform and specified manually according to the nature of the task and the available size of the training data. In real-world speech data such as telephone conversations and meetings, however, serious problems arise in applying a uniform model because variations in the utterance durations of speakers are large, with numerous short utterances. We therefore propose a flexible framework in which an optimal speaker model (GMM or VQ) is automatically selected based on the Bayesian Information Criterion (BIC) according to the amount of training data available. The framework makes it possible to use a discrete model when the data is sparse, and to seamlessly switch to a continuous model after a large amount of data is obtained. The proposed framework was implemented in unsupervised speaker indexing of a discussion audio. For a real discussion archive with a total duration of 10 hours, we demonstrate that the proposed method has higher indexing performance than that of conventional methods. The speaker index is also used to adapt a speaker-independent acoustic model to each participant for automatic transcription of the discussion. We demonstrate that speaker indexing with our method is sufficiently accurate for adaptation of the acoustic model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Speech and Audio Processing	Publication Date: Jul 1, 2005
Citations: 59	License type: other-oa

R Discovery Prime

R Discovery Prime

Speaker model selection based on the Bayesian information criterion applied to unsupervised speaker indexing

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Speech and Audio Processing

Lead the way for us

Similar Papers

Speaker indexing and adaptation using speaker clustering based on statistical model selection
M Nishida ... T Kawahara
-
M Nishida, et. al.M Nishida ... T Kawahara
17 May 2004
17 May 2004

A unified framework for domain independent online speaker indexing in eigen-voice space using an index tree of reference models
M H Moattar ... M M Homayounpour
International Journal of Speech Technology | VOL. 16
M H Moattar, et. al.M H Moattar ... M M Homayounpour
14 Feb 2013
International Journal of Speech Technology | VOL. 16

A Pitch-Based Rapid Speech Segmentation for Speaker Indexing
Min Yang ... Yingchun Yang
-
Min Yang, et. al. Min Yang ... Yingchun Yang
12 Dec 2005
12 Dec 2005

Robust Bootstrapping of Speaker Models for Unsupervised Speaker Indexing
Fu Zhonghua
-
Fu ZhonghuaFu Zhonghua
30 Jun 2007
30 Jun 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speaker model selection based on the Bayesian information criterion applied to unsupervised speaker indexing

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Speech and Audio Processing