Approaches to Speaker Detection and Tracking in Conversational Speech

Robert B. Dunn, Douglas A. Reynolds, Thomas F. Quatieri

doi:10.1006/dspr.1999.0359

Abstract

Dunn, Robert B., Reynolds, Douglas A., and Quatieri, Thomas F., Approaches to Speaker Detection and Tracking in Conversational Speech, Digital Signal Processing 10 (2000), 93–112.

Two approaches to detecting and tracking speakers in multispeaker audio are described. Both approaches use an adapted Gaussian mixture model, universal background model (GMM-UBM) speaker detection system as the core speaker recognition engine. In one approach, the individual log-likelihood ratio scores, which are produced on a frame-by-frame basis by the GMM-UBM system, are used to first partition the speech file into speaker-homogeneous regions and then to create scores for these regions. We refer to this approach as internal segmentation. Another approach uses an external segmentation algorithm, based on blind clustering, to partition the speech file into speaker-homogeneous regions. The adapted GMM-UBM system then scores each of these regions as in the single-speaker recognition case. We show that the external segmentation system outperforms the internal segmentation system for both detection and tracking. In addition, we show how different components of the detection and tracking algorithms contribute to the overall system performance.
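The internal segmentation idea can be illustrated with a minimal sketch. The abstract does not say how the frame-level log-likelihood ratios are turned into regions, so the moving-average smoothing and fixed threshold used here are assumptions made for illustration only, not the authors' procedure. The function names (`gmm_log_likelihood`, `frame_llr`, `segment_by_llr`, `score_regions`) and the diagonal-covariance model layout are likewise hypothetical.

```python
import numpy as np


def gmm_log_likelihood(frames, weights, means, variances):
    """Per-frame log-likelihood under a diagonal-covariance GMM.

    frames: (T, D); weights: (M,); means and variances: (M, D).
    """
    diff = frames[:, None, :] - means[None, :, :]                      # (T, M, D)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (M,)
    log_comp = log_norm - 0.5 * np.sum(diff ** 2 / variances, axis=2)  # (T, M)
    weighted = np.log(weights) + log_comp
    # Log-sum-exp over mixture components, stabilised by the per-frame maximum.
    peak = weighted.max(axis=1, keepdims=True)
    return (peak + np.log(np.exp(weighted - peak).sum(axis=1, keepdims=True)))[:, 0]


def frame_llr(frames, speaker, ubm):
    """Frame-by-frame log-likelihood ratio: target speaker model vs. background model."""
    return gmm_log_likelihood(frames, *speaker) - gmm_log_likelihood(frames, *ubm)


def segment_by_llr(llr, window=5, threshold=0.0):
    """Assumed partitioning rule: smooth the LLR track, threshold it, and
    return contiguous runs as (start, end, is_target) with end exclusive."""
    kernel = np.ones(window) / window
    smoothed = np.convolve(llr, kernel, mode="same")
    labels = smoothed > threshold
    boundaries = np.flatnonzero(np.diff(labels.astype(int))) + 1
    starts = np.concatenate(([0], boundaries))
    ends = np.concatenate((boundaries, [len(llr)]))
    return [(int(s), int(e), bool(labels[s])) for s, e in zip(starts, ends)]


def score_regions(llr, regions):
    """Score each region by its mean frame-level LLR."""
    return [(s, e, float(llr[s:e].mean())) for s, e, _ in regions]
```

Each model is passed as a `(weights, means, variances)` tuple. In the external segmentation variant described above, only `score_regions` would apply, with the regions supplied by a separate clustering step instead of `segment_by_llr`.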

Journal: Digital Signal Processing
Publication Date: Jan 1, 2000
Citations: 93

Similar Papers

A GMM-based probabilistic sequence kernel for speaker verification
Kong-Aik Lee ... Changhuai You
27 Aug 2007

Cluster adaptive training weights as features in SVM-based speaker verification
Hao Yang ... Xianyu Zhao
27 Aug 2007

Text-independent speaker identification using GMM-UBM and frame level likelihood normalization
Rong Zheng ... Bo Xu
15 Dec 2004

Speaker recognition in noisy environments using auxiliary information and Bayesian networks
01 Jan 2006
