Vector Quantization Based Classification and Maximum Likelihood Decoding for Speaker Recognition†

Nam Phamdo, Tai Hong Lee, Nariman Farvardin

doi:10.1007/978-3-642-57745-1_72

Abstract

In a vector quantization (VQ) based speaker recognition system, a set of feature parameters useful for speaker characterization (such as short-term spectral parameters, pitch, and gain) is first selected, and a vector quantizer is then designed for each speaker. Each vector quantizer is used to separately encode the sequence of feature vectors generated by an unknown speaker, and the speaker whose associated codebook yields the smallest cumulative distortion is selected. Previous studies [1]-[4] have shown that not all speech segments are equally effective for speaker recognition. Broad phonetic classes (such as vowels and fricatives) and explicit phoneme classification schemes were proposed in [3], [4]. In these schemes, speech segments were classified into various phonetic categories, and a weighted distortion measure based on these categories was used to identify the unknown speaker. In Section 72.2, we describe a classification scheme based on VQ.
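The minimum-cumulative-distortion decision described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the squared-error distortion, the toy two-dimensional codebooks, the optional per-frame `weights` argument, and all function names are assumptions made for illustration, and codebook design (training) is not shown.

```python
from typing import Dict, Optional, Sequence

Vector = Sequence[float]


def squared_error(x: Vector, c: Vector) -> float:
    """Squared Euclidean distance (an assumed distortion measure)."""
    return sum((a - b) ** 2 for a, b in zip(x, c))


def cumulative_distortion(
    features: Sequence[Vector],
    codebook: Sequence[Vector],
    weights: Optional[Sequence[float]] = None,
) -> float:
    """Encode every feature vector with its nearest codeword and sum the errors.

    `weights` is a hypothetical per-frame weight (for example, one derived
    from a phonetic category); it defaults to 1.0 for every frame.
    """
    if weights is None:
        weights = [1.0] * len(features)
    return sum(
        w * min(squared_error(x, c) for c in codebook)
        for x, w in zip(features, weights)
    )


def identify_speaker(
    features: Sequence[Vector],
    codebooks: Dict[str, Sequence[Vector]],
) -> str:
    """Return the speaker whose codebook gives the smallest cumulative distortion."""
    return min(
        codebooks,
        key=lambda name: cumulative_distortion(features, codebooks[name]),
    )


# Toy codebooks, one per enrolled speaker (values are made up).
codebooks = {
    "speaker_a": [(0.0, 0.0), (1.0, 1.0)],
    "speaker_b": [(5.0, 5.0), (6.0, 4.0)],
}

# Feature vectors from an unknown speaker (values are made up).
unknown = [(0.1, -0.1), (0.9, 1.2), (0.2, 0.1)]

print(identify_speaker(unknown, codebooks))
```

With these toy values the unknown vectors lie close to the codewords of `speaker_a`, so that label is selected. Supplying non-uniform `weights` gives a rough picture of how a weighted distortion measure could emphasize some speech segments over others.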
