Unsupervised speaker adaptation based on hierarchical spectral clustering

S Furui

doi:10.1109/29.45538

Abstract

The author proposes an automatic speaker adaptation algorithm for speech recognition, in which a small amount of training material of unspecified text can be used. The algorithm is easily applied to vector-quantization- (VQ) speech recognition systems consisting of a VQ codebook and a word dictionary in which each word is represented as a sequence of codebook entries. In the adaptation algorithm, the VQ codebook is modified for each new speaker, whereas the word dictionary is universally used for all speakers. The important feature of this algorithm is that a set of spectra in training frames and the codebook entries are clustered hierarchically. Based on the vectors representing deviation between centroids of the training frame clusters and the corresponding codebook clusters, adaptation is performed hierarchically from small to large numbers of clusters. The spectral resolution of the adaptation process is improved accordingly. Results of recognition experiments using utterances of 100 Japanese city names show that adaptation reduces the mean word recognition error rate from 4.9 to 2.9%. Since the error rate for speaker-dependent recognition is 2.2%, the adaptation method is highly effective. >

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Unsupervised speaker adaptation based on hierarchical spectral clustering

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Acoustics, Speech, and Signal Processing

Lead the way for us

Journal: IEEE Transactions on Acoustics, Speech, and Signal Processing	Publication Date: Jan 1, 1989
Citations: 41

Similar Papers

Unsupervised speaker adaptation method based on hierarchical spectral clustering
S Furui
-
S FuruiS Furui
23 May 1989
23 May 1989

MSVQ-based speaker-adaptive Chinese syllable recognition based on discriminative training
Liang Zhou ...
-
Liang Zhou, et. al.Liang Zhou ...
01 Nov 1997
01 Nov 1997

Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives
Petr Cerva ... Ladislav Seps
Speech Communication | VOL. 55
Petr Cerva, et. al.Petr Cerva ... Ladislav Seps
08 Jul 2013
Speech Communication | VOL. 55

Fast Adaptation of Deep Neural Network Based on Discriminant Codes for Speech Recognition
Shaofei Xue ... Lirong Dai
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
Shaofei Xue, et. al. Shaofei Xue ... Lirong Dai
01 Dec 2014
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unsupervised speaker adaptation based on hierarchical spectral clustering

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Acoustics, Speech, and Signal Processing