Abstract

This paper presents a basis-based speaker adaptation method that includes approaches using principal component analysis (PCA) and two-dimensional PCA (2DPCA). The proposed method partitions the hidden Markov model (HMM) mean vectors of the training models into subvectors of smaller dimension. Consequently, the dimension of the sample covariance matrix computed from the partitioned HMM mean vectors varies with the dimension of the subvectors. Basis vectors are constructed from the eigen-decomposition of this sample covariance matrix. Thus, the dimension of the basis vectors varies according to the dimension of the sample covariance matrix, and the proposed method includes the PCA- and 2DPCA-based approaches. We present the adaptation equation in both the maximum likelihood (ML) and maximum a posteriori (MAP) frameworks. We perform continuous speech recognition experiments using the Wall Street Journal (WSJ) corpus. The results show that the model with basis vectors whose dimensions lie between those of the PCA- and 2DPCA-based approaches gives good overall performance. The proposed approach in the MAP framework shows additional performance improvement over the ML counterpart when the number of adaptation parameters is large but the amount of available adaptation data is small. Furthermore, the approach in the MAP framework is less sensitive to the choice of model order than its ML counterpart.
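The basis construction described above can be illustrated with a minimal numpy sketch: partition each training model's HMM mean vector into subvectors of a chosen dimension, pool the subvectors to estimate a sample covariance matrix, and take its leading eigenvectors as the basis. All names and the synthetic data below are illustrative assumptions, not definitions from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

n_speakers = 50     # number of training speaker models (assumed)
D = 120             # dimension of each stacked HMM mean vector (assumed)
d = 24              # chosen subvector dimension; must divide D

# One HMM mean (super)vector per training speaker model; random data
# stands in for the actual training models.
means = rng.standard_normal((n_speakers, D))

# Partition every mean vector into D/d subvectors of dimension d and
# pool all subvectors as samples for the covariance estimate.
subvectors = means.reshape(n_speakers * (D // d), d)
subvectors -= subvectors.mean(axis=0)                 # center the samples
cov = subvectors.T @ subvectors / len(subvectors)     # d x d sample covariance

# Eigen-decomposition of the sample covariance yields the basis vectors;
# their dimension d interpolates between the 2DPCA-like case (small d)
# and the PCA case (d == D, i.e. one subvector per mean vector).
eigvals, eigvecs = np.linalg.eigh(cov)
order = np.argsort(eigvals)[::-1]
basis = eigvecs[:, order[:10]]                        # keep top-10 basis vectors
print(basis.shape)                                    # (d, 10)
```

Choosing d = D reduces the sketch to ordinary PCA over whole mean vectors, while a small d corresponds to the 2DPCA-style end of the spectrum; intermediate values give the in-between bases whose performance the paper evaluates.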
