Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation

Shaojun Wang Shaojun Wang,Yunxin Zhao Yunxin Zhao

doi:10.1109/89.943344

Abstract

This paper presents a new recursive Bayesian learning approach for transformation parameter estimation in speaker adaptation. Our goal is to incrementally transform or adapt a set of hidden Markov model (HMM) parameters for a new speaker and gain large performance improvement from a small amount of adaptation data. By constructing a clustering tree of HMM Gaussian mixture components, the linear regression (LR) or affine transformation parameters for HMM Gaussian mixture components are dynamically searched. An online Bayesian learning technique is proposed for recursive maximum a posteriori (MAP) estimation of LR and affine transformation parameters. This technique has the advantages of being able to accommodate flexible forms of transformation functions as well as a priori probability density functions (PDFs). To balance between model complexity and goodness of fit to adaptation data, a dynamic programming algorithm is developed for selecting models using a Bayesian variant of the "minimum description length" (MDL) principle. Speaker adaptation experiments with a 26-letter English alphabet vocabulary were conducted, and the results confirmed effectiveness of the online learning framework.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Speech and Audio Processing

Lead the way for us

Journal: IEEE Transactions on Speech and Audio Processing	Publication Date: Jan 1, 2001
Citations: 73

Similar Papers

Speaker adaptation using improved MAP estimation with small amount of adaptation data
Takuya Futagami ... Noboru Hayasaka
-
Takuya Futagami, et. al.Takuya Futagami ... Noboru Hayasaka
01 Oct 2013
01 Oct 2013

Adaptation of Hidden Markov Models Using Model-as-Matrix Representation
Yongwon Jeong
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 20
Yongwon JeongYongwon Jeong
01 Oct 2012
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 20

A maximum a posteriori approach to speaker adaptation using the trended hidden Markov model
R Chengalvarayan ... Li Deng
IEEE Transactions on Speech and Audio Processing | VOL. 9
R Chengalvarayan, et. al.R Chengalvarayan ... Li Deng
01 Jul 2001
IEEE Transactions on Speech and Audio Processing | VOL. 9

Speaker adaptation in the maximum a posteriori framework based on the probabilistic 2-mode analysis of training models
Yongwon Jeong
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2013
Yongwon JeongYongwon Jeong
11 Apr 2013
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Speech and Audio Processing