Ensemble environment modeling using affine transform group

Yu Tsao,Payton Lin,Ting-Yao Hu,Xugang Lu

doi:10.1016/j.specom.2014.12.007

Abstract

The ensemble speaker and speaking environment modeling (ESSEM) framework was designed to provide online optimization for enhancing workable systems under real-world conditions. In the ESSEM framework, ensemble models are built in the offline phase to characterize specific environments based on local statistics prepared from those particular conditions. In the online phase, a mapping function is computed based on the incoming testing data to perform model adaptation. Previous studies utilized linear combination (LC) and linear combination with a correction bias (LCB) as simple mapping functions that only apply one weighting coefficient on each model. In order to better utilize the ensemble models, this study presents a generalized affine transform group (ATG) mapping function for the ESSEM framework. Although ATG characterizes unknown testing conditions more precisely using a larger amount of parameters, over-fitting issues occur when the available adaptation data is especially limited. This study handles over-fitting issues with three optimization processes: maximum a posteriori (MAP) criterion, model selection (MS), and cohort selection (CS). Experimental results showed that ATG along with the three optimization processes enabled the ESSEM framework to allow unsupervised model adaptation using only one utterance to provide consistent performance improvements.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ensemble environment modeling using affine transform group

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Jan 8, 2015
Citations: 4

Similar Papers

A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling
Yu Tsao ... Chin-Hui Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
Yu Tsao, et. al.Yu Tsao ... Chin-Hui Lee
01 Feb 2014
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22

MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling
Yu Tsao ... Chin-Hui Lee
-
Yu Tsao, et. al.Yu Tsao ... Chin-Hui Lee
01 Dec 2009
01 Dec 2009

An efficient hybrid direct-vague fuzzy moves system using fuzzy-rules-based precise rules
Yan-Qing Zhang ... Abraham Kandel
Expert Systems With Applications | VOL. 13
Yan-Qing Zhang, et. al.Yan-Qing Zhang ... Abraham Kandel
01 Oct 1997
Expert Systems With Applications | VOL. 13

Using bias correction and ensemble modelling for predictive mapping and related uncertainty: A case study in digital soil mapping
Jean-Daniel Sylvain ... Évelyne Thiffault
Geoderma | VOL. 403
Jean-Daniel Sylvain, et. al.Jean-Daniel Sylvain ... Évelyne Thiffault
26 May 2021
Geoderma | VOL. 403

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ensemble environment modeling using affine transform group

Abstract

Talk to us

Similar Papers

More From: Speech Communication