Factor Analyzed Subspace Modeling and Selection

Jen-Tzung Chien,Chuan-Wei Ting

doi:10.1109/tasl.2007.910790

Abstract

We present a novel subspace modeling and selection approach for noisy speech recognition. In subspace modeling, we develop a factor analysis (FA) representation of noisy speech, which is a generalization of a signal subspace (SS) representation. Using FA, noisy speech is represented by the extracted common factors, factor loading matrix, and specific factors. The observation space of noisy speech is accordingly partitioned into a principal subspace, containing speech and noise, and a minor subspace, containing residual speech and residual noise. We minimize the energies of speech distortion in the principal subspace as well as in the minor subspace so as to estimate clean speech with residual information. Importantly, we explore the optimal subspace selection via solving the hypothesis test problems. We test the equivalence of eigenvalues in the minor subspace to select the subspace dimension. To fulfill the FA spirit, we also examine the hypothesis of uncorrelated specific factors/residual speech. The subspace can be partitioned according to a consistent confidence towards rejecting the null hypothesis. Optimal solutions are realized through the likelihood ratio tests, which arrive at the approximated chi-square distributions as test statistics. In the experiments on the Aurora2 database, the FA model significantly outperforms the SS model for speech enhancement and recognition. Subspace selection via testing the correlation of residual speech achieves higher recognition accuracies than that of testing the equivalent eigenvalues in the minor subspace.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Factor Analyzed Subspace Modeling and Selection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2008
Citations: 35

Similar Papers

A Dual Purpose Principal and Minor Subspace Gradient Flow
Xiangyu Kong ... Chongzhao Han
IEEE Transactions on Signal Processing | VOL. 60
Xiangyu Kong, et. al.Xiangyu Kong ... Chongzhao Han
01 Jan 2012
IEEE Transactions on Signal Processing | VOL. 60

A unified self-stabilizing neural network algorithm for principal and minor components extraction.
Xiangyu Kong ... Hongguang Ma
IEEE transactions on neural networks and learning systems | VOL. 23
Xiangyu Kong, et. al. Xiangyu Kong ... Hongguang Ma
01 Feb 2012
IEEE transactions on neural networks and learning systems | VOL. 23

Robust emotion recognition in noisy speech via sparse representation
Xiaoming Zhao ... Shiqing Zhang
Neural Computing and Applications | VOL. 24
Xiaoming Zhao, et. al.Xiaoming Zhao ... Shiqing Zhang
29 Mar 2013
Neural Computing and Applications | VOL. 24

Merging Subspace Models for Face Recognition
Władysław Skarbek
-
Władysław SkarbekWładysław Skarbek
01 Jan 2003
01 Jan 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Factor Analyzed Subspace Modeling and Selection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing