Robust Recognition of Noisy Speech Through Partial Imputation of Missing Data

Kian Ebrahim Kafoori, Seyed Mohammad Ahadi

doi:10.1007/s00034-017-0616-4

Abstract

Missing-data approaches to robust speech recognition fall into two main categories: spectral imputation and classifier modification. In this paper, we introduce a novel technique that combines methods from these two categories while improving the accuracy of each. Methods from the two categories are rarely employed together because their structures are incompatible. Building on our previous work, we propose a technique that resolves this incompatibility. It is based on partial restoration of the log-spectrum: for each missing component, we decide whether to restore it or to estimate a possible range for it. We also propose a method to employ dynamic features more effectively. The combined techniques are a classic spectral imputation method and our previously proposed classifier modification technique, spectral variance learning. Experiments show that the proposed technique significantly improves the accuracy of both combined techniques, with gains in recognition accuracy of up to nearly four percent on Aurora 2.0 data and more than two percent on a noisy version of TIMIT data.
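The abstract does not give the decision rule, so the following is only a minimal sketch of what restoring some missing log-spectral components while keeping a range for others could look like. The function name `partial_impute`, the `Component` type, the variance threshold, and the `floor` value are all hypothetical. The upper bound on the range uses the common missing-data assumption that clean speech energy does not exceed the noisy observation; the paper's actual criterion may differ.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Component:
    """One log-spectral component after partial imputation (hypothetical type)."""
    value: Optional[float]                  # point value, if observed or restored
    bounds: Optional[Tuple[float, float]]   # (low, high) range, if left unrestored


def partial_impute(observed: List[float], reliable: List[bool],
                   estimate: List[float], estimate_var: List[float],
                   var_threshold: float = 1.0,
                   floor: float = -10.0) -> List[Component]:
    """Restore confident estimates; keep a bounded range for the rest.

    All arguments are per-component lists for a single frame. The threshold
    rule and the floor are illustrative assumptions, not the paper's criterion.
    """
    out = []
    for y, ok, x_hat, var in zip(observed, reliable, estimate, estimate_var):
        if ok:
            # Reliable component: use the observation directly.
            out.append(Component(value=y, bounds=None))
        elif var <= var_threshold:
            # Confident estimate: restore it, capped at the noisy observation
            # (assumes clean energy does not exceed the observed energy).
            out.append(Component(value=min(x_hat, y), bounds=None))
        else:
            # Uncertain estimate: keep only a possible range for the clean value.
            out.append(Component(value=None, bounds=(min(floor, y), y)))
    return out


frame = partial_impute(
    observed=[2.0, 5.0, 4.0],
    reliable=[True, False, False],
    estimate=[2.0, 3.5, 1.0],
    estimate_var=[0.0, 0.2, 4.0],
)
```

In this sketch a downstream classifier would score point values as usual and integrate over the range for components that carry only bounds, which is where a classifier modification method would take over.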

Journal: Circuits, Systems, and Signal Processing
Publication Date: Jul 31, 2017
Citations: 3

Similar Papers

Ensemble acoustic modeling in automatic speech recognition
Xin Chen
01 Jan 2010

Strong-sense class-dependent features for statistical recognition
M.K Omar ... M Hasegawa-Johnson
01 Jan 2003

Missing data mask models with global frequency and temporal constraints
Sébastien Demange ... Jean-Paul Haton
17 Sep 2006

Modeling long distance dependence in language: topic mixtures vs. dynamic cache models
R Iyer ... M Ostendorf
03 Oct 1996
