Speech enhancement algorithms using kalman filtering and masking properties of human auditory systems

Ning Ma

doi:10.20381/ruor-19658

Abstract

Speech enhancement algorithms have been employed successfully in many areas such as VoIP, automatic speech recognition and speaker verification. Many approaches are presented in the literature. This thesis focuses on enhancing single channel speech degraded by white noise or colored noise. A Kalman filter algorithm combined with the masking properties of human auditory systems is proposed. The threshold computed from the masking properties is used as a constraint in the Kalman filter to theoretically derive a modified Kalman filter. The derivation gives a theoretical foundation for the feasibility of combining masking properties with a Kalman filter. Some heuristic methods are also proposed for an easier implementation. One algorithm proposes to use the frequency domain masking level as a hard threshold to reshape the Kalman filtered signal. Another algorithm is to use a post-filter concatenated with the Kalman filter, using a threshold where both time-domain and frequency domain masking properties are taken into account. The goal of the masking is to make the energy of the estimate state error smaller than the threshold. To further decrease the computational cost, a wavelet Kalman filter combined with masking thresholds is also introduced. In the above algorithms, the speech model is assumed to be linear. Nonlinear speech models are also considered in the thesis. To address the nonlinear model problem, dual Extended Kalman Filter (EKF) and dual Unscented Kalman Filter (UKF) algorithms are studied. In these cases, both time-domain and frequency domain masking properties are taken into account. The simulation results show that all the proposed methods combining Kalman filter and masking properties can produce promising results from the point of view of PESQ scores. The average PESQ score gains obtained by these proposed methods are from about 0.35 to 0.45. Some informal subjective tests also show that the performance of the proposed methods is promising. No voice activity detection is required in the proposed methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech enhancement algorithms using kalman filtering and masking properties of human auditory systems

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

State of charge estimation of lithium battery based on Dual Adaptive Unscented Kalman Filter
Peng Zhang ... Shibao Dong
-
Peng Zhang, et. al.Peng Zhang ... Shibao Dong
01 Nov 2018
01 Nov 2018

Joint estimation of states and parameters of Hodgkin–Huxley neuronal model using Kalman filtering
M Lankarany ... M.N.S Swamy
Neurocomputing | VOL. 136
M Lankarany, et. al.M Lankarany ... M.N.S Swamy
25 Jan 2014
Neurocomputing | VOL. 136

Rotor Asymmetry Detection in Wound Rotor Induction Motor Using Kalman Filter Variants and Investigations on Their Robustness: An Experimental Implementation
Furzana John Basha ... Kumar Somasundaram
Machines | VOL. 11
Furzana John Basha, et. al.Furzana John Basha ... Kumar Somasundaram
14 Sep 2023
Machines | VOL. 11

Noise covariance estimation using dual estimation for disturbance storm time index application
Boonsri Kaewkham-Ai ... Kasemsak Uthaichana
-
Boonsri Kaewkham-Ai, et. al.Boonsri Kaewkham-Ai ... Kasemsak Uthaichana
01 Dec 2010
01 Dec 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech enhancement algorithms using kalman filtering and masking properties of human auditory systems

Abstract

Talk to us

Similar Papers