FORWARD MASKING THRESHOLD ESTIMATION USING NEURAL NETWORKS AND ITS APPLICATION TO PARALLEL SPEECH ENHANCEMENT

T S Gunawan,O O Khalifa,E Ambikairajah

doi:10.31436/iiumej.v11i1.41

Abstract

Forward masking models have been used successfully in speech enhancement and audio coding. Presently, forward masking thresholds are estimated using simplified masking models which have been used for audio coding and speech enhancement applications. In this paper, an accurate approximation of forward masking threshold estimation using neural networks is proposed. A performance comparison to the other existing masking models in speech enhancement application is presented. Objective measures using PESQ demonstrates that our proposed forward masking model, provides significant improvements (5-15 %) over four existing models, when tested with speech signals corrupted by various noises at very low signal to noise ratios. Moreover, a parallel implementation of the speech enhancement algorithm was developed using Matlab parallel computing toolbox.

Highlights

Forward masking is a time domain phenomenon in which a masker precedes the signal in time
Functional models of the forward masking effect of the human auditory system have recently been used with success in speech and audio coding to provide more efficient signal compression [2, 3]
The masking threshold to noise ratio (MNR) in each subband can be calculated by using the ratio of a short-term average forward masking threshold, Pm n, and an estimate of the noise floor level, Qm n as given in Eqn (8)

Summary

INTRODUCTION

Forward masking is a time domain phenomenon in which a masker precedes the signal in time. Further refinement of the model requires software that can do curve-fitting of multi-dimensional data. For this purpose, we utilise neural network to better approximate forward masking threshold. Speech enhancement algorithm exploiting temporal masking properties of human auditory system has a very high computation requirement, especially when the noisy speech signal is long or the number of subbands is high. Recent advances in multi-core system make it a natural choice and viable option for solving high computation requirements of the speech enhancement algorithm. The objective of this paper is two-folds: to evaluate the performance of our forward masking model in terms of enhanced speech quality and to implement and evaluate parallel speech enhancement algorithm on a multi-core system.

FORWARD MASKING MODELS USING NEURAL NETWORKS

SPEECH ENHANCEMENT

PARALLEL SPEECH ENHANCEMENT ALGORITHM

Subjective and Objective Quality

Parallel Performance

CONCLUSIONS

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IIUM Engineering Journal	Publication Date: May 26, 2010
Citations: 9	License type: cc-by

R Discovery Prime

R Discovery Prime

FORWARD MASKING THRESHOLD ESTIMATION USING NEURAL NETWORKS AND ITS APPLICATION TO PARALLEL SPEECH ENHANCEMENT

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IIUM Engineering Journal

Lead the way for us

Similar Papers

The development of a forward masking model using neural networks and its application to speech enhancement
Teddy Surya Gunawan ... Othman O Khalifa
-
Teddy Surya Gunawan, et. al.Teddy Surya Gunawan ... Othman O Khalifa
01 May 2008
01 May 2008

Suitability of speech quality evaluation measures in speech enhancement
Zhang Jie ... Jingyun Xu
-
Zhang Jie, et. al.Zhang Jie ... Jingyun Xu
01 Jul 2014
01 Jul 2014

A new regularized forward blind source separation algorithm for automatic speech quality enhancement
Meriem Zoulikha ... Mohamed Djendi
Applied Acoustics | VOL. 112
Meriem Zoulikha, et. al.Meriem Zoulikha ... Mohamed Djendi
03 Jun 2016
Applied Acoustics | VOL. 112

A two-sensor Gauss–Seidel fast affine projection algorithm for speech enhancement and acoustic noise reduction
Ilyes Darazirar ... Mohamed Djendi
Applied Acoustics | VOL. 96
Ilyes Darazirar, et. al.Ilyes Darazirar ... Mohamed Djendi
30 Mar 2015
Applied Acoustics | VOL. 96

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FORWARD MASKING THRESHOLD ESTIMATION USING NEURAL NETWORKS AND ITS APPLICATION TO PARALLEL SPEECH ENHANCEMENT

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IIUM Engineering Journal