Multiresolution Speech Enhancement Based on Proposed Circular Nested Microphone Array in Combination with Sub-Band Affine Projection Algorithm

Ali Dehghan Firoozabadi,Pablo Irarrazaval,David Zabala-Blanco,Pablo Palacios-Játiva,Miguel Sanhueza,Pablo Adasme,Cesar Azurdia-Meza,Hugo Durney

doi:10.3390/app10113955

Abstract

Speech enhancement is one of the most important fields in audio and speech signal processing. The speech enhancement methods are divided into the single and multi-channel algorithms. The multi-channel methods increase the speech enhancement performance by providing more information with the use of more microphones. In addition, spatial aliasing is one of the destructive factors in speech enhancement strategies. In this article, we first propose a uniform circular nested microphone array (CNMA) for data recording. The microphone array increases the accuracy of the speech processing methods by increasing the information. Moreover, the proposed nested structure eliminates the spatial aliasing between microphone signals. The circular shape in the proposed nested microphone array implements the speech enhancement algorithm with the same probability for the speakers in all directions. In addition, the speech signal information is different in frequency bands, where the sub-band processing is proposed by the use of the analysis filter bank. The frequency resolution is increased in low frequency components by implementing the proposed filter bank. Then, the affine projection algorithm (APA) is implemented as an adaptive filter on sub-bands that were obtained by the proposed nested microphone array and analysis filter bank. This algorithm adaptively enhances the noisy speech signal. Next, the synthesis filters are implemented for reconstructing the enhanced speech signal. The proposed circular nested microphone array in combination with the sub-band affine projection algorithm (CNMA-SBAPA) is compared with the least mean square (LMS), recursive least square (RLS), traditional APA, distributed multichannel Wiener filter (DB-MWF), and multichannel nonnegative matrix factorization-minimum variance distortionless response (MNMF-MVDR) in terms of the segmental signal-to-noise ratio (SegSNR), perceptual evaluation of speech quality (PESQ), mean opinion score (MOS), short-time objective intelligibility (STOI), and speed of convergence on real and simulated data for white and colored noises. In all scenarios, the proposed method has high accuracy at different levels and noise types by the lower distortion in comparison with other works and, furthermore, the speed of convergence is higher than the compared researches.

Highlights

In the current century, the smartphones and other communication devices have been an important part of human life, where it is impossible to have social communications without them [1,2]
The proposed system with sub-band affine projection algorithm (APA) is compared by the quantitative, qualitative (PESQ, mean opinion score (MOS), and short-time objective intelligibility (STOI)) criteria, and speed of convergence with the least mean square (LMS), traditional APA, recursive least square (RLS), distributed multichannel Wiener filter (DB-MWF), and multichannel nonnegative matrix factorization-minimum variance distortionless response (MNMF-MVDR) algorithms on real and simulated data under white and colored noisy conditions
A multi-channel speech enhancement method was proposed based on the microphone array

Summary

Introduction

The smartphones and other communication devices have been an important part of human life, where it is impossible to have social communications without them [1,2]. A multi-channel speech enhancement method is introduced based on the proposed circular nested microphone array in combination with the sub-band affine projection algorithm (CNMA-SBAPA). The affine projection algorithm (APA), as an adaptive method for the speech enhancement, is implemented on sub-band signals from the circular nested microphone array (NMA). The proposed system with sub-band APA is compared by the quantitative (segmental SNR), qualitative (PESQ, MOS, and STOI) criteria, and speed of convergence with the least mean square (LMS), traditional APA, recursive least square (RLS), distributed multichannel Wiener filter (DB-MWF), and multichannel nonnegative matrix factorization-minimum variance distortionless response (MNMF-MVDR) algorithms on real and simulated data under white and colored noisy conditions.

The Microphone Model and Proposed Nested Microphone Array

Microphone Signal Model

The Proposed Uniform Circular Nested Microphone Array

The Proposed Multiresolution Sub-band-APA for the Speech Enhancement

Results and Discussion

Figure

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Jun 6, 2020
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Multiresolution Speech Enhancement Based on Proposed Circular Nested Microphone Array in Combination with Sub-Band Affine Projection Algorithm

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Analysis of statistical estimators and neural network approaches for speech enhancement
Ravi Kumar Kandagatla ... Rajeswari K
SciEnggJ | VOL. 17
Ravi Kumar Kandagatla, et. al.Ravi Kumar Kandagatla ... Rajeswari K
13 Feb 2024
SciEnggJ | VOL. 17

Performance analysis of neural network, NMF and statistical approaches for speech enhancement
Ravi Kumar Kandagatla ... Venkata Subbaiah Potluri
International Journal of Speech Technology | VOL. 23
Ravi Kumar Kandagatla, et. al.Ravi Kumar Kandagatla ... Venkata Subbaiah Potluri
17 Sep 2020
International Journal of Speech Technology | VOL. 23

New research on monaural speech segregation based on quality assessment
Xiaoping Xie ... Fei Ding
Computer Speech & Language | VOL. 85
Xiaoping Xie, et. al.Xiaoping Xie ... Fei Ding
05 Dec 2023
Computer Speech & Language | VOL. 85

Speech Enhancement based on Deep Convolutional Neural Network
Ramesh Nuthakki ... Yukta T N
-
Ramesh Nuthakki, et. al.Ramesh Nuthakki ... Yukta T N
11 Nov 2021
11 Nov 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiresolution Speech Enhancement Based on Proposed Circular Nested Microphone Array in Combination with Sub-Band Affine Projection Algorithm

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences