Abstract
Voice signals acquired by a microphone array often contain considerable noise and mutual interference, which seriously degrade the accuracy and speed of speech separation. Traditional beamforming is simple to implement, but its suppression of interfering sources is inadequate. In contrast, independent component analysis (ICA) can improve separation, but it requires an iterative and time-consuming process to calculate the separation matrix. As a supporting method, principal component analysis (PCA) helps reduce dimensionality, speed up computation, and discard false sound sources. Exploiting the sparsity of frequency components in a mixed signal, we propose an adaptive fast speech separation algorithm that uses multiple sound source localization as preprocessing to select between beamforming and frequency-domain ICA according to the mixing conditions of each frequency bin. First, a fast localization algorithm estimates the maximum number of components per frequency bin of the mixed speech signal to prevent the occurrence of false sound sources. Then, PCA reduces the dimension to adaptively adjust the weights of beamforming and ICA for speech separation. Finally, the ICA separation matrix is initialized from the sound source localization results, which notably reduces the iteration time and mitigates permutation ambiguity. Simulation and experimental results verify the effectiveness and speedup of the proposed algorithm.
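The per-bin source counting described above can be illustrated with a minimal sketch: eigenvalues of the spatial covariance matrix of one STFT frequency bin that fall far below the largest eigenvalue are treated as noise, which caps the number of components and discards false sound sources. This is not the paper's exact procedure; the threshold `ratio` and the covariance estimate are illustrative assumptions.

```python
import numpy as np

def count_sources_per_bin(X_bin, n_max, ratio=0.1):
    """Estimate the number of active sources in one STFT frequency bin.

    X_bin : (n_mics, n_frames) complex STFT coefficients of the bin.
    n_max : upper bound on the source count (e.g., from localization).
    ratio : eigenvalues below ratio * largest are treated as noise
            (an assumed heuristic threshold, not the paper's criterion).
    """
    # sample spatial covariance matrix of this frequency bin
    R = X_bin @ X_bin.conj().T / X_bin.shape[1]
    # eigenvalues of the Hermitian covariance, sorted descending
    eigvals = np.linalg.eigvalsh(R)[::-1]
    # count dominant eigenvalues, capped by the localization bound
    return min(n_max, int(np.sum(eigvals > ratio * eigvals[0])))
```

For a rank-one mixture (a single active source in the bin) the estimate collapses to one regardless of the microphone count, which is what prevents weak noise directions from being mistaken for sources.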
Highlights
Speech separation aims at the effective extraction of target speech and removal of noise and interference
An adaptive fast speech separation algorithm is proposed that uses multiple sound source localization as preprocessing and selects between beamforming and frequency-domain independent component analysis (ICA) according to the mixing conditions of each frequency bin
We propose an adaptive and fast speech separation algorithm based on ICA and beamforming
Summary
Speech separation aims at the effective extraction of target speech and the removal of noise and interference. Simple fixed beamforming performs poorly for speech separation in real environments, and because the steering vectors [5] of the sound sources are not orthogonal, the interference suppression of adaptive beamforming depends on accurate estimation of the propagation process [6]. Alterations, such as the use of masks, can improve interference removal, but they can degrade the target signal components [7]. To solve these problems, and exploiting the sparsity of frequency components in a mixed signal, we propose an adaptive speech separation algorithm that uses multiple sound source localization as preprocessing and selects either beamforming or frequency-domain ICA according to the characteristics of each frequency bin. The fourth section presents simulations and experiments, and conclusions are drawn in the fifth section.
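The adaptive per-bin selection can be sketched as follows: each frequency bin is classified by PCA of its spatial covariance; a bin with one dominant eigenvalue is handled by a simple beamformer, while a multi-source bin is whitened and passed to a frequency-domain ICA iteration. This is a minimal illustration, not the paper's implementation: the eigenvalue threshold, the natural-gradient update, the `tanh` nonlinearity, and the step size are all assumptions, and the identity initialization stands in for the localization-based initialization the paper uses to cut iterations.

```python
import numpy as np

def separate_bin(X, n_src_max, eig_ratio=0.1, W_init=None, n_iter=50, step=0.1):
    """Adaptively separate one STFT frequency bin (illustrative sketch).

    X         : (n_mics, n_frames) complex STFT coefficients of the bin.
    n_src_max : source-count bound from localization preprocessing.
    W_init    : optional ICA initialization (e.g., from steering vectors).
    Returns (separated signals, estimated source count).
    """
    # PCA: eigendecomposition of the spatial covariance, sorted descending
    R = X @ X.conj().T / X.shape[1]
    w, V = np.linalg.eigh(R)
    w, V = w[::-1], V[:, ::-1]
    n_src = min(n_src_max, int(np.sum(w > eig_ratio * w[0])))
    if n_src <= 1:
        # one dominant source: beamform along the principal eigenvector
        # (a stand-in for a localization-based steering vector)
        b = V[:, :1]
        return b.conj().T @ X, 1
    # whiten into the n_src-dimensional signal subspace
    Q = np.diag(1.0 / np.sqrt(w[:n_src])) @ V[:, :n_src].conj().T
    Z = Q @ X
    # frequency-domain ICA with natural-gradient updates
    W = np.eye(n_src, dtype=complex) if W_init is None else W_init
    for _ in range(n_iter):
        Y = W @ Z
        # complex nonlinearity: compress magnitude, keep phase
        G = np.tanh(np.abs(Y)) * np.exp(1j * np.angle(Y))
        dW = (np.eye(n_src) - (G @ Y.conj().T) / Y.shape[1]) @ W
        W = W + step * dW
    return W @ Z, n_src
```

Because single-source bins skip the ICA loop entirely, the iterative cost is paid only where the mixture is genuinely multi-source, which is the source of the speedup the abstract claims.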