The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech

S Araki,S Makino,T Nishikawa,R Mukai,H Saruwatari

doi:10.1109/tsa.2003.809193

Abstract

Despite several recent proposals to achieve blind source separation (BSS) for realistic acoustic signals, the separation performance is still not good enough. In particular, when the impulse responses are long, performance is highly limited. In this paper, we consider a two-input, two-output convolutive BSS problem. First, we show that it is not good to be constrained by the condition T>P, where T is the frame length of the DFT and P is the length of the room impulse responses. We show that there is an optimum frame size that is determined by the trade-off between maintaining the number of samples in each frequency bin to estimate statistics and covering the whole reverberation. We also clarify the reason for the poor performance of BSS in long reverberant environments, highlighting that the framework of BSS works as two sets of frequency-domain adaptive beamformers. Although BSS can reduce reverberant sounds to some extent like adaptive beamformers, they mainly remove the sounds from the jammer direction. This is the reason for the difficulty of BSS in reverberant environments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Speech and Audio Processing

Lead the way for us

Journal: IEEE Transactions on Speech and Audio Processing	Publication Date: Mar 1, 2003
Citations: 323

Similar Papers

Blind Source Separation of Convolutive Mixtures of Speech
Shoji Makino
-
Shoji MakinoShoji Makino
01 Jan 2003
01 Jan 2003

Blind source separation of convolutive mixtures
Shoji Makino
-
Shoji MakinoShoji Makino
17 Apr 2006
17 Apr 2006

Subband based blind source separation for convolutive mixtures of speech
S Araki ... S Makino
-
S Araki, et. al.S Araki ... S Makino
06 Apr 2003
06 Apr 2003

Fundamental limitation of frequency domain blind source separation for convolutive mixture of speech
S Araki ... H Saruwatari
-
S Araki, et. al.S Araki ... H Saruwatari
07 May 2001
07 May 2001

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Speech and Audio Processing