Combination of nested microphone array and subband processing for multiple simultaneous speaker localization

Ali Dehghan Firoozabadi,Hamid Reza Abutalebi

doi:10.1109/istel.2012.6483115

Abstract

Speaker localization is one of the active topics in speech processing field. In this paper, we use a two-step method based on Time Difference Of Arrival (TDOA) for the localization of multiple simultaneous speech sources. In this method, directions of speakers are estimated by computing Generalized Cross Correlation (GCC) between microphone signals. In this paper, we propose a method based on combination of subband processing and nested microphone arrays. The use of subband processing is effective in increasing accuracy of multiple speaker localization. Also, the nested array can remove spatial aliasing by intelligent selection of some microphone subsets and assigning them to different subbands. When microphones of each subband were determined, subband processing is just applied on the data from that microphone subset. Moreover, targeting the high-noise environmental conditions, we use the GCC-Maximum Likelihood (GCC-ML) as the localization core of the proposed method. The combination of these all leads to omitting spatial aliasing and increasing the localization accuracy. Simulation results on different environmental scenarios validate the superior performance of the proposed method in the localization of multiple simultaneous speakers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Combination of nested microphone array and subband processing for multiple simultaneous speaker localization

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Localization of multiple simultaneous speakers by combining the information from different subbands
Ali Dehghan Firoozabadi ... Hamid Reza Abutalebi
-
Ali Dehghan Firoozabadi, et. al.Ali Dehghan Firoozabadi ... Hamid Reza Abutalebi
01 May 2013
01 May 2013

3D Localization of Multiple Simultaneous Speakers with Discrete Wavelet Transform and Proposed 3D Nested Microphone Array
Ali Dehghan Firoozabadi ... Ismael Soto
-
Ali Dehghan Firoozabadi, et. al.Ali Dehghan Firoozabadi ... Ismael Soto
01 Sep 2018
01 Sep 2018

Modified State Coherence Transform to reduce spatial aliasing in TDOA estimation of multiple sound sources
Mehdi Azadi ... Hamid Reza Abutalebi
-
Mehdi Azadi, et. al.Mehdi Azadi ... Hamid Reza Abutalebi
01 Sep 2014
01 Sep 2014

Performance of speaker localization using microphone array
R Visalakshi ... P Dhanalakshmi
International Journal of Speech Technology | VOL. 19
R Visalakshi, et. al.R Visalakshi ... P Dhanalakshmi
25 Apr 2016
International Journal of Speech Technology | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combination of nested microphone array and subband processing for multiple simultaneous speaker localization

Abstract

Talk to us

Similar Papers