Abstract

Multiple simultaneous sound source localization (SSL) is one of the most important applications in the speech signal processing. The one-step algorithms with the advantage of low computational complexity (and low accuracy), and the two-step methods with high accuracy (and high computational complexity) are proposed for multiple SSL. In this article, a combination of one-step-based method based on the generalized eigenvalue decomposition (GEVD), and a two-step-based method based on the adaptive generalized cross-correlation (GCC) by using the phase transform/maximum likelihood (PHAT/ML) filters along with a novel T-shaped circular distributed microphone array (TCDMA) is proposed for 3D multiple simultaneous SSL. In addition, the low computational complexity advantage of the GCC algorithm is considered in combination with the high accuracy of the GEVD method by using the distributed microphone array to eliminate spatial aliasing and thus obtain more appropriate information. The proposed T-shaped circular distributed microphone array-based adaptive GEVD and GCC-PHAT/ML algorithms (TCDMA-AGGPM) is compared with hierarchical grid refinement (HiGRID), temporal extension of multiple response model of sparse Bayesian learning with spherical harmonic (SH) extension (SH-TMSBL), sound field morphological component analysis (SF-MCA), and time-frequency mixture weight Bayesian nonparametric acoustical holography beamforming (TF-MW-BNP-AHB) methods based on the mean absolute estimation error (MAEE) criteria in noisy and reverberant environments on simulated and real data. The superiority of the proposed method is presented by showing the high accuracy and low computational complexity for 3D multiple simultaneous SSL.

Highlights

  • In recent years, the analysis of smart meeting room activities has been an important area in the acoustic signal processing, where the sound source localization (SSL) is one of these applications

  • Most of the methods have higher accuracy in high signal-to-noise ratio (SNR), but the proposed method with averaged mean absolute estimation error (MAEE) 31 cm even works better in comparison with TF-mixture weight (MW)-Bayesian nonparametric (BNP)-acoustical holography beamforming (AHB) algorithm with 35 cm in SNR = 20 dB. This figure clearly shows that the accuracy of all methods decreases in low SNRs, and the simulated data has better results in comparison with real data. These results show the superiority of the proposed T-shaped circular distributed microphone array (TCDMA)-AGGPM algorithm in comparison with other previous works

  • A novel 3D multiple simultaneous SSL algorithm was proposed based on the T-shaped circular distributed microphone array (DMA) in combination with generalized eigenvalue decomposition (GEVD) and adaptive generalized cross-correlation (GCC)-phase transform/maximum likelihood (PHAT/maximum likelihood (ML)) methods for noisy and reverberant environments

Read more

Summary

Introduction

The analysis of smart meeting room activities has been an important area in the acoustic signal processing, where the sound source localization (SSL) is one of these applications. The circular microphone arrays are widely considered in multi-speaker applications due to the flexibility in speech signal analysis, but the accuracy of the SSL algorithms is strongly dependent to the physical properties of the microphones, the level of the noise-reverberation, and the number of speakers. To address this problem, Kunkun et al in 2021 presented an indoor multiple SSL algorithm based on an acoustical holography beamforming (AHB) and Bayesian nonparametric (BNP) methods [25].

Distributed Microphone Array
The Proposed T-Shaped Circular Distributed Microphone Array for SSL
The Proposed SSL Algorithm in Combination with Distributed Microphone Array
Data Recording and Simulation Conditions
The Evaluation’s Scenarios
The Results on Simulated and Real Data
The proposed TCDMA-AGGPM
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call