Abstract

Multiple sound source localization is a hot issue of concern in recent years. The Single Source Zone (SSZ) based localization methods achieve good performance due to the detection and utilization of the Time-Frequency (T-F) zone where only one source is dominant. However, some T-F points consisting of components from multiple sources are also included in the detected SSZ sometimes. Once a T-F point in SSZ is contributed by multiple components, this point is defined as an outlier. The existence of outliers within the detected SSZ is usually an unavoidable problem for SSZ-based methods. To solve this problem, a multi-source localization by using offset residual weight is proposed in this paper. In this method, an assumption is developed: the direction estimated by all the T-F points within the detected SSZ has a difference along with the actual direction of sources. But this difference is much smaller than the difference between the directions estimated by the outliers along with the actual source localization. After verifying this assumption experimentally, Point Offset Residual Weight (PORW) and Source Offset Residual Weight (SORW) are proposed to reduce the influence of outliers on the localization results. Then, a composite weight is formed by combining PORW and SORW, which can effectively distinguish the outliers and desired points. After that, the outliers are removed by composite weight. Finally, a statistical histogram of DOA estimation with outliers removed is used for multi-source localization. The objective evaluation of the proposed method is conducted in various simulated environments. The results show that the proposed method achieves a better performance compared with the reference methods in sources localization.

Highlights

  • Multiple sound source localization is a hot subject in audio signal processing and has gained extensive attention over decades for its vital role in various audio applications

  • 3.3 The proposed point offset residual measurement From the former section, we found that there are always outliers exist in the detected Single Source Zone (SSZ) and the change of the SSZ detection threshold can hardly change the proportion of outliers

  • An experiment is conducted to verify the proposed Source Offset Residual Weight (SORW) in different reverberation time using the same group of data, the average of SORW calculated by outliers and desired points is shown in Fig. 5: It can be found that the SORWs which are given to the desired points are higher than that given to the outliers in different reverberation time

Read more

Summary

Introduction

Multiple sound source localization is a hot subject in audio signal processing and has gained extensive attention over decades for its vital role in various audio applications. When there are no reverberation components or noise components in the T-F zone used for analysis, all the TF points within the T-F zone consist of the signals from the same source through the direct path, the result of (4) should satisfy rpq(Z) = 1, while it is hard to realize in the actual experimental environment so the criterion is relaxed as below: rpqðZÞ > 1−ε ð6Þ where ε is an empirical threshold set by user according to the practical scenario This threshold should guarantee that enough T-F zones are detected to perform the localization; most of the T-F zones contaminated by reverberation and interfere sources should be removed

Angular calculation for T-F points
Results and discussion
Conclusions
35 Proposed method without PORW Proposed method without SORW Proposed method
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.