A spherical microphone array is used to detect and localize sound sources in terms of model-based machine learning (ML). In this application, it is crucial to establish parametric models to distinguish background sound environment from presence of sound sources. In the presence of sound sources, the parameter models are also used to localize an unknown number of potentially multiple sound sources. In this work, a model-based Bayesian learning framework is presented for localizing an unknown number of sound sources. Among them, a no-source scenario needs to be accounted for. The model-based machine learning applies the model comparison between the no-source model and the one-source model for sound source detection. After detecting sound sources, the machine learning needs to involve sound source enumeration and localization in order to correctly localize potential multiple sound sources. Specifically, sound environment is analyzed using Bayesian model comparison of two different models accounting for absence and presence of the sound sources for source detection. Upon a positive detection, potentially multiple source models are involved to analyze direction of arrivals (DoAs) for far-field and to localize sound sources for near-field including source distances, amplitudes, and DoAs using Bayesian model selection and parameter estimation.
Read full abstract