Robust direction estimation with convolutional neural networks based steered response power

Pasi Pertila,Emre Cakir

doi:10.1109/icassp.2017.7953333

Abstract

The steered response power (SRP) methods can be used to build a map of sound direction likelihood. In the presence of interference and reverberation, the map will exhibit multiple peaks with heights related to the corresponding sound's spectral content. Often in realistic use cases, the target of interest (such as speech) can exhibit a lower peak compared to an interference source. This will corrupt any direction dependent method, such as beamforming. Regression has been used to predict time-frequency (TF) regions corrupted by reverberation, and static broadband noise can be efficiently estimated for TF points. TF regions dominated by noise or reverberation can then be de-emphasized to obtain more reliable source direction estimates. In this work, we propose the use of convolutional neural networks (CNNs) for the prediction of a TF mask for emphasizing the direct path speech signal in time-varying interference. SRP with phase transform (SRP-PHAT) combined with the CNN-based masking is shown to be capable of reducing the impact of time-varying interference for speaker direction estimation using real speech sources in reverberation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust direction estimation with convolutional neural networks based steered response power

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Performance Evaluation of Iterative SRP-PHAT Techniques for Acoustic Source Localization
Ritu Boora ... Sanjeev Kumar Dhull
-
Ritu Boora, et. al.Ritu Boora ... Sanjeev Kumar Dhull
01 Jan 2021
01 Jan 2021

SRP-PHAR Combined Velocity Scanning for Locating the Shallow Underground Acoustic Source
Pengfei Nie ... Ping Chen
IEEE Access | VOL. 7
Pengfei Nie, et. al.Pengfei Nie ... Ping Chen
01 Jan 2019
IEEE Access | VOL. 7

DOA Estimation for Spherical Microphone Array using Spherical Convolutional Neural Networks
Israel Mendoza Velazquez ... Yi Ren
-
Israel Mendoza Velazquez, et. al.Israel Mendoza Velazquez ... Yi Ren
12 Oct 2021
12 Oct 2021

Direction of Arrival Estimation with Microphone Arrays Using SRP-PHAT and Neural Networks
David Díaz-Guerra Aparicio ... José Ramón Beltrán Blázquez
Jornada de Jóvenes Investigadores del I3A | VOL. 6
David Díaz-Guerra Aparicio, et. al.David Díaz-Guerra Aparicio ... José Ramón Beltrán Blázquez
25 May 2018
Jornada de Jóvenes Investigadores del I3A | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust direction estimation with convolutional neural networks based steered response power

Abstract

Talk to us

Similar Papers