Loss Function Design for DNN-Based Sound Event Localization and Detection on Low-Resource Realistic Data

Qing Wang,Jia Pan,Jun Du,Chin-Hui Lee,Li Chai,Shutong Niu,Zhaoxu Nian,Huaxin Wu

doi:10.1109/icassp49357.2023.10095144

Abstract

This study focuses on the design of a loss function for a deep neural network (DNN)-based model with two branches, which is used to solve sound event localization and detection (SELD) on low-resource realistic data. To this end, we employ a secondary network for audio classification, which provides global event information to the main network, enabling it to make robust SELD predictions. Furthermore, we suggest utilizing a momentum strategy for direction-of-arrival (DOA) estimation, taking advantage of the strong temporal consistency of sound events, thereby effectively reducing localization error. Lastly, we incorporate a regularization term into the loss function to alleviate the overfitting problem on the small dataset. We evaluate our proposed methods on the Detection and Classification of Acoustic Scenes and Events (DCASE) 2022 Task 3 dataset, and the results demonstrate consistent improvements in SELD performance. In comparison to the baseline system, the proposed loss function yields significantly improved results for both localization and detection metrics on realistic data. Moreover, the proposed loss function demonstrates its ability to generalize across different network architectures, as evidenced by the consistent improvements achieved.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Loss Function Design for DNN-Based Sound Event Localization and Detection on Low-Resource Realistic Data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Method Based on Dual Cross-Modal Attention and Parameter Sharing for Polyphonic Sound Event Localization and Detection
Sang-Hoon Lee ... Hyung-Min Park
Applied Sciences | VOL. 12
Sang-Hoon Lee, et. al.Sang-Hoon Lee ... Hyung-Min Park
18 May 2022
Applied Sciences | VOL. 12

Adaptive Memory-Controlled Self-Attention for Polyphonic Sound Event Detection
Mei Wang ... Hongbin Qiu
Symmetry | VOL. 14
Mei Wang, et. al.Mei Wang ... Hongbin Qiu
12 Feb 2022
Symmetry | VOL. 14

Sound Event Localization and Detection Using Imbalanced Real and Synthetic Data via Multi-Generator
Yeongseo Shin ... Chanjun Chun
Sensors | VOL. 23
Yeongseo Shin, et. al.Yeongseo Shin ... Chanjun Chun
23 Mar 2023
Sensors | VOL. 23

Creating a new research community on detection and classification of acoustic scenes and events: Lessons from the first ten years of DCASE challenges and workshops
Mark Plumbley ... Tuomas Virtanen
INTER-NOISE and NOISE-CON Congress and Conference Proceedings | VOL. 265
Mark Plumbley, et. al.Mark Plumbley ... Tuomas Virtanen
01 Feb 2023
INTER-NOISE and NOISE-CON Congress and Conference Proceedings | VOL. 265

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Loss Function Design for DNN-Based Sound Event Localization and Detection on Low-Resource Realistic Data

Abstract

Talk to us

Similar Papers