Sound Event Detection in Domestic Environment Using Frequency-Dynamic Convolution and Local Attention

Grigorios-Aris Cheimariotis, Nikolaos Mitianoudis

doi:10.3390/info14100534

Open Access

https://doi.org/10.3390/info14100534

Journal: Information
Publication Date: Sep 30, 2023
Citations: 2
License type: CC BY 4.0

Affiliation: Democritus University of Thrace

Abstract

This work describes a methodology for sound event detection in domestic environments. Efficient solutions to this task can support the autonomous living of the elderly. The methodology addresses the 2023 edition of the “Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE)”, and more specifically Task 4a, “Sound event detection of domestic activities”. This task involves detecting 10 common domestic sound events in 10 s sound clips; an event may have arbitrary duration within a clip. The main components of the methodology are data augmentation on the mel-spectrograms that represent the sound clips, feature extraction by passing the spectrograms through a frequency-dynamic convolution network with an extra attention module in sequence with each convolution, concatenation of these features with BEATs embeddings, and a BiGRU for sequence modeling. In addition, a mean teacher model is employed to leverage unlabeled data. This research focuses on the effects of the data augmentation techniques, the feature extraction models, and self-supervised learning. The main contribution is the proposed feature extraction model, which applies weighted attention on frequency in each convolution, combined in sequence with a local attention module adopted from computer vision. The proposed system shows promising and robust performance.
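
The mean teacher component mentioned in the abstract can be illustrated with a minimal sketch. In the general mean teacher technique, the teacher's weights are kept as an exponential moving average (EMA) of the student's weights, and the teacher's predictions on unlabeled clips serve as consistency targets for the student. The abstract does not give the authors' implementation details, so the function name `ema_update`, the dictionary representation of weights, and the `decay` values below are illustrative assumptions, not the paper's settings.

```python
def ema_update(teacher, student, decay=0.999):
    """Return teacher weights moved toward the student weights by an EMA step.

    `teacher` and `student` map parameter names to floats (a stand-in for
    real tensors). `decay` is a hypothetical value, not taken from the paper.
    """
    return {
        name: decay * teacher[name] + (1.0 - decay) * student[name]
        for name in teacher
    }


# Toy usage: the teacher drifts toward fixed student weights step by step.
teacher = {"w": 0.0, "b": 1.0}
student = {"w": 1.0, "b": 0.0}
for _ in range(3):
    teacher = ema_update(teacher, student, decay=0.5)
```

In a real training loop the student would be updated by gradient descent between EMA steps, and a decay close to 1 would normally be used so that the teacher changes slowly.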
