Abstract

Bioacoustic event detection is the task of identifying specific animal sounds within biological audio recordings. Only specialists can annotate the temporal locations of animal sounds, since labeling requires knowledge of biological sound, and they can provide only a few annotations because the long duration of biological recordings makes labeling highly labor-intensive. The task is therefore framed as few-shot learning, and the prototypical network handles this setting effectively by learning a representation space that represents each class from the given few examples. In this work, we use the spectro-temporal receptive field (STRF), inspired by the auditory cortex, which responds actively to certain spectro-temporal modulations, as the convolutional kernel of a prototypical network. Bioacoustic events contain rich spectro-temporal modulation, so STRF kernels are expected to capture animal sounds effectively. In addition, the STRF kernels are fixed rather than trained, so a model using them can learn the representation space with fewer parameters. We built a model called Two-Branch STRFNet (TB-STRFNet), in which the STRF branch captures spectro-temporal modulation with STRF kernels and the other branch captures the detailed time-frequency information that may be lost in the STRF branch. TB-STRFNet outperformed the other models, demonstrating the effectiveness of this auditory-system-inspired approach.
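To make the architecture described above concrete, the following is a minimal PyTorch sketch of a two-branch encoder with fixed spectro-temporal kernels combined with a learned time-frequency branch, plus the prototype computation used by prototypical networks. The Gabor-like kernel construction, layer sizes, and function names (make_strf_kernels, TwoBranchEncoder, prototypes) are illustrative assumptions, not the paper's exact parameterization.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def make_strf_kernels(n_kernels=16, size=9):
    """Fixed (untrained) 2-D spectro-temporal modulation filters.
    Hypothetical Gabor-like construction for illustration only; the paper's
    exact STRF parameterization may differ."""
    kernels = []
    rates = torch.linspace(0.1, 0.5, 4)         # modulation rates
    thetas = torch.linspace(0, torch.pi, 4)     # orientations (up/down sweeps)
    coords = torch.arange(size) - size // 2
    yy, xx = torch.meshgrid(coords, coords, indexing="ij")
    for r in rates:
        for t in thetas:
            rot = xx * torch.cos(t) + yy * torch.sin(t)
            gauss = torch.exp(-(xx**2 + yy**2) / (2 * (size / 4) ** 2))
            kernels.append(gauss * torch.cos(2 * torch.pi * r * rot))
    return torch.stack(kernels)[:n_kernels].unsqueeze(1)  # (K, 1, size, size)

class TwoBranchEncoder(nn.Module):
    """Sketch of a two-branch encoder: a fixed STRF branch plus a learned
    time-frequency branch, concatenated into a single embedding."""
    def __init__(self, emb_dim=64):
        super().__init__()
        strf = make_strf_kernels()
        self.strf_conv = nn.Conv2d(1, strf.shape[0], strf.shape[-1],
                                   padding=strf.shape[-1] // 2, bias=False)
        self.strf_conv.weight.data = strf
        self.strf_conv.weight.requires_grad = False   # STRF kernels stay fixed
        self.strf_pool = nn.AdaptiveAvgPool2d(8)
        self.tf_branch = nn.Sequential(               # learned branch
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(8))
        self.head = nn.Linear((16 + strf.shape[0]) * 8 * 8, emb_dim)

    def forward(self, x):                             # x: (B, 1, mel, time)
        a = self.strf_pool(F.relu(self.strf_conv(x)))  # spectro-temporal branch
        b = self.tf_branch(x)                          # time-frequency branch
        return self.head(torch.cat([a, b], dim=1).flatten(1))

def prototypes(support_emb, support_labels, n_classes):
    """Prototypical-network step: each class prototype is the mean of the
    support embeddings belonging to that class."""
    return torch.stack([support_emb[support_labels == c].mean(0)
                        for c in range(n_classes)])

# A query embedding is then classified by its (negative) distance
# to each class prototype, as in the standard prototypical network.
```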
