Creating a new research community on detection and classification of acoustic scenes and events: Lessons from the first ten years of DCASE challenges and workshops

Mark Plumbley, Tuomas Virtanen

doi:10.3397/in_2022_0643

Abstract

Research work on automatic speech recognition and automatic music transcription has been around for several decades, supported by dedicated conferences or conference sessions. However, while individual researchers have been working on recognition of more general environmental sounds, until ten years ago there were no regular workshops or conference sessions where this research, or its researchers, could be found. There was also little available data for researchers to work on or to benchmark their work. In this talk we will outline how a new research community working on Detection and Classification of Acoustic Scenes and Events (DCASE) has grown over the last ten years, from two challenges on acoustic scene classification and sound event detection with a small workshop poster session, to an annual data challenge with six tasks and a dedicated annual workshop, attracting hundreds of delegates and strong industry interest. We will also describe how the analysis methods have evolved, from mel-frequency cepstral coefficients (MFCCs) or cochleograms classified by support vector machines (SVMs) or hidden Markov models (HMMs), to deep learning methods such as transfer learning, transformers, and self-supervised learning. We will finish by suggesting some potential future directions for automatic sound recognition and the DCASE community.

Journal: INTER-NOISE and NOISE-CON Congress and Conference Proceedings
Publication Date: Feb 1, 2023
License type: cc-by

Similar Papers

Acoustic Scene Classification Using Reduced MobileNet Architecture
Jun-Xiang Xu ... Tzu-Ching Lin
01 Dec 2018

A Method Based on Dual Cross-Modal Attention and Parameter Sharing for Polyphonic Sound Event Localization and Detection
Sang-Hoon Lee ... Hyung-Min Park
Applied Sciences | VOL. 12
18 May 2022

Classification of audio scenes with novel features in a fused system framework
Shefali Waldekar ... Goutam Saha
Digital Signal Processing | VOL. 75
11 Jan 2018

Adaptive Memory-Controlled Self-Attention for Polyphonic Sound Event Detection
Mei Wang ... Hongbin Qiu
Symmetry | VOL. 14
12 Feb 2022
