ADTOF: A large dataset of non-synthetic music for automatic drum transcription

Mickaël Zehren, Marco Alunno, Paolo Bientinesi

doi:10.5281/zenodo.5624527

Abstract

The state-of-the-art methods for drum transcription in the presence of melodic instruments (DTM) are machine learning models trained in a supervised manner, which means that they rely on labeled datasets. The problem is that the available public datasets are limited either in size or in realism, and are thus suboptimal for training purposes. Indeed, the best results are currently obtained via a rather convoluted multi-step training process that involves both real and synthetic datasets. To address this issue, starting from the observation that communities of rhythm game players provide a large amount of annotated data, we curated a new dataset of crowdsourced drum transcriptions. This dataset contains real-world music, is manually annotated, and is about two orders of magnitude larger than any other non-synthetic dataset, making it a prime candidate for training purposes. However, due to crowdsourcing, the initial annotations contain mistakes. We discuss how the quality of the dataset can be improved by automatically correcting different types of mistakes. When used to train a popular DTM model, the dataset yields a performance that matches the state of the art for DTM, thus demonstrating the quality of the annotations.

Journal: Zenodo (CERN European Organization for Nuclear Research)
Publication Date: Nov 7, 2021
License type: cc-by
