Abstract

Data-driven approaches to automatic drum transcription (ADT) are often limited to a predefined, small vocabulary of percussion instrument classes. Such models can neither recognize out-of-vocabulary classes nor adapt to finer-grained vocabularies. In this work, we address open vocabulary ADT by introducing few-shot learning to the task. We train a Prototypical Network on a synthetic dataset and evaluate the model on multiple real-world ADT datasets with polyphonic accompaniment. We show that, given just a handful of selected examples at inference time, we can match, and in some cases outperform, a state-of-the-art supervised ADT approach under a fixed vocabulary setting. At the same time, we show that our model can successfully generalize to finer-grained or extended vocabularies unseen during training, a scenario where supervised approaches cannot operate at all. We provide a detailed analysis of our experimental results, including a breakdown of performance by sound class and by polyphony.
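For readers unfamiliar with the few-shot setup the abstract refers to, the sketch below illustrates the standard Prototypical Network classification rule: each class prototype is the mean of the embedded support examples, and queries are assigned to the nearest prototype. This is a generic illustration under assumed conventions, not the paper's implementation; the learned audio encoder is stood in for by random vectors, and all names, shapes, and class counts are placeholders.

```python
import numpy as np

def prototypes(support_embeddings, support_labels):
    """Compute one prototype per class as the mean of that class's support embeddings.

    support_embeddings: (n_support, d) array of embedded audio examples.
    support_labels:     (n_support,) integer class ids (e.g. kick, snare, ...).
    """
    classes = np.unique(support_labels)
    protos = np.stack(
        [support_embeddings[support_labels == c].mean(axis=0) for c in classes]
    )
    return classes, protos

def classify(query_embeddings, class_ids, protos):
    """Assign each query embedding to the nearest prototype (squared Euclidean distance)."""
    # (n_query, n_classes) pairwise squared distances via broadcasting
    d2 = ((query_embeddings[:, None, :] - protos[None, :, :]) ** 2).sum(axis=-1)
    return class_ids[d2.argmin(axis=1)]

# Tiny illustrative run: random vectors stand in for a learned encoder's embeddings.
rng = np.random.default_rng(0)
support = rng.normal(size=(10, 16))          # e.g. 5 support examples each for 2 drum classes
labels = np.array([0] * 5 + [1] * 5)
queries = rng.normal(size=(4, 16))

class_ids, protos = prototypes(support, labels)
print(classify(queries, class_ids, protos))   # predicted class id per query
```

In the few-shot ADT setting described above, the support set would be the handful of labeled examples provided at inference time, so new or finer-grained classes only require new prototypes rather than retraining.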
