High-Resolution Piano Transcription With Pedals by Regressing Onset and Offset Times

Qiuqiang Kong, Xuchen Song, Bochen Li, Yuan Wan, Yuxuan Wang

doi:10.1109/taslp.2021.3121991

Abstract

Automatic music transcription (AMT) is the task of transcribing audio recordings into symbolic representations. Recently, neural network-based methods have been applied to AMT and have achieved state-of-the-art results. However, many previous systems detect note onsets and offsets only frame-wise, so the transcription resolution is limited to the frame hop size. There is little research on using different strategies to encode onset and offset targets for training. In addition, previous AMT systems are sensitive to misaligned onset and offset labels in audio recordings. Furthermore, there is limited research on sustain pedal transcription on large-scale datasets. In this article, we propose a high-resolution AMT system trained by regressing the precise onset and offset times of piano notes. At inference, we propose an algorithm to analytically calculate the precise onset and offset times of piano notes and pedal events. We show that our AMT system is more robust to misaligned onset and offset labels than previous systems. Our proposed system achieves an onset F1 score of 96.72% on the MAESTRO dataset, outperforming the previous Onsets and Frames system, which achieves 94.80%. Our system achieves a pedal onset F1 score of 91.86%, which is the first benchmark result on the MAESTRO dataset. We have released the source code and checkpoints of our work at https://github.com/bytedance/piano_transcription.
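The abstract does not spell out how onset times are encoded as regression targets or recovered analytically at inference, so the following is only a minimal sketch of one way the idea could work, not the authors' implementation. It assumes a triangular target that equals 1 at the true onset and decays linearly to 0 over `J` frames, and it recovers a sub-frame onset time from the peak frame and its two neighbours. The function names, the 100 frames-per-second rate, `J=5`, and the 0.3 threshold are all illustrative assumptions.

```python
def encode_onset_target(onset_time, num_frames, frames_per_second=100.0, J=5):
    """Per-frame regression target for one onset (assumed triangular shape).

    The target is 1 at the exact onset time and falls linearly to 0
    at a distance of J frames, so it carries sub-frame timing information.
    """
    targets = []
    for i in range(num_frames):
        dist_frames = abs(i / frames_per_second - onset_time) * frames_per_second
        targets.append(max(0.0, 1.0 - dist_frames / J))
    return targets


def decode_onset_times(outputs, frames_per_second=100.0, threshold=0.3):
    """Recover sub-frame onset times from per-frame regression outputs.

    For each local maximum above the (assumed) threshold, the peak of a
    symmetric triangle passing through the three neighbouring points is
    computed in closed form and converted to seconds.
    """
    times = []
    for n in range(1, len(outputs) - 1):
        left, mid, right = outputs[n - 1], outputs[n], outputs[n + 1]
        is_peak = mid > left and mid >= right
        if mid < threshold or not is_peak:
            continue
        # The steeper side of the triangle spans the peak frame and the
        # lower neighbour; its slope fixes the sub-frame shift.
        if left > right:
            shift = (right - left) / (mid - right) / 2
        else:
            shift = (right - left) / (mid - left) / 2
        times.append((n + shift) / frames_per_second)
    return times


# Usage: an onset at 0.2537 s falls between frames 25 and 26 (10 ms hop).
target = encode_onset_target(0.2537, num_frames=60)
decoded = decode_onset_times(target)
print(decoded)
```

With an ideal triangular output, the decoded time matches the true onset up to floating-point error, even though it lies between frame centres. A frame-wise detector with the same hop size could only report 0.25 s or 0.26 s.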

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Publication Date: Jan 1, 2021
Citations: 40

