Abstract

Deep learning is bringing breakthroughs to many computer vision subfields, including Optical Music Recognition (OMR), which has seen a series of improvements to musical symbol detection achieved by using generic deep learning models. However, so far, each such proposal has been based on a specific dataset and different evaluation criteria, which has made it difficult to quantify the new deep learning-based state of the art and assess the relative merits of these detection models on music scores. In this paper, a baseline for general detection of musical symbols with deep learning is presented. We consider three datasets of heterogeneous typology but with the same annotation format, three neural models of different nature, and establish their performance in terms of a common evaluation standard. The experimental results confirm that direct music object detection with deep learning is indeed promising, but at the same time illustrate some of the domain-specific shortcomings of the general detectors. A qualitative comparison then suggests avenues for OMR improvement, based both on properties of the detection models and on how the datasets are defined. To the best of our knowledge, this is the first time that competing music object detection systems from the machine learning paradigm are directly compared to each other. We hope that this work will serve as a reference to measure the progress of future developments of OMR in music object detection.

Highlights

  • Optical Music Recognition (OMR) is the field of research that investigates how to computationally read music notation in documents

  • The aggregate detection performance of the individual models over each of the datasets is reported in Table 2, presenting both mean AP (mAP) and weighted mAP (w-mAP) as defined for the Common Objects in Context (COCO) challenge [17]

  • These results should serve as the baseline for further music object detection research
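The COCO metric cited in the highlights averages per-class AP over ten IoU thresholds (0.50 to 0.95 in steps of 0.05) before aggregating across classes into mAP. The per-class, per-threshold AP can be sketched as follows, assuming detections have already been matched to ground truth at a fixed IoU threshold; the function name and the toy inputs below are illustrative, not part of the paper:

```python
def average_precision(tp_flags, n_gt):
    """COCO-style 101-point interpolated AP for one class.

    tp_flags: true/false-positive flags of the detections, sorted by
    descending confidence; n_gt: number of ground-truth objects.
    """
    precisions, recalls = [], []
    tp = fp = 0
    for is_tp in tp_flags:
        if is_tp:
            tp += 1
        else:
            fp += 1
        precisions.append(tp / (tp + fp))
        recalls.append(tp / n_gt)
    # Interpolate precision at 101 evenly spaced recall levels:
    # at each level, take the best precision achieved at that recall or higher.
    ap = 0.0
    for i in range(101):
        r = i / 100
        p = max((p_ for p_, r_ in zip(precisions, recalls) if r_ >= r),
                default=0.0)
        ap += p / 101
    return ap

# Three detections against two ground-truth objects: hit, miss, hit.
print(round(average_precision([True, False, True], 2), 3))  # → 0.835
```

Repeating this over all IoU thresholds and averaging gives the per-class COCO AP; the mean over classes is mAP, and weighting that mean by class frequency yields the w-mAP variant reported in Table 2.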



Introduction

Optical Music Recognition (OMR) is the field of research that investigates how to computationally read music notation in documents. OMR has been approached by workflows composed of several stages, as outlined in the previous section, and these stages were further subdivided into smaller steps. Within the music object detection stage, the key step used to be staff-line detection and removal [20]. Even with an ideal staff-line removal algorithm, isolating musical symbols by means of connected components remains problematic, since multiple primitives may be connected to each other (e.g., a beam group can be a single connected component that includes several noteheads, stems, and beams), or a single unit can have multiple disconnected parts (e.g., a fermata, a volta, or an F-clef). The second case is particularly severe in the context of handwritten notation, where symbols can be written with such high variability (e.g., detached noteheads) that modeling all possible appearances becomes intractable.
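The failure mode described above can be demonstrated with a minimal connected-component labeling routine. The sketch below (plain BFS flood fill over a binary image; all names and toy images are illustrative, not from the paper) shows a beam group collapsing into one component while a two-part symbol splits into two:

```python
from collections import deque

def connected_components(img):
    """Count 4-connected components of foreground pixels (1s) in a binary image."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    count = 0
    for y in range(h):
        for x in range(w):
            if img[y][x] and not seen[y][x]:
                count += 1
                # BFS flood fill from this unvisited foreground pixel.
                queue = deque([(y, x)])
                seen[y][x] = True
                while queue:
                    cy, cx = queue.popleft()
                    for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                        ny, nx = cy + dy, cx + dx
                        if 0 <= ny < h and 0 <= nx < w \
                                and img[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
    return count

# Two stems joined by a beam: three primitives, but one component.
beam_group = [
    [1, 0, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
]
# A fermata-like glyph (arc plus detached dot): one symbol, two components.
fermata = [
    [1, 1, 1, 0],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
]
print(connected_components(beam_group))  # → 1
print(connected_components(fermata))     # → 2
```

Neither count corresponds to the number of musical symbols, which is why the paper turns to learned object detectors instead of component-based segmentation.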

