End-to-End Neural Optical Music Recognition of Monophonic Scores

Jorge Calvo-Zaragoza,David Rizo

doi:10.3390/app8040606

Abstract

Optical Music Recognition is a field of research that investigates how to computationally decode music notation from images. Despite the efforts made so far, there are hardly any complete solutions to the problem. In this work, we study the use of neural networks that work in an end-to-end manner. This is achieved by using a neural model that combines the capabilities of convolutional neural networks, which work on the input image, and recurrent neural networks, which deal with the sequential nature of the problem. Thanks to the use of the the so-called Connectionist Temporal Classification loss function, these models can be directly trained from input images accompanied by their corresponding transcripts into music symbol sequences. We also present the Printed Music Scores dataset, containing more than 80,000 monodic single-staff real scores in common western notation, that is used to train and evaluate the neural approach. In our experiments, it is demonstrated that this formulation can be carried out successfully. Additionally, we study several considerations about the codification of the output musical sequences, the convergence and scalability of the neural models, as well as the ability of this approach to locate symbols in the input score.

Highlights

During the past few years, the availability of huge collections of digital scores has facilitated both the music professional practice and the amateur access to printed sources that were difficult to obtain in the past
We present the Printed Images of Music Staves (PrIMuS) dataset, containing more than 80,000 monodic single-staff real scores in common western notation, that is used to train and evaluate the neural approach
We study in this work a holistic approach to the task of retrieving the music symbols that appear in score images

Summary

Introduction

During the past few years, the availability of huge collections of digital scores has facilitated both the music professional practice and the amateur access to printed sources that were difficult to obtain in the past. Some examples of these collections are the IMSLP (http://imslp.org) website with currently 425,000 classical music scores, or many different sites offering Real Book jazz lead sheets. The great possibilities that current music-based applications can offer are restricted to scores symbolically encoded An initial processing of the image is required This involves various steps of document analysis, not always strictly related to the musical domain. Results have reached values closer to the optimum over standard benchmarks by using DL [27,28]

Objectives

Methods

Findings

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Apr 11, 2018
Citations: 60	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

End-to-End Neural Optical Music Recognition of Monophonic Scores

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

New Switchless and Free Positioning Gesture Recognition System Using RNN and CTC Loss Function
Kei Nakada ... Atsushi Ito
-
Kei Nakada, et. al.Kei Nakada ... Atsushi Ito
01 Dec 2018
01 Dec 2018

A Weakly-Supervised Approach for Layout Analysis in Music Score Images
Eric Ayllon ... Jorge Calvo-Zaragoza
-
Eric Ayllon, et. al.Eric Ayllon ... Jorge Calvo-Zaragoza
01 Jan 2023
01 Jan 2023

Survey on Silentinterpreter : Analysis of Lip Movement and Extracting Speech using Deep Learning
Ameen Hafeez ... Prof Shwetha K S
International Journal of Scientific Research in Science, Engineering and Technology | VOL. 11
Ameen Hafeez, et. al. Ameen Hafeez ... Prof Shwetha K S
07 Apr 2024
International Journal of Scientific Research in Science, Engineering and Technology | VOL. 11

Pengantar dan Survey Tentang Optical Music Recognition
Kevin Purwito
Jurnal ULTIMATICS | VOL. 6
Kevin PurwitoKevin Purwito
01 Jun 2014
Jurnal ULTIMATICS | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

End-to-End Neural Optical Music Recognition of Monophonic Scores

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences