Deep Neural Networks for Document Processing of Music Score Images

Jorge Calvo-Zaragoza,Francisco J Castellanos,Gabriel Vigliensoni,Ichiro Fujinaga

doi:10.3390/app8050654

Abstract

There is an increasing interest in the automatic digitization of medieval music documents. Despite efforts in this field, the detection of the different layers of information on these documents still poses difficulties. The use of Deep Neural Networks techniques has reported outstanding results in many areas related to computer vision. Consequently, in this paper, we study the so-called Convolutional Neural Networks (CNN) for performing the automatic document processing of music score images. This process is focused on layering the image into its constituent parts (namely, background, staff lines, music notes, and text) by training a classifier with examples of these parts. A comprehensive experimentation in terms of the configuration of the networks was carried out, which illustrates interesting results as regards to both the efficiency and effectiveness of these models. In addition, a cross-manuscript adaptation experiment was presented in which the networks are evaluated on a different manuscript from the one they were trained. The results suggest that the CNN is capable of adapting its knowledge, and so starting from a pre-trained CNN reduces (or eliminates) the need for new labeled data.

Highlights

Significant efforts for the preservation of music heritage have occurred in recent decades.The digitization process has significantly improved the access to these sources while ensuring their physical preservation; to make the music contained in these documents truly browsable and searchable, it is necessary to encode the symbolic information into a structured digital format such as MusicXML or Music Encoding Initiative (MEI)
We define the document processing as the detection and categorization of the different layers of information contained in the music score image
Since the possible combinations of all the different parameters lead to a huge set of different neural networks to be trained per fold, we propose a serialization of the experiments

Summary

Introduction

Significant efforts for the preservation of music heritage have occurred in recent decades.The digitization process has significantly improved the access to these sources while ensuring their physical preservation; to make the music contained in these documents truly browsable and searchable, it is necessary to encode the symbolic information into a structured digital format such as MusicXML or Music Encoding Initiative (MEI). Each pixel of the image is queried, and its feature block is forwarded and processed by the network, as illustrated in Concerning the CNN configuration, it is still an open question what hyper-parameters (e.g., number of layers, number of filters per layer, size of the filters, etc.) are useful to a greater or lesser extent for this task. This is why we carried out a thorough study of different CNN configurations. The idea is not to find an optimal configuration, which would be unfeasible to demonstrate, but to study what hyper-parameter configurations make a greater difference in performance and computational cost, as well as to analyze the best level of accuracy that can be attained using this approach.

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Apr 24, 2018
Citations: 32	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Deep Neural Networks for Document Processing of Music Score Images

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Deep distributed convolutional neural networks: Universality
Ding-Xuan Zhou
Analysis and Applications | VOL. 16
Ding-Xuan ZhouDing-Xuan Zhou
01 Nov 2018
Analysis and Applications | VOL. 16

Research on improved convolutional wavelet neural network
Jingwei Liu ... Xuehan Tang
Scientific Reports | VOL. 11
Jingwei Liu, et. al.Jingwei Liu ... Xuehan Tang
09 Sep 2021
Scientific Reports | VOL. 11

Strategies for Boosted Learning Using VGG 3 and Deep Neural Network as Baseline Models
K S Gautam ... M Akila
-
K S Gautam, et. al.K S Gautam ... M Akila
01 Jan 2020
01 Jan 2020

Brain tumor segmentation with deep convolutional symmetric neural network
Hao Chen ... Zhen Qin
Neurocomputing | VOL. 392
Hao Chen, et. al.Hao Chen ... Zhen Qin
24 Apr 2019
Neurocomputing | VOL. 392

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Neural Networks for Document Processing of Music Score Images

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences