Abstract

The automatic recognition of scanned Medieval manuscripts written in square notation still represents a challenge due to degradation, non-standard layouts, or notations. We propose to apply CNN/LSTM networks that are trained using the segmentation-free CTC-loss-function. For evaluation, we use three different manuscripts and achieve a diplomatic Symbol Accuracy Rate (dSAR) of 86.0% on the most difficult book and 92.2% on the cleanest one. A neume dictionary during decoding yields a relative improvement of about 5%. Finally, we perform a thorough error analysis to provide a deeper insight into problems of the algorithm.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call