Historical Text Line Segmentation Using Deep Learning Algorithms: Mask-RCNN against U-Net Networks.

Florian Côme Fizaine,Patrick Bard,Edouard Bouyé,Annie Vinter,Cécile Robin,Raphaël Lefèvre,Michel Paindavoine

doi:10.3390/jimaging10030065

Abstract

Text line segmentation is a necessary preliminary step before most text transcription algorithms are applied. The leading deep learning networks used in this context (ARU-Net, dhSegment, and Doc-UFCN) are based on the U-Net architecture. They are efficient, but fall under the same concept, requiring a post-processing step to perform instance (e.g., text line) segmentation. In the present work, we test the advantages of Mask-RCNN, which is designed to perform instance segmentation directly. This work is the first to directly compare Mask-RCNN- and U-Net-based networks on text segmentation of historical documents, showing the superiority of the former over the latter. Three studies were conducted, one comparing these networks on different historical databases, another comparing Mask-RCNN with Doc-UFCN on a private historical database, and a third comparing the handwritten text recognition (HTR) performance of the tested networks. The results showed that Mask-RCNN outperformed ARU-Net, dhSegment, and Doc-UFCN using relevant line segmentation metrics, that performance evaluation should not focus on the raw masks generated by the networks, that a light mask processing is an efficient and simple solution to improve evaluation, and that Mask-RCNN leads to better HTR performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Historical Text Line Segmentation Using Deep Learning Algorithms: Mask-RCNN against U-Net Networks.

Abstract

Talk to us

Similar Papers

More From: Journal of imaging

Lead the way for us

Journal: Journal of imaging	Publication Date: Mar 5, 2024
License type: CC BY 4.0

Similar Papers

Line Segmentation of Tibetan Ancient Books Based on A* Algorithm
Huaming Liu ... Xiuyou Wang
Journal of Physics: Conference Series | VOL. 2356
Huaming Liu, et. al.Huaming Liu ... Xiuyou Wang
01 Oct 2022
Journal of Physics: Conference Series | VOL. 2356

Text line and word segmentation of handwritten documents
G Louloudis ... C Halatsis
Pattern Recognition | VOL. 42
G Louloudis, et. al.G Louloudis ... C Halatsis
04 Jan 2009
Pattern Recognition | VOL. 42

A robust method for line and word segmentation in handwritten text
Abdelaali Hassaine
-
Abdelaali HassaineAbdelaali Hassaine
01 Jan 2013
01 Jan 2013

Touching text line segmentation combined local baseline and connected component for Uchen Tibetan historical documents
Pengfei Hu ... Tiejun Wang
Information Processing & Management | VOL. 58
Pengfei Hu, et. al.Pengfei Hu ... Tiejun Wang
27 Jul 2021
Information Processing & Management | VOL. 58

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Historical Text Line Segmentation Using Deep Learning Algorithms: Mask-RCNN against U-Net Networks.

Abstract

Talk to us

Similar Papers

More From: Journal of imaging