Abstract
This paper introduces a novel loss function, the boundary Gaussian distance loss, designed to enhance character segmentation in high-resolution scans of old metal-type printed documents. Despite various printing defects caused by low-quality printing technology in the 14th and 15th centuries, the proposed loss function allows the segmentation network to accurately extract character strokes that can be attributed to the typeface of the movable metal type used for printing. Our method calculates deviation between the boundary of predicted character strokes and the counterpart of the ground-truth strokes. Diverging from traditional Euclidean distance metrics, our approach determines the deviation indirectly utilizing boundary pixel-value difference over a Gaussian-smoothed version of the stroke boundary. This approach helps extract characters with smooth boundaries efficiently. Through experiments, it is confirmed that the proposed method not only smoothens stroke boundaries in character extraction, but also effectively eliminates noise and outliers, significantly improving the clarity and accuracy of the segmentation process.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.