Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation

Khaoula Elagouni,Pascale Sébillot,Franck Mamalet,Christophe Garcia

doi:10.1007/s10032-013-0202-7

Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation

Khaoula Elagouni, Pascale Sébillot + Show 2 more

Open Access

https://doi.org/10.1007/s10032-013-0202-7

Copy DOI

Journal: International Journal on Document Analysis and Recognition (IJDAR)	Publication Date: Feb 19, 2013
Citations: 64

Affiliation: Orange (France), Institut National des Sciences Appliquées de Rennes, Institut de Recherche en Informatique et Systèmes Aléatoires

#Optical Character Recognition Systems #Optical Character Recognition + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Text embedded in multimedia documents represents an important semantic information that helps to automatically access the content. This paper proposes two neural-based optical character recognition (OCR) systems that handle the text recognition problem in different ways. The first approach segments a text image into individual characters before recognizing them, while the second one avoids the segmentation step by integrating a multi-scale scanning scheme that allows to jointly localize and recognize characters at each position and scale. Some linguistic knowledge is also incorporated into the proposed schemes to remove errors due to recognition confusions. Both OCR systems are applied to caption texts embedded in videos and in natural scene images and provide outstanding results showing that the proposed approaches outperform the state-of-the-art methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: International Journal on Document Analysis and Recognition (IJDAR)

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.