Printed Japanese Character Recognition Using Multiple Commercial OCRs

Hidetoshi Miyao, Atsuhiko Tani, Hirosato Tabaru, Yasuaki Nakano, Toshihiro Hananoi

doi:10.20965/jaciii.2004.p0200


Open Access

https://doi.org/10.20965/jaciii.2004.p0200


Abstract


This paper proposes two algorithms for keeping lines and characters matched across the text documents output by multiple commercial optical character readers (OCRs): (1) a line matching algorithm that uses dynamic programming (DP) matching, and (2) a character matching algorithm that uses character string division and standard character strings. It also proposes a character recognition method that introduces majority logic and reject processing. To demonstrate the feasibility of the method, we conducted line matching and recognition experiments on 127 document images using five commercial OCRs. The results showed that, with appropriate line matching, the method extracted character areas more accurately than a single OCR. In experiments using the character matching algorithm and character recognition, the proposed method raised the recognition rate from 97.61% for a single OCR to 98.83%. The method is expected to be highly useful for correcting locations where unwanted lines or characters appear or where required lines or characters are missing.
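The two ideas named in the abstract, DP line matching and majority logic with rejection, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the similarity measure, the gap cost, or the reject rule, so `line_similarity`, `gap_penalty`, and `min_votes` below are hypothetical choices made only for illustration. The character matching step (string division and standard character strings) is not covered here.

```python
from collections import Counter
from difflib import SequenceMatcher

GAP = None  # marks a line that one OCR output is missing


def line_similarity(a: str, b: str) -> float:
    """Assumed similarity measure: difflib ratio in [0, 1]."""
    return SequenceMatcher(None, a, b).ratio()


def match_lines(ref, other, gap_penalty=0.3):
    """Align two lists of text lines with a global DP alignment.

    gap_penalty is an assumed cost for a line present in only one output.
    Returns a list of (ref_line, other_line) pairs; GAP fills missing lines.
    """
    n, m = len(ref), len(other)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    back = [[""] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = score[i - 1][0] - gap_penalty
        back[i][0] = "up"
    for j in range(1, m + 1):
        score[0][j] = score[0][j - 1] - gap_penalty
        back[0][j] = "left"
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sim = line_similarity(ref[i - 1], other[j - 1])
            candidates = [
                (score[i - 1][j - 1] + sim, "diag"),   # lines correspond
                (score[i - 1][j] - gap_penalty, "up"),  # line missing in other
                (score[i][j - 1] - gap_penalty, "left"),  # extra line in other
            ]
            score[i][j], back[i][j] = max(candidates, key=lambda c: c[0])
    # Follow the stored back pointers from the end to recover the alignment.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        move = back[i][j]
        if move == "diag":
            pairs.append((ref[i - 1], other[j - 1]))
            i, j = i - 1, j - 1
        elif move == "up":
            pairs.append((ref[i - 1], GAP))
            i -= 1
        else:
            pairs.append((GAP, other[j - 1]))
            j -= 1
    pairs.reverse()
    return pairs


def vote(candidates, min_votes=3):
    """Majority logic with rejection over already-aligned characters.

    Returns the most frequent character, or None (reject) when no
    character reaches min_votes (an assumed threshold).
    """
    counts = Counter(c for c in candidates if c is not None)
    if not counts:
        return None
    char, votes = counts.most_common(1)[0]
    return char if votes >= min_votes else None


if __name__ == "__main__":
    print(match_lines(["abc", "def", "ghi"], ["abc", "ghi"]))
    print(vote(["日", "日", "目", "日", "曰"]))
```

In this sketch a rejected position returns `None` instead of a guessed character, which is one simple way to realize reject processing when the OCR outputs disagree too much.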


Journal: Journal of Advanced Computational Intelligence and Intelligent Informatics
Publication Date: Mar 20, 2004
Citations: 3
License type: cc-by-nd


