Layout and Perspective Distortion Independent Recognition of Captured Chinese Document Image

Yanwei Wang, Yuefang Sun, Changsong Liu

doi:10.1109/icdar.2017.102

Abstract

This paper introduces a layout- and perspective-distortion-independent recognition framework for captured Chinese document images. Under the framework, 1) a conditional random field (CRF) is employed for text line extraction from a global point of view. Because the text line extraction is layout independent, it can be widely used on different types of document images. 2) A text-line-image-based perspective distortion correction method is detailed and used in three different ways. 3) The text line extraction and perspective distortion correction are combined with character recognition to construct a recognition system. On three captured document image datasets with different degrees of distortion, the proposed framework improves the accuracies from 94.03% to 95.20%, from 13.01% to 93.71%, and from 10.63% to 92.68%, respectively. The experimental results demonstrate that the framework is promising for solving layout and perspective distortion problems in captured document image recognition.
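The abstract does not spell out how the perspective distortion correction is computed. As a rough illustration of the general idea only, the sketch below estimates a planar homography with the standard direct linear transform (DLT) and uses it to map the four corners of a distorted text line onto an upright rectangle. This is a generic technique swapped in for illustration, not the authors' method; the function names `estimate_homography` and `apply_homography` and all corner coordinates are hypothetical.

```python
import numpy as np


def estimate_homography(src, dst):
    """Estimate a 3x3 homography mapping src points to dst points (DLT).

    Illustrative only: assumes at least four point pairs in general position.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        # Each correspondence contributes two linear constraints on h.
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    a = np.array(rows)
    # The solution is the right singular vector for the smallest singular value.
    _, _, vt = np.linalg.svd(a)
    h = vt[-1].reshape(3, 3)
    return h / h[2, 2]


def apply_homography(h, points):
    """Map 2D points through the homography h."""
    pts = np.asarray(points, dtype=float)
    homog = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = homog @ h.T
    # Divide by the third homogeneous coordinate to return to 2D.
    return mapped[:, :2] / mapped[:, 2:3]


# Hypothetical corners of a distorted text line (top-left, top-right,
# bottom-right, bottom-left) and the upright rectangle they should map to.
src = [(10, 20), (210, 40), (200, 90), (5, 60)]
dst = [(0, 0), (200, 0), (200, 40), (0, 40)]

h = estimate_homography(src, dst)
rectified = apply_homography(h, src)
```

In a full pipeline, a matrix like `h` would be used to resample the text line image before character recognition; how the source corners are obtained from the extracted text lines is the part that the paper itself addresses and that this sketch leaves out.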

