Text Line Images Research Articles

Text in images carries precise semantic information that can be used in many image cognition and understanding applications. Human reading habits provide clues about text line structure that can guide text line extraction. In this paper, we propose a novel text line extraction method, inspired by human reading knowledge and based on k-shortest paths global optimization. First, candidate characters are extracted with the Maximally Stable Extremal Regions (MSER) algorithm on the gray, red, green, and blue channels of the target image, and the extracted MSERs are fed into a Convolutional Neural Network (CNN) to remove noise components. Then, a directed graph is built over the character component nodes, with edges inspired by human reading sense. The directed graph automatically establishes relationships among the candidate text components and removes their disorder. The text line path optimization is inspired by the human ability to plan a text line path sequentially. The text line extraction problem can therefore be solved with the k-shortest paths optimization algorithm, which exploits the reading-sense structure of the directed graph. It extracts text lines iteratively, avoiding exhaustive search and yielding a globally optimized number of text lines. The proposed method achieves f-measures of 0.820 and 0.812 on the public ICDAR2011 and ICDAR2013 datasets, respectively. The experimental results demonstrate the effectiveness of the proposed method in comparison with state-of-the-art methods. This paper presents a text line extraction method inspired by human reading knowledge and shows that such knowledge can benefit text line extraction and image text discovery.
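The graph-based step in the abstract above can be illustrated with a small sketch. It is not the paper's k-shortest paths optimization: it swaps in a simpler greedy scheme that repeatedly pulls the cheapest left-to-right chain out of a directed acyclic graph of character boxes. The `Box` type, the cost function, and the thresholds `max_gap`, `max_shift`, and `reward` are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Hypothetical character candidate (e.g. one MSER that survived filtering)."""
    x: float  # left edge
    y: float  # vertical centre
    w: float  # width
    h: float  # height


def link_cost(a, b, max_gap=2.0, max_shift=0.5):
    """Cost of reading b right after a, or None if the link is implausible.

    Gap and vertical shift are measured relative to character height;
    the thresholds are illustrative assumptions, not values from the paper.
    """
    scale = max(a.h, b.h)
    gap = (b.x - (a.x + a.w)) / scale
    shift = abs(b.y - a.y) / scale
    if gap < -0.2 or gap > max_gap or shift > max_shift:
        return None
    return gap + shift


def best_path(boxes, reward=1.0):
    """Cheapest left-to-right chain, found by dynamic programming on the DAG.

    Each visited box earns `reward` (a negative cost), each link pays its cost.
    """
    order = sorted(range(len(boxes)), key=lambda i: boxes[i].x)
    cost = {i: -reward for i in order}
    prev = {i: None for i in order}
    for pos, j in enumerate(order):
        for i in order[:pos]:
            c = link_cost(boxes[i], boxes[j])
            if c is not None and cost[i] + c - reward < cost[j]:
                cost[j] = cost[i] + c - reward
                prev[j] = i
    node = min(order, key=lambda i: cost[i])
    path = []
    while node is not None:
        path.append(node)
        node = prev[node]
    return path[::-1]


def extract_lines(boxes, min_len=2):
    """Greedy stand-in for the global optimization: extract one chain at a time."""
    remaining = list(boxes)
    lines = []
    while remaining:
        path = best_path(remaining)
        if len(path) < min_len:
            break  # only isolated boxes are left; treat them as noise
        lines.append([remaining[i] for i in path])
        chosen = set(path)
        remaining = [b for k, b in enumerate(remaining) if k not in chosen]
    return lines
```

Calling `extract_lines` on a list of `Box` objects returns one list of boxes per detected line, in reading order. Because each chain is removed before the next search, the result is only locally optimal, which is exactly the limitation a joint k-shortest paths formulation is meant to address.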

Text recognition in scene images and video frames is difficult because of low resolution, blur, background noise, and similar degradations. Since traditional OCR systems do not perform well on such images, keyword-based information retrieval can be an alternative way to index and retrieve this text. The date is a useful piece of information with various applications, including date-wise searching, indexing, and retrieval of videos and scenes. This paper presents a date-spotting-based information retrieval system for natural scene images and video frames in which text appears against complex backgrounds. We propose a line-based date spotting approach that uses a Hidden Markov Model (HMM) to detect date information in a given text line. Different date models are searched for in a line without segmenting characters or words. Given a text line image in RGB, we apply an efficient gray image conversion to enhance the text information. Wavelet decomposition and gradient sub-bands are used to enhance the text information in gray scale. Next, Pyramid Histogram of Oriented Gradient (PHOG) features are extracted from the gray and binary images for the date-spotting framework. The binary and gray image features are combined by an MLP-based Tandem approach. Finally, to boost performance further, a shape-coding-based scheme groups characters of similar shape into the same class during word spotting. For our experiments, three different date models were constructed to search for similar date information in scene/video text, covering numeric dates, which contain numerals and punctuation, and semi-numeric dates, which contain numerals along with month names. We tested our system on 1648 text lines, and the results show the effectiveness of the proposed date spotting approach.
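The shape coding idea in the abstract above, merging look-alike characters into one class before spotting, can be shown with a toy example at the string level. This is only a sketch: the paper works on image features with HMMs, whereas the code below uses regular expressions on noisy text. The groups in `SHAPE_GROUPS` and both date patterns are hypothetical, since the abstract does not list the actual shape classes or date models.

```python
import re

# Hypothetical groups of visually similar characters (assumption: the
# abstract does not give the paper's real shape classes).
SHAPE_GROUPS = ["0Oo", "1lI", "5Ss", "2Zz"]
CODE = {ch: group[0] for group in SHAPE_GROUPS for ch in group}


def shape_code(text):
    """Replace every character by the representative of its shape class."""
    return "".join(CODE.get(ch, ch) for ch in text)


MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]
# The month keywords are coded with the same table, so that models and
# observations share one alphabet.
CODED_MONTHS = "|".join(re.escape(shape_code(m)) for m in MONTHS)

# Illustrative stand-ins for a numeric and a semi-numeric date model.
NUMERIC = re.compile(r"\b\d{1,2}[/.-]\d{1,2}[/.-]\d{2,4}\b")
SEMI_NUMERIC = re.compile(r"\b\d{1,2} (?:%s) \d{4}\b" % CODED_MONTHS)


def spot_dates(text):
    """Return date-like substrings of the shape-coded text."""
    coded = shape_code(text)
    return [m.group()
            for pattern in (NUMERIC, SEMI_NUMERIC)
            for m in pattern.finditer(coded)]
```

Note that `spot_dates` returns matches in coded form, so a month such as "Sep" comes back as "5ep". The design choice mirrors the motivation for shape coding: confusions between similar glyphs stop mattering once both the model and the input are expressed in the merged classes.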

Articles published on Text Line Images

Automatic Visual Features for Writer Identification: A Deep Learning Approach

Ground Truth for training OCR engines on historical documents in German Fraktur and Early Modern Latin

Human Reading Knowledge Inspired Text Line Extraction

Date-field retrieval in scene image and video frames using text enhancement and shape coding

Urdu Nastaliq recognition using convolutional–recursive deep learning

Word graphs size impact on the performance of handwriting document applications

Offline cursive Urdu-Nastaliq script recognition using multidimensional recurrent neural networks

Open-vocabulary recognition of machine-printed Arabic text using hidden Markov models

Arabic Handwriting Text Offline Recognition Using the HMM Toolkit (HTK)

Transcript mapping for handwritten Chinese documents by integrating character recognition model and geometric context

Automatic writer identification from text line images

Offline recognition of omnifont Arabic text using the HMM ToolKit (HTK)

Lexicon-driven segmentation and recognition of handwritten character strings for Japanese address reading

Recognition-based handwritten Chinese character segmentation using a probabilistic Viterbi algorithm
