Khatib Tahani,Karajeh Huda,Mohammad Hiba,Rajab Lama

doi:10.5897/sre2014.6146

Abstract

Text in scene images often provides vital semantic data for visual content understanding, indexing and analysis; as a result, text extraction has become a significant research area in image processing and computer vision. In this paper, we propose a new hybrid multilevel algorithm to extract text from various scene images. The algorithm converts the Red-Green-Blue (RGB) image into grayscale for color reduction. Next, in the image preprocessing phase, it applies edge detection and mathematical morphological operations to extract edges. The resulting binary image passes through three successive levels in a multilayer fashion. Connected-component labeling and text-candidate selection take place at each level, using different criteria. We used the structural features of connected components as the basic criteria for selecting candidate texts; those features include the area, width, length and condensed intensity mean of the connected components. Afterwards, horizontal projection profile analysis is used to further refine the candidate text areas and to eliminate non-text regions. The proposed algorithm is evaluated on a set of fifty images chosen from a well-known text-locating test dataset, KAIST. Extensive experiments show high robustness under different environments such as indoor, outdoor, shadow, night and light, and for different text properties such as varying font size, style, and complexity of backgrounds and textures. The algorithm effectively extracts textual content from scene images with average Precision, Recall, and F-Score of 90.1, 99, and 94.3%, respectively.

Key words: Multilevel text extraction, hybrid text extraction, edge detection, connected components, text candidates, morphological operations, horizontal projection profile.
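The later stages described in the abstract (connected-component labeling, candidate selection by structural features, and horizontal projection profile analysis) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the 4-connectivity choice, the thresholds (`min_area`, `max_area`, `max_aspect`) and the toy binary image are all assumptions made for illustration, and the edge-detection and morphology steps that would produce the binary image are omitted. The last helper only checks that the reported F-Score is consistent with the reported Precision and Recall, assuming the usual harmonic-mean definition.

```python
from collections import deque

def label_components(binary):
    """Return 4-connected foreground regions as lists of (row, col) pixels."""
    rows, cols = len(binary), len(binary[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if binary[r][c] and not seen[r][c]:
                queue, pixels = deque([(r, c)]), []
                seen[r][c] = True
                while queue:  # breadth-first flood fill of one region
                    y, x = queue.popleft()
                    pixels.append((y, x))
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        if (0 <= ny < rows and 0 <= nx < cols
                                and binary[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                components.append(pixels)
    return components

def features(pixels):
    """Structural features of one component: area and bounding-box size."""
    ys = [p[0] for p in pixels]
    xs = [p[1] for p in pixels]
    return {"area": len(pixels),
            "width": max(xs) - min(xs) + 1,
            "height": max(ys) - min(ys) + 1}

def select_candidates(components, min_area=3, max_area=50, max_aspect=5.0):
    """Keep components whose features fall inside hypothetical thresholds."""
    kept = []
    for pixels in components:
        f = features(pixels)
        aspect = f["width"] / f["height"]
        if min_area <= f["area"] <= max_area and aspect <= max_aspect:
            kept.append(pixels)
    return kept

def horizontal_projection(components, rows):
    """Count candidate pixels in each image row."""
    profile = [0] * rows
    for pixels in components:
        for y, _ in pixels:
            profile[y] += 1
    return profile

def f_score(precision, recall):
    """Harmonic mean of precision and recall (same units as the inputs)."""
    return 2 * precision * recall / (precision + recall)

# Toy binary image: one noise pixel, two small blobs, one long horizontal line.
image = [
    [0, 0, 0, 0, 0, 0, 0, 1],
    [0, 1, 1, 0, 1, 1, 0, 0],
    [0, 1, 1, 0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1, 1, 1],
]
components = label_components(image)
candidates = select_candidates(components)
profile = horizontal_projection(candidates, len(image))
text_rows = [y for y, count in enumerate(profile) if count > 0]
```

In this toy case the isolated pixel is rejected by the area threshold and the long line by the aspect-ratio threshold, so only the two small blobs contribute to the projection profile. A real pipeline would tune such thresholds per level, as the abstract indicates that each of the three levels applies different criteria.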

Highlights

The development of digital technologies accelerated the rapid growth in digital content
As digitalization is expanding in all categories and materials, it becomes important to extract any textual content from digital media to acquire semantic clues to help in visual content illustration and analysis
In Rajab et al. (2014), we presented a text extraction technique that employs image enhancement, morphological operations and different transformations in order to label text candidates

Summary

Introduction

The development of digital technologies has accelerated the rapid growth in digital content. As digitalization expands across all categories and materials, it becomes important to extract any textual content from digital media to acquire semantic clues that help in visual content illustration and analysis. As an essential form of digital media, images may include pieces of text that comprise useful information for automatic explanation and structuring of images (Mancas-Thillou et al., 2007).

Journal: Scientific Research and Essays	Publication Date: Feb 15, 2015
Citations: 3	License type: cc-by
