Abstract—In this paper, we present a robust algorithm to recognize extracted text from grocery product images captured by a cellular phone.

F. Einsele is with Bern University of Applied Sciences, 3005 Bern, Switzerland (e-mail: farshideh.einsele@bfh.ch). H. Foroosh is with the Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA (e-mail: foroosh@cs.ucf.edu).

I. INTRODUCTION

The use of digital cameras to capture text from natural scenes has motivated a rich body of work on invariant character recognition. One line of such work transforms shape signatures into the Fourier domain and ignores the phase information to gain rotation-invariant features; scale and translation invariance are obtained by normalization and by taking the centroid of the shape as the origin of the coordinate system (a minimal code sketch of such descriptors is given at the end of this section). The reported experimental results show that the centroid-distance and complex-coordinate signatures achieve high precision and recall rates, whereas the curvature and cumulative angular functions deliver unreliable results.

Dionisio et al. [11] also report a contour-based shape classification technique based on polygon approximation that is invariant under rotation and scaling. The vertices of the polygon approximation are formed by high-curvature points of the profile and are selected via the Fourier transform of the object contour. A series of features is computed from the polygonal approximation, and a minimum distance classifier is used for object recognition.

Although such contour-based invariants deliver promising results when Fourier descriptors are used for character recognition, and are also reported in the survey of Trier et al. [2], the reported test and training databases consist of synthetically deformed patterns, and the features are invariant only with respect to translation, scaling, and rotation; they do not account for other transformations occurring in real-world captured images (e.g., shearing, shadowing, bad illumination, and perspective distortion). Moreover, Trier et al. state in [2] that a statistical classification system should take the so-called curse of dimensionality into account, meaning that it should be training-based with a minimum number of training patterns that is 8-10 times larger than the number of chosen features (with 20 features, for example, at least 160-200 training patterns would be needed). As already stated, when the database must contain real-world character images, generating such a training set is expensive and time-consuming.

To sum up, the works mentioned above either rely on synthetically degraded databases or are training-based approaches. In this paper, we present a method for camera-based character recognition that uses a small real-world database extracted from images of grocery products captured by a cellular phone with a resolution of 5 megapixels. The presented method does not require a training set of the size dictated by the curse of dimensionality described above. It can therefore be applied directly to the extracted text, without cost-intensive image enhancement algorithms, and delivers promising results.

The remainder of this paper is organized as follows: Section II introduces the specificities of text in product images; Section III describes our proposed character recognition algorithm, including the feature extraction and classification methods used; Section IV presents our evaluation results; and Section V draws our conclusions and gives a short sketch of our future work.
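As a concrete illustration of the contour-based Fourier descriptors discussed above, the following minimal Python sketch computes translation-, scale-, and rotation-invariant features from a closed character contour. It is our own sketch, not the implementation of the cited works; the function name, the NumPy-based formulation, and the number of retained harmonics are our assumptions.

import numpy as np

def fourier_descriptors(contour, n_harmonics=8):
    # contour: (N, 2) array of ordered (x, y) points on a closed
    # character contour.
    # Complex coordinate signature z(t) = x(t) + j*y(t).
    z = contour[:, 0] + 1j * contour[:, 1]
    # Taking the centroid as the origin gives translation invariance.
    z = z - z.mean()
    coeffs = np.fft.fft(z)
    # Ignoring the phase (keeping only magnitudes) gives invariance
    # to rotation and to the choice of the contour starting point.
    mag = np.abs(coeffs)
    # Normalizing by the first harmonic gives scale invariance.
    mag = mag / mag[1]
    # Keep the lowest positive and negative frequencies as features
    # (index 0 is ~0 after centroid subtraction, index 1 is always 1).
    return np.concatenate((mag[2:2 + n_harmonics],
                           mag[-n_harmonics:]))

Two contours that differ only by translation, scaling, or rotation then map to nearly identical feature vectors.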
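The minimum distance classification step used by such approaches can be sketched equally briefly; here each class is represented by a prototype (e.g., mean) feature vector. Again, the names and the Euclidean metric are our assumptions, not the exact procedure of [11].

import numpy as np

def minimum_distance_classify(features, prototypes):
    # prototypes: dict mapping each class label to its prototype
    # feature vector; the label whose prototype is nearest to the
    # input in Euclidean distance wins.
    return min(prototypes,
               key=lambda label: np.linalg.norm(features - prototypes[label]))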
II. CHALLENGES OF PRODUCT TEXT RECOGNITION

We extract text from images of grocery products taken with cell phones. Text extraction from camera-based images is a relatively well-researched area with plenty of existing works in the literature [12]. However, text extraction from camera-based images is tightly coupled to the specific application, and no generic method is valid across different camera-based scenarios. We therefore use a text extraction algorithm that has been developed for the specificities of text on grocery products and is explained in detail in [13]. The resolution of the cell phone camera used is 5 megapixels, and the images are taken from different angles, with the camera at roughly the distance a common grocery shopper would have from the products when walking along the aisles. The extracted text mostly has a height between 20 and 50 pixels, and the characters can mostly be labeled and segmented using connected component algorithms (a minimal sketch of this step is given below). Table I shows some extracted words in our database.
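As an illustration of this segmentation step, the following sketch labels connected components in a binarized text image and returns candidate character bounding boxes. It is a minimal sketch assuming SciPy; the function name, the speckle-noise threshold, and the left-to-right sorting are our choices, not the exact procedure of [13].

from scipy import ndimage

def segment_characters(binary_text_image, min_area=20):
    # binary_text_image: 2-D array, nonzero where ink is present.
    labels, num = ndimage.label(binary_text_image)
    boxes = []
    for sl in ndimage.find_objects(labels):
        height = sl[0].stop - sl[0].start
        width = sl[1].stop - sl[1].start
        if height * width >= min_area:  # drop speckle noise
            # Store the box as (x, y, width, height).
            boxes.append((sl[1].start, sl[0].start, width, height))
    # Sorting by x approximates reading order within a text line.
    return sorted(boxes, key=lambda box: box[0])

Each returned box can then be cropped out of the image and passed to the character recognizer described in Section III.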