Text In Natural Scene Images Research Articles

Reading text in natural scene images is an active research area in computer vision and pattern recognition because it requires text detection, text recognition and script identification. In this data article, a comprehensive dataset for Urdu text detection and recognition in natural scene images is presented and analysed. To develop the dataset, more than 2500 natural scene images were captured using a digital camera and built-in mobile phone cameras. Three separate datasets were developed: isolated Urdu character images, cropped word images and end-to-end text spotting. The isolated Urdu character and cropped word image datasets contain a much larger number of samples than existing Arabic natural scene text datasets. The Urdu text spotting dataset contains images with Urdu, English and Sindhi text instances, but the focus is on the Urdu text instances. The ground truths for each image in the isolated character, cropped word and text spotting datasets are provided separately. The proposed datasets can be used to perform Urdu text detection and recognition or end-to-end recognition in natural scenes. They can also help in developing Arabic and Persian natural scene text detection and recognition systems, as the Urdu script derives from the Arabic and Persian scripts and shares many letters with them. The datasets can also help in developing multi-language translation systems that assist foreign tourists in reading and translating multilingual text in natural scene images. To evaluate the datasets, text detection and recognition models were built using state-of-the-art machine learning methods and deep neural networks, which achieved the best classification accuracies. To the best of the authors’ knowledge, this is the first dataset proposed for Urdu text detection, recognition or end-to-end text recognition in natural scene images.
The aim of this data article is to present a benchmark work in the field of document analysis and recognition.

Computer Science
Computer Vision and Pattern Recognition
Tables, Figures, Images, Text Files
Using a digital camera with a 20 megapixels (MP) sensor, an iPhone with a 12 MP back camera and a Samsung mobile with a 16 MP back camera.
Raw, Analyzed
Environmental factors such as illuminations, blurring and lighting conditions were considered while capturing images. The focus was given to the text within an image.
The images in the dataset were obtained from the advertisement banners, sign-boards along the road side and streets, shop name boards, text written on the passing vehicles and walls.
The images provided in this dataset were collected in different cities of Sindh, Pakistan.
Summarized data are hosted with the article.
The datasets and their related files are hosted in a Mendeley public data repository.
DOI: https://data.mendeley.com/datasets/k5fz57zd9z/1
URL: http://dx.doi.org/10.17632/k5fz57zd9z.1

The rapid development of technology and of the multimedia capabilities of digital cameras and mobile devices has led to an ever-increasing number of images and other multimedia data in the digital world. In natural scene images in particular, text content provides explicit information for understanding the semantics of the image. A system that accurately extracts and recognizes text from natural scene images in real time is therefore relevant to numerous applications, such as assistive technology for people with vision impairment, help for tourists facing a language barrier, vehicle number plate detection, reading street signs and advertisement billboards, and robotics. Extracting text from natural scene images is a formidable task because of large variations in character fonts, styles, sizes and text orientations, complex backgrounds and varying lighting conditions. The main focus of this research paper is to propose a novel hybrid approach for automatic detection, localization, extraction and recognition of text in natural scene images with cluttered backgrounds. First, image regions containing text are detected using edge features (GLCM) extracted from the Contourlet-transformed image and an SVM (Support Vector Machine) classifier. Second, horizontal projection is applied to the text regions to segment lines, and vertical projection is applied to each text line to segment characters. The proposed text extraction method achieved a precision, recall, F-score and accuracy of 98.50%, 90.85.62%, 95.00% and 89.90%, respectively, which indicates that the method is efficient. The extracted characters are then recognized using the contourlet transform and a Probabilistic Neural Network (PNN) classifier, with moment invariants as the computed features. Only the English script is considered in the experiments. The proposed character recognition method has an accuracy of 79.07%, which is higher than the 75.15% obtained by a KNN (K-Nearest Neighbors) classifier.
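
The line and character segmentation step described in this abstract can be illustrated with a minimal projection-profile sketch. This is not the authors' implementation. It assumes the text region has already been detected and binarized into a grid of 0 (background) and 1 (ink). The function names (ink_runs, segment_lines, segment_characters) and the min_ink threshold are hypothetical and chosen only for illustration.

```python
def ink_runs(profile, min_ink=1):
    """Return (start, end) pairs, end exclusive, for consecutive
    positions whose ink count is at least min_ink."""
    runs, start = [], None
    for i, count in enumerate(profile):
        if count >= min_ink and start is None:
            start = i                      # a run of ink begins
        elif count < min_ink and start is not None:
            runs.append((start, i))        # a gap closes the run
            start = None
    if start is not None:                  # run reaches the last position
        runs.append((start, len(profile)))
    return runs


def segment_lines(binary):
    """Horizontal projection: sum the ink in each row, then split on empty rows."""
    return ink_runs([sum(row) for row in binary])


def segment_characters(binary, line):
    """Vertical projection inside one text line: sum the ink in each
    column of the line band, then split on empty columns."""
    top, bottom = line
    band = binary[top:bottom]
    return ink_runs([sum(col) for col in zip(*band)])


# Toy 10 x 12 binary region: two "characters" on the first line, one on the second.
demo = [[0] * 12 for _ in range(10)]
for r in range(1, 4):
    for c in (1, 2, 5, 6, 7):
        demo[r][c] = 1
for r in range(6, 9):
    for c in (2, 3, 4):
        demo[r][c] = 1

lines = segment_lines(demo)
chars = [segment_characters(demo, line) for line in lines]
print(lines)
print(chars)
```

A plain gap-based split like this only separates characters that do not touch, which fits the abstract's restriction to English script. Connected or cursive writing would need a different segmentation strategy.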

Articles published on Text In Natural Scene Images

A Comprehensive Review on Text Detection and Recognition in Scene Images

An End-to-End Scene Text Recognition for Bilingual Text

Deep Learning Techniques for Detecting and Segmenting Text in Natural Scene Images: Review

Irregular Scene Text Detection Based on a Graph Convolutional Network.

Deep-Learning-Based Complex Scene Text Detection Algorithm for Architectural Images

Dominating set based arbitrary oriented bilingual scene text localization

Local Resultant Gradient Vector Difference and Inpainting for 3D Text Detection in the Wild

Contour feature learning for locating text in natural scene images

A new method for detection and prediction of occluded text in natural scene images

Urdu text in natural scene images: a new dataset and preliminary text detection.

Rule-based perspective rectification for Chinese text in natural scene images

A comparative approach on detecting multi-lingual and multi-oriented text in natural scene images

Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images.

Text detection in natural scene images using morphological component analysis and Laplacian dictionary

Text Extraction and Recognition in Natural Scene Images using Contourlet Transform and PNN

Real-time localization of multi-oriented text in natural scene images using a linear spatial filter

Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition

Multi-level Fuzzy Based Renyi Entropy for Linguistic Classification of Texts in Natural Scene Images

Scene word recognition from pieces to whole
