The text in a low-resolution (LR) image is usually hard to read. Super-resolution (SR) is an intuitive solution to this issue. Existing single image super-resolution (SISR) models are mainly trained on synthetic datasets whose LR images are obtained by applying bicubic interpolation or Gaussian blur to high-resolution (HR) images. However, these models hardly generalize to practical scenarios because real-world LR images suffer from more complex degradations and are therefore more difficult to super-resolve. The newly proposed TextZoom dataset is the first dataset for real-world text image super-resolution. We propose a new model, termed TSRGAN, trained on this dataset. First, a discriminator is designed to prevent the SR network from generating over-smoothed images. Second, we introduce triplet attention into the SR network for better representational ability. Third, in addition to the L2 and adversarial losses, a wavelet loss is incorporated to help reconstruct sharper character edges. Since TextZoom provides text labels, the recognition accuracy of a scene text recognition (STR) model can be used to evaluate the quality of SR images; it reflects the performance of text image SR models better than traditional SR evaluation metrics such as PSNR and SSIM. Comprehensive experiments show the superiority of our TSRGAN. Compared with the state-of-the-art method, the proposed TSRGAN improves the average recognition accuracy of ASTER, MORAN and CRNN on TextZoom by 0.8%, 1.5% and 3.2%, respectively.
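To make the composition of the generator objective concrete, here is a minimal PyTorch-style sketch combining the three loss terms named above. The single-level Haar transform used for the wavelet loss, the function names, and the weighting coefficients lambda_adv and lambda_wav are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def haar_dwt(x):
    # Single-level Haar wavelet transform via fixed 2x2 filters (stride 2),
    # applied depthwise so each input channel yields LL/LH/HL/HH subbands.
    ll = torch.tensor([[0.5, 0.5], [0.5, 0.5]])
    lh = torch.tensor([[0.5, 0.5], [-0.5, -0.5]])
    hl = torch.tensor([[0.5, -0.5], [0.5, -0.5]])
    hh = torch.tensor([[0.5, -0.5], [-0.5, 0.5]])
    filt = torch.stack([ll, lh, hl, hh]).unsqueeze(1)          # (4, 1, 2, 2)
    c = x.shape[1]
    filt = filt.repeat(c, 1, 1, 1).to(x.device, x.dtype)       # (4*C, 1, 2, 2)
    return F.conv2d(x, filt, stride=2, groups=c)               # (N, 4*C, H/2, W/2)

def generator_loss(sr, hr, d_fake, lambda_adv=1e-3, lambda_wav=1e-1):
    # Hypothetical weighting; the abstract does not state the coefficients.
    l2 = F.mse_loss(sr, hr)                                    # pixel-wise L2 loss
    adv = F.binary_cross_entropy_with_logits(
        d_fake, torch.ones_like(d_fake))                       # fool the discriminator
    wav = F.l1_loss(haar_dwt(sr), haar_dwt(hr))                # wavelet-domain loss
    return l2 + lambda_adv * adv + lambda_wav * wav
```

The wavelet term penalizes discrepancies in the high-frequency subbands directly, which is one plausible way a wavelet loss encourages sharper character edges than a pixel-wise L2 loss alone.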