Traffic text detection is an important research task, as it can provide rich semantic information for autonomous driving. Although major breakthroughs have been achieved on conventional datasets, existing scene text detectors generally suffer significant performance degradation in real-life traffic scenes, because various extreme scenarios introduce a large domain shift. To realize robust traffic text detection under scene changes, we propose a novel network for cross-domain traffic text detection that integrates text detection and domain adaptation into one framework. In the text detection pipeline, we introduce a Multigranularity Text Proposal Network (MG-TPN) to generate fine-grained and coarse-grained text proposals, which interact deeply during both training and inference, helping the pipeline learn more robust text features and produce more accurate detection results. To transfer text detection ability from common scenes to unlabeled extreme traffic scenes, we propose an inter- and intra-domain adaptation (I2-DA) strategy, which fully exploits domain-invariant features between the source and target domains (inter-domain) as well as among the multiple extreme scenarios within the target domain (intra-domain). To the best of our knowledge, this is the first study of cross-domain text detection under extreme traffic scenes. Extensive experiments on traffic text datasets and standard benchmarks, including SynthText, VISD, ICDAR2013, and ICDAR2015, validate the superiority of our method. The two proposed datasets (CTST and ES-CTST) are available at https://github.com/pummi823/test.
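
The abstract does not specify how I2-DA mines domain-invariant features; a common way to do so is adversarial feature alignment with a gradient-reversal layer (DANN-style). The sketch below is only a minimal, hypothetical illustration of how inter-domain (source vs. target) and intra-domain (across extreme target scenarios) alignment might be combined; all names (GradReverse, DomainAlign, lambda_grl) and the toy scenario labels are assumptions, not taken from the paper.

```python
# Hypothetical sketch, not the paper's implementation: a gradient-reversal
# domain classifier combining inter-domain and intra-domain alignment.
import torch
import torch.nn as nn
import torch.nn.functional as F


class GradReverse(torch.autograd.Function):
    """Identity in the forward pass, negated (scaled) gradient in the backward pass."""

    @staticmethod
    def forward(ctx, x, lambda_grl):
        ctx.lambda_grl = lambda_grl
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lambda_grl * grad_output, None


class DomainAlign(nn.Module):
    """Two discriminators: inter-domain (source vs. target) and
    intra-domain (which extreme scenario within the target domain)."""

    def __init__(self, feat_dim, num_target_scenarios, lambda_grl=1.0):
        super().__init__()
        self.lambda_grl = lambda_grl
        self.inter_head = nn.Sequential(
            nn.Linear(feat_dim, 256), nn.ReLU(), nn.Linear(256, 2))
        self.intra_head = nn.Sequential(
            nn.Linear(feat_dim, 256), nn.ReLU(),
            nn.Linear(256, num_target_scenarios))

    def forward(self, feats, domain_labels, scenario_labels=None):
        rev = GradReverse.apply(feats, self.lambda_grl)
        # Inter-domain: confuse the source/target discriminator.
        loss_inter = F.cross_entropy(self.inter_head(rev), domain_labels)
        # Intra-domain: confuse the scenario discriminator on target samples only.
        loss_intra = feats.new_zeros(())
        tgt = domain_labels == 1
        if scenario_labels is not None and tgt.any():
            loss_intra = F.cross_entropy(
                self.intra_head(rev[tgt]), scenario_labels[tgt])
        return loss_inter + loss_intra


# Toy usage: pooled text-region features from source (label 0) and target (label 1),
# with target samples tagged by an assumed scenario id (e.g. night / rain / glare).
feats = torch.randn(8, 512)
domain_labels = torch.tensor([0, 0, 0, 0, 1, 1, 1, 1])
scenario_labels = torch.tensor([0, 0, 0, 0, 2, 0, 1, 2])
loss = DomainAlign(512, num_target_scenarios=3)(feats, domain_labels, scenario_labels)
loss.backward()
```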