Abstract
The label from industrial commodity packaging usually contains important data, such as production date, manufacturer, and other commodity-related information. As such, those labels are essential for consumers to purchase goods, help commodity supervision, and reveal potential product safety problems. Consequently, packaging label detection, as the prerequisite for product label identification, becomes a very useful application, which has achieved promising results in the past decades. Yet, in complex industrial scenarios, traditional detection methods are often unable to meet the requirements, which suffer from many problems of low accuracy and efficiency. In this paper, we propose a multifeature fast and attention-based algorithm using a combination of area suggestion and semantic segmentation. This algorithm is an attention-based efficient and multifeature fast text detector (termed AEMF). The proposed approach is formed by fusing segmentation branches and detection branches with each other. Based on the original algorithm that can only detect text in any direction, it is possible to detect different shapes with a better accuracy. Meanwhile, the algorithm also works better on long-text detection. The algorithm was evaluated using ICDAR2015, CTW1500, and MSRA-TD500 public datasets. The experimental results show that the proposed multifeature fusion with self-attention module makes the algorithm more accurate and efficient than existing algorithms. On the MSRA-TD500 dataset, the AEMF algorithm has an F-measure of 72.3% and a frame per second (FPS) of 8. On the CTW1500 dataset, the AEMF algorithm has an F-measure of 62.3% and an FPS of 23. In particular, the AEMF algorithm has achieved an F-measure of 79.3% and an FPS of 16 on the ICDAR2015 dataset, demonstrating the excellent performance in detecting label text on industrial packaging.
Highlights
In recent years, with the continuous development of industry, the quality and safety of commodities have attracted more and more attention. erefore, textual information from the product labels needs to be recognized
This paper proposes a novel detection algorithm based on a combination of region suggestion and semantic segmentation, through multimodel integration
Comparison with State-of-the-Art. e following three tables show the comparison of the recall, accuracy, average score, and frames per second (FPS) for the AEMF algorithm, using the VGG16 and ResNet50-based backbone network, against existing methods
Summary
With the continuous development of industry, the quality and safety of commodities have attracted more and more attention. erefore, textual information from the product labels needs to be recognized. Traditional text detection algorithms usually employ hand-crafted features such as edge gradients, directional gradient histograms, and local binarization to classify candidate regions into text and nontext areas. Such hand-crafted features fail to accurately describe or capture complex textual domains in natural scenes. Existing methods are limited in terms of text localization in complex industrial environments To overcome these issues, this paper proposes a novel detection algorithm based on a combination of region suggestion and semantic segmentation, through multimodel integration. This paper proposes a novel detection algorithm based on a combination of region suggestion and semantic segmentation, through multimodel integration While the former is used to predict candidate text boxes, the latter is to detect the candidate text region. (iii) e proposed algorithm combines area suggestion and semantic segmentation for test detection, thereby improving the detection accuracy and speed
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.