In the traditional manufacturing industry, the safe operation of storage and retrieval (S/R) machines is vital for efficient automated warehouse management. Recently, deep learning-based monocular vision approaches have emerged as a promising solution for collision avoidance in automated storage and retrieval systems (AS/RS). However, the scarcity of specific datasets and the need for high-performance, lightweight neural network models pose challenges in the AS/RS domain. To address these challenges, this paper introduces a novel workflow for implementing a monocular anti-collision system for S/R machines. A semi-supervised annotation method leveraging a pretrained clustering model is proposed to expedite image collection and classification, resulting in a tailored dataset. Furthermore, the NightMix technique is introduced, combining Grayscale Enhancement and Floating Grid Mix method to enhance dataset diversity and capacity. Notably, a residual attention awareness module is integrated into MobileNet V3 to enhance its feature processing capabilities. Finally, the proposed approaches are validated through extensive experiments involving various design choices. The experimental results demonstrate that our methods are not only computationally efficient but also highly competitive in terms of learning performance.
Read full abstract