Abstract

In recent years, the culture and fishing of precious marine product are heavily dependent on manual work, which is labor-intensive, high-cost and time-consuming. To address this issue, an underwater robot can be used to monitor the size of the marine products and fish the mature ones automatically. Automatic detection of marine products from underwater images is one of the most important steps in developing an underwater robot perceiving method. In the traditional detection model, the CNN based backbone suffers from the limited receptive field and hinders the modeling of long-range dependencies, due to the small kernel size. In this paper, a novel detection model FFT_YOLOX based on a modified YOLOX is proposed. Firstly, a unique FFT_Filter is presented, which is a computational efficient and conceptually simple architecture to capture global information of images. Then, a novel FFT_YOLOX model is introduced with fewer model parameters and FLOPs by replacing the standard 3 × 3 kernel in the original backbone of the YOLOX model with a FFT_Filter, for an underwater object detection vision task. Extensive experimental results demonstrate the effectiveness and generalization of the visual representation of our proposed FFT_YOLOX model.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call