Abstract

The segmentation of forward-looking sonar (FLS) image could assist underwater vehicles to recognize and measure underwater crash objects. Due to the complex noise and blurred object edge information in FLS image, the accurate segmentation result requires the model to have strong feature extraction ability. The CNN-based semantic segmentation networks focus too much on local information, which may amplify the complex noise. And their computational overhead is high. To address these problems, we construct a novel efficient Mix Transformer U-like network named MiTU-Net for FLS image segmentation. In addition, we introduce the online hard example mining (OHEM) crossentropy loss function to improve the learning ability of hard samples in dataset. We have carried out a series of experiments on the self-made FLS dataset. The experimental results demonstrate that MiTU-Net has better performance than other methods, and it shows effectiveness and robustness for FLS image segmentation task.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call