Abstract

The poor quality of optical imaging caused by the complex and varying underwater environment is a significant challenge to underwater target recognition. Moreover, the insufficiency of relevant datasets may lead to the overfitting problem in target recognition models based on deep learning. Taking the instance segmentation of three underwater creatures (echinus, holothurian, and starfish) as an example, we propose a new method for recognition of underwater creatures. It combines the MSRCR (multi-scale Retinex with color restoration) image enhancement algorithm and the Mask R-CNN (region-based convolutional neural work) framework, and achieves a mAP (mean average accuracy) value higher than 90% on a small sample dataset. This method consists of three major steps. First, the dataset with 84 images is augmented (flip, adding noise, and GAN (generative adversarial networks)) to 430 images, and all images are enhanced with MSRCR to improve their qualities; Second, the model is pre-trained on the COCO (Microsoft common objects in context) dataset to shorten the training time and overcome overfitting; Finally, the pre-trained model is transferred to the underwater dataset, and the whole training process is completed. We achieve 97.46% precision and 94.52% recall, and the mAP (intersection over union (IOU) = 50) is 94.84%. The effectiveness of the proposed method is verified by comparing it with several popular target recognition models, including SSD (Single Shot Detector), YOLOv3 (You only look once), original Mask R-CNN, and a SIFT-based (Scale-invariant feature transform) model.

Highlights

  • Seventy-one percent of the Earth’s surface is occupied by oceans, which contain rich resources [1]

  • The comparative models include a target detection model based on SIFT and the deep learning methods: SSD [23], YOLOv3 [48], and Mask R-CNN (MRCNN)

  • The SIFT-based model achieves a 100% precision, but it is far worse than deep learning models in mAP and Recall, and its speed is the slowest

Read more

Summary

Introduction

Seventy-one percent of the Earth’s surface is occupied by oceans, which contain rich resources [1]. INDEX TERMS Object recognition, mask R-CNN, image enhancement, underwater creature. S. Song et al.: Integrate MSRCR and Mask R-CNN to Recognize Underwater Creatures on Small Sample Datasets

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call