MSGNet: multi-source guidance network for fish segmentation in underwater videos

Peng Zhang,Hong Yu,Yuanshan Lin,Wan Tu,Xin Zhang,Zongyi Yang,Haiqing Li,Sixue Wei,Junfeng Wu

doi:10.3389/fmars.2023.1256594

Abstract

Fish segmentation in underwater videos provides basic data for fish measurements, which is vital information that supports fish habitat monitoring and fishery resources survey. However, because of water turbidity and insufficient lighting, fish segmentation in underwater videos has low accuracy and poor robustness. Most previous work has utilized static fish appearance information while ignoring fish motion in underwater videos. Considering that motion contains more detail, this paper proposes a method that simultaneously combines appearance and motion information to guide fish segmentation in underwater videos. First, underwater videos are preprocessed to highlight fish in motion, and obtain high-quality underwater optical flow. Then, a multi-source guidance network (MSGNet) is presented to segment fish in complex underwater videos with degraded visual features. To enhance both fish appearance and motion information, a non-local-based multiple co-attention guidance module (M-CAGM) is applied in the encoder stage, in which the appearance and motion features from the intra-frame salient fish and the moving fish in video sequences are reciprocally enhanced. In addition, a feature adaptive fusion module (FAFM) is introduced in the decoder stage to avoid errors accumulated in the video sequences due to blurred fish or inaccurate optical flow. Experiments based on three publicly available datasets were designed to test the performance of the proposed model. The mean pixel accuracy (mPA) and mean intersection over union (mIoU) of MSGNet were 91.89% and 88.91% respectively with the mixed dataset. Compared with those of the advanced underwater fish segmentation and video object segmentation models, the mPA and mIoU of the proposed model significantly improved. The results showed that MSGNet achieves excellent segmentation performance in complex underwater videos and can provide an effective segmentation solution for fisheries resource assessment and ocean observation. The proposed model and code are exposed via Github1.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Marine Science	Publication Date: Sep 21, 2023
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

MSGNet: multi-source guidance network for fish segmentation in underwater videos

Abstract

Talk to us

Similar Papers

More From: Frontiers in Marine Science

Lead the way for us

Similar Papers

Improved deep learning framework for fish segmentation in underwater videos
Nawaf Farhan Funkur Alshdaifat ... Mohd Azam Osman
Ecological Informatics | VOL. 59
Nawaf Farhan Funkur Alshdaifat, et. al.Nawaf Farhan Funkur Alshdaifat ... Mohd Azam Osman
03 Jul 2020
Ecological Informatics | VOL. 59

Study on the Influence of Label Image Accuracy on the Performance of Concrete Crack Segmentation Network Models.
Kaifeng Ma ... Junzhen Meng
Sensors (Basel, Switzerland) | VOL. 24
Kaifeng Ma, et. al.Kaifeng Ma ... Junzhen Meng
06 Feb 2024
Sensors (Basel, Switzerland) | VOL. 24

Unsupervised Spatio-Temporal Segmentation with Sparse Spectral-Clustering
Gabe Sibley ... Mahsa Ghafarianzadeh
-
Gabe Sibley, et. al.Gabe Sibley ... Mahsa Ghafarianzadeh
01 Jan 2014
01 Jan 2014

An Identification Method for Mixed Coal Vitrinite Components Based on An Improved DeepLabv3+ Network
Fujie Wang ... Fanfan Li
Energies | VOL. 17
Fujie Wang, et. al.Fujie Wang ... Fanfan Li
13 Jul 2024
Energies | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MSGNet: multi-source guidance network for fish segmentation in underwater videos

Abstract

Talk to us

Similar Papers

More From: Frontiers in Marine Science