Non-invasive methods of accurate estimation of fish weight are essential for biomass assessment, precision feeding, and effective management in aquaculture. However, the lack of large-scale stereo matching datasets and costly depth sensing equipment in underwater scenarios present significant challenges to accurately measure the size of free-swimming fish. This study introduces a novel NeRF-supervised stereo matching network for precise depth estimation and 3D reconstruction of the fish body. The state-of-the-art neural rendering methods are employed to generate stereo training data from image sequences. In this study, the residual network with deformable convolutions was proposed to extract contextual texture features, significantly enhancing disparity estimation accuracy at the edge of the fish body. Additionally, multiscale spatial features are fused using a feature pyramid network (FPN). To enhance the adaptability of disparity search, we introduce a lightweight Triplet Attention module to capture cross-dimensional dependencies prior to constructing the correlation volume. Experimental results demonstrate the effectiveness of the proposed depth estimation method, achieving an impressive result of 92.35% in the similarity metric, compared to the ground truth. Moreover, we developed a weight estimation model based on bass perimeter using instance segmentation and 3D reconstruction techniques. Notably, a robust correlation (correlation coefficient of 0.98) between fish perimeter and weight is observed, highlighting the potential application of our methods to enhance the accuracy of weight estimation for free-swimming fish.
Read full abstract7-days of FREE Audio papers, translation & more with Prime
7-days of FREE Prime access