Factors such as the special optical characteristics of water bodies degrade the perceptual quality of generated underwater images. To address this, this paper proposes two models: DifSG2-CCL, which reduces the influence of those optical characteristics, and DPL-SG2, which introduces a perceptual loss. Combining the ideas of cycle consistency and style transfer, the paper builds the Underwater Cycle Consistency Loss (U-CCL) module. DifSG2-CCL is based on image reconstruction: it converts underwater images into the style of land images to reduce the influence of water-body factors. A VGG16-based perceptual loss is introduced into DPL-SG2 to enhance the visual perception of the image through feature extraction at different layers and tonal weighting. In addition to the publicly available SA dataset, this paper provides a T dataset of 9,366 images at a resolution of 256 × 256. The experimental results show that DifSG2-CCL and DPL-SG2 both effectively enhance the perceptual quality of the generated images. The underwater image generation of DifSG2-CCL produces excellent results in qualitative analysis and reduces its FID value to 8.97. DPL-SG2 performs particularly well when trained on the T dataset, where its FID value is reduced to 5.39. U-CCL and the VGG16 perceptual loss can therefore be applied as a new approach to enhancing the visual perception of underwater images. The experimental code with pre-trained models will be published shortly at https://github.com/yff0428/DPL-SG2/tree/main.