Abstract
Underwater image datasets are crucial in underwater vision research. Because of the strong absorption and scattering effects that occur underwater, some ground truth such as the depth map, which can be easily collected in-air, becomes a great challenge in underwater environments. To solve the issues associated with the lack of underwater ground truth, we propose a trainable end-to-end system of an underwater multistyle generative adversarial network (UMGAN) that takes advantage of a cycle-consistent adversarial network (CycleGAN) and conditional generative adversarial networks. This system can generate multiple realistic underwater images from in-air images using a hybrid adversarial system and an unpaired method. Moreover, our model can translate in-air images to underwater images that retain the main content and structural information of the in-air images under specified turbidities or water styles through a style classifier and a conditional vector. Furthermore, we define the color loss and include the structural similarity index measure loss for the system to preserve the content and structure of original in-air images while transferring the backgrounds of the images from air to water. Using UMGAN, we can take advantage of the in-air ground truth and convert the corresponding in-air images into an underwater dataset with multiple water color styles. Our experiments demonstrate that our synthesized underwater images have a high score on image assessment against CycleGAN, WaterGAN, StarGAN, AdaIN, and other state-of-the-art methods. We also show that our synthesized underwater images with in-air depths can be applied to real underwater image depth map estimation.
Highlights
Underwater vision is fundamental to marine research
We prove that the conditional label can represent extra information when synthesizing images using generative adversarial networks (GANs) [16], [17]
We show the effectiveness of our synthesized images by applying them to construct a semi-real RGB-D underwater image dataset for use in underwater depth estimation
Summary
Underwater vision is fundamental to marine research. For example, underwater vision systems can provide necessary support to marine biology research in heavily degraded georeferenced underwater images [1], and marine researchers can use underwater vision devices, such as stereo imaging platforms, to reconstruct high-resolution, large-scale optical 3D maps of underwater areas by studying the distribution and occurrence of submerged objects [2]. The underwater environment restricts some widely used in-air vision devices, such as Kinect units [3] and binocular stereo cameras [4]. These devices do not work well for image acquisition or display in underwater environments because of the strong absorption and scattering effects that take place there. For these reasons, underwater image collection is costly, and it is more difficult to measure ground-truth color, shape or depth information in underwater environments.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.