Abstract
Infrared images can provide clear contrast information to distinguish between the target and the background under any lighting conditions. In contrast, visible images can provide rich texture details and are compatible with the human visual system. The fusion of a visible image and infrared image will thus contain both comprehensive contrast information and texture details. In this study, a novel approach for the fusion of infrared and visible images is proposed based on a dual-discriminator generative adversarial network with a squeeze-and-excitation module (DDGANSE). Our approach establishes confrontation training between one generator and two discriminators. The goal of the generator is to generate images that are similar to the source images, and contain the information from both infrared and visible source images. The purpose of the two discriminators is to increase the similarity between the image generated by the generator and the infrared and visible images. We experimentally demonstrated that using continuous adversarial training, DDGANSE outputs images retain the advantages of both infrared and visible images with significant contrast information and rich texture details. Finally, we compared the performance of our proposed method with previously reported techniques for fusing infrared and visible images using both quantitative and qualitative assessments. Our experiments on the TNO dataset demonstrate that our proposed method shows superior performance compared to other similar reported methods in the literature using various performance metrics.
Highlights
Image fusion is a technique to gather all the important information from multiple images to produce fewer images or a single comprehensive and informative image for subsequent processing functions [1]
We evaluate the performance of the proposed DDGANSE using the TNO dataset
The fused image shows that DDGANSE is clearly better than the other popular fusion methods, including FusionGAN, GANMcC, PMGI, and DDCGAN
Summary
Image fusion is a technique to gather all the important information from multiple images to produce fewer images or a single comprehensive and informative image for subsequent processing functions [1]. The image fusion technique is widely used in infrared and visible images for object detection and target recognition applications [2–4]. Infrared (IR) images can provide enhanced contrast between the target object and the background which is an important feature in imaging systems. Visible images have the advantage of higher resolution and detailed texture information at the expense of poor contrast between target object and the background. The visible images contain rich texture details but poor contrast We added SE networks to the generator to assist in learning the correlation between channels, screen out attention for the channel, and further improving the performance of the network. Adversarial learning between the generation and discrimination networks can effectively correct the prediction error
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.