Abstract

Infrared images can provide clear contrast information to distinguish between the target and the background under any lighting conditions. In contrast, visible images can provide rich texture details and are compatible with the human visual system. The fusion of a visible image and infrared image will thus contain both comprehensive contrast information and texture details. In this study, a novel approach for the fusion of infrared and visible images is proposed based on a dual-discriminator generative adversarial network with a squeeze-and-excitation module (DDGANSE). Our approach establishes confrontation training between one generator and two discriminators. The goal of the generator is to generate images that are similar to the source images, and contain the information from both infrared and visible source images. The purpose of the two discriminators is to increase the similarity between the image generated by the generator and the infrared and visible images. We experimentally demonstrated that using continuous adversarial training, DDGANSE outputs images retain the advantages of both infrared and visible images with significant contrast information and rich texture details. Finally, we compared the performance of our proposed method with previously reported techniques for fusing infrared and visible images using both quantitative and qualitative assessments. Our experiments on the TNO dataset demonstrate that our proposed method shows superior performance compared to other similar reported methods in the literature using various performance metrics.

Highlights

  • Image fusion is a technique to gather all the important information from multiple images to produce fewer images or a single comprehensive and informative image for subsequent processing functions [1]

  • We evaluate the performance of the proposed DDGANSE using the TNO dataset

  • The fused image shows that DDGANSE is clearly better than the other popular fusion methods, including FusionGAN, GANMcC, PMGI, and DDCGAN

Read more

Summary

Introduction

Image fusion is a technique to gather all the important information from multiple images to produce fewer images or a single comprehensive and informative image for subsequent processing functions [1]. The image fusion technique is widely used in infrared and visible images for object detection and target recognition applications [2–4]. Infrared (IR) images can provide enhanced contrast between the target object and the background which is an important feature in imaging systems. Visible images have the advantage of higher resolution and detailed texture information at the expense of poor contrast between target object and the background. The visible images contain rich texture details but poor contrast We added SE networks to the generator to assist in learning the correlation between channels, screen out attention for the channel, and further improving the performance of the network. Adversarial learning between the generation and discrimination networks can effectively correct the prediction error

DDGANSE of the Generator
Experiments
Training Details
Performance Metrics
Results for the TNO Dataset (1) Qualitative Comparison
Method
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call