Abstract
In this paper, we propose a target detection algorithm based on adversarial discriminative domain adaptation for infrared and visible image fusion, using unsupervised learning to reduce the differences between multimodal image information. First, we improve a fusion model based on the generative adversarial network and use a fusion algorithm based on a dual-discriminator generative adversarial network to generate high-quality infrared-visible fused images; the infrared and visible images are then combined into a triplet dataset, and a triplet loss function is used for transfer learning. Finally, the fused images are used as input to the Faster R-CNN object detection algorithm, and a new non-maximum suppression algorithm is used to improve Faster R-CNN, further increasing target detection accuracy. Experiments show that the method achieves mutual complementation of multimodal feature information, compensates for the lack of information in single-modal scenes, and achieves good detection results on information from both modalities (infrared and visible light).
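The abstract does not give the exact form of the triplet loss used for transfer learning (the original phrasing, "triple angular loss," may refer to an angular variant). The following is a minimal sketch of a standard triplet margin loss in PyTorch; pairing the fused image as anchor with infrared and visible embeddings as positive and negative is an illustrative assumption, not the paper's specification.

```python
import torch
import torch.nn.functional as F

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss: pull the anchor toward the positive
    embedding and push it away from the negative by at least `margin`.

    anchor/positive/negative: (batch, dim) embedding tensors.
    """
    d_pos = F.pairwise_distance(anchor, positive, p=2)  # anchor-positive distance
    d_neg = F.pairwise_distance(anchor, negative, p=2)  # anchor-negative distance
    return F.relu(d_pos - d_neg + margin).mean()

# Illustrative usage: embeddings of fused, infrared, and visible patches.
fused_emb = torch.randn(8, 128)
ir_emb = torch.randn(8, 128)
vis_emb = torch.randn(8, 128)
loss = triplet_loss(fused_emb, ir_emb, vis_emb)
```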
Highlights
With the rapid development of deep learning, target detection in computer vision has made great progress
In response to the above problems, this paper starts from the perspective of adversarial discriminative domain adaptation [4], uses an unsupervised learning method to reduce the modal difference between bimodal images, and proposes a modal information fusion detection algorithm based on a generative adversarial network
In the improved generative adversarial network, the generator is designed with local detail features and global semantic features to extract source image details and semantic information, and a perceptual loss is added to the discriminator to keep the data distribution of the fused image consistent with the source images and improve fusion accuracy (a sketch of such a perceptual loss follows these highlights). The fused features enter the region-of-interest pooling network for coarse classification, the generated candidate boxes are mapped onto the feature map, and target classification and localization are completed through the fully connected layers
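The highlights do not specify which feature extractor the perceptual loss compares images in; a frozen VGG-16 is a common choice, as in the hedged sketch below. The layer cut-off and the use of MSE in feature space are illustrative assumptions, not the paper's configuration.

```python
import torch
import torch.nn.functional as F
from torchvision.models import vgg16

class PerceptualLoss(torch.nn.Module):
    """Compare fused and source images in the feature space of a frozen
    VGG-16, encouraging the fused image to match the source distribution."""

    def __init__(self, layer_idx=16):  # slice ends at relu3_3; illustrative choice
        super().__init__()
        self.features = vgg16(weights="IMAGENET1K_V1").features[:layer_idx].eval()
        for p in self.features.parameters():
            p.requires_grad = False  # the feature extractor stays fixed

    def forward(self, fused, source):
        return F.mse_loss(self.features(fused), self.features(source))

# Illustrative usage: 3-channel image batches in [0, 1].
loss_fn = PerceptualLoss()
fused = torch.rand(2, 3, 224, 224)
visible = torch.rand(2, 3, 224, 224)
loss = loss_fn(fused, visible)
```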
Summary
With the rapid development of deep learning, target detection in computer vision has made great progress. With the successful application of deep convolutional neural networks in target detection tasks, scholars have produced many excellent results in multimodal research. One author uses a convolutional neural network to fuse the two modal information streams and discusses the impact of different fusion stages on the target detection results [1]. In response to the above problems, this paper starts from the perspective of adversarial discriminative domain adaptation [4], uses an unsupervised learning method to reduce the modal difference between bimodal images, and proposes a modal information fusion detection algorithm based on a generative adversarial network. In the improved generative adversarial network, the generator is designed with local detail features and global semantic features to extract source image details and semantic information, and a perceptual loss is added to the discriminator to keep the data distribution of the fused image consistent with the source images and improve fusion accuracy. The fused features enter the region-of-interest pooling network for coarse classification, the generated candidate boxes are mapped onto the feature map, and target classification and localization are completed through the fully connected layers; a sketch of this RoI pooling step follows
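The summary describes the standard Faster R-CNN detection head: candidate boxes are mapped onto the feature map by region-of-interest pooling, then classified and localized through fully connected layers. The sketch below uses torchvision's roi_pool; the feature dimensions, number of classes, and feature-map stride are illustrative assumptions, not the paper's configuration.

```python
import torch
from torchvision.ops import roi_pool

class RoIHead(torch.nn.Module):
    """Map candidate boxes onto the feature map with RoI pooling, then
    classify and regress box offsets through fully connected layers."""

    def __init__(self, channels=256, pool=7, num_classes=21):
        super().__init__()
        self.fc = torch.nn.Sequential(
            torch.nn.Flatten(),
            torch.nn.Linear(channels * pool * pool, 1024),
            torch.nn.ReLU(),
        )
        self.cls = torch.nn.Linear(1024, num_classes)      # class scores
        self.reg = torch.nn.Linear(1024, num_classes * 4)  # box offsets
        self.pool = pool

    def forward(self, feats, boxes, stride=16):
        # boxes: list of (N_i, 4) tensors in (x1, y1, x2, y2) image coordinates.
        pooled = roi_pool(feats, boxes, output_size=self.pool,
                          spatial_scale=1.0 / stride)
        h = self.fc(pooled)
        return self.cls(h), self.reg(h)

# Illustrative usage on a fused feature map.
feats = torch.randn(1, 256, 38, 50)               # backbone output
boxes = [torch.tensor([[48., 32., 160., 128.]])]  # one candidate box
scores, offsets = RoIHead()(feats, boxes)
```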