Abstract

Most recent methods integrate features at the fusion layer by simple concatenation or addition, which fails to adequately account for the intrinsic characteristics of the different modal images and for feature interaction across scales, and thus limits fusion performance. To address this, we introduce a cross-scale iterative attentional adversarial fusion network, termed CrossFuse. Specifically, in the generator, we design a cross-modal attention integration module to merge the intrinsic content of the different modal images. Parallel spatial-independent and channel-independent pathways compute attentional weights that measure the activity levels of the source images at the same scale. Moreover, we construct a cross-scale iterative decoder that interacts the features of the two modalities across scales and progressively refines their activity levels. In this way, the generator learns to integrate the modality characteristics through attentional weights in an iterative manner, and the fused result exhibits prominent infrared radiant intensity and distinct visible detail. Extensive experiments on three benchmarks demonstrate that CrossFuse outperforms nine other state-of-the-art methods in terms of fusion performance, generalization ability, and computational efficiency. Our code will be released at https://github.com/Zhishe-Wang/CrossFuse.
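
To make the attention-weighted fusion described above concrete, the sketch below shows one plausible PyTorch form of a cross-modal attention block that scores the activity levels of same-scale infrared and visible features through parallel channel and spatial pathways. The module name CrossModalAttentionFusion, the squeeze-and-excitation-style channel pathway, the 7x7 spatial convolution, and the softmax weighting are illustrative assumptions, not the released CrossFuse implementation.

```python
import torch
import torch.nn as nn


class CrossModalAttentionFusion(nn.Module):
    """Sketch of a cross-modal attention fusion block: parallel channel-wise
    and spatial pathways score the activity level of each modality, and the
    resulting weights blend same-scale infrared and visible feature maps."""

    def __init__(self, channels: int, reduction: int = 4):
        super().__init__()
        # Channel pathway: pooled descriptors of both modalities pass through
        # a small MLP that outputs one weight per channel per modality.
        self.channel_mlp = nn.Sequential(
            nn.Linear(2 * channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, 2 * channels),
        )
        # Spatial pathway: a 7x7 conv over mean/max-pooled maps of both
        # modalities outputs one weight per pixel per modality.
        self.spatial_conv = nn.Conv2d(4, 2, kernel_size=7, padding=3)

    def forward(self, feat_ir: torch.Tensor, feat_vis: torch.Tensor) -> torch.Tensor:
        b, c, h, w = feat_ir.shape

        # Channel attention: softmax makes the two modalities compete per channel.
        pooled = torch.cat([feat_ir, feat_vis], dim=1).mean(dim=(2, 3))   # (B, 2C)
        ch_w = torch.softmax(self.channel_mlp(pooled).view(b, 2, c, 1, 1), dim=1)

        # Spatial attention: softmax makes the two modalities compete per pixel.
        def pool(x):
            return torch.cat([x.mean(1, keepdim=True), x.amax(1, keepdim=True)], dim=1)

        sp_logits = self.spatial_conv(torch.cat([pool(feat_ir), pool(feat_vis)], dim=1))
        sp_w = torch.softmax(sp_logits, dim=1).unsqueeze(2)               # (B, 2, 1, H, W)

        # Fuse: average the two attention maps and take a weighted sum.
        stacked = torch.stack([feat_ir, feat_vis], dim=1)                 # (B, 2, C, H, W)
        return (stacked * 0.5 * (ch_w + sp_w)).sum(dim=1)


# Toy usage: fuse 64-channel infrared and visible features at one scale.
if __name__ == "__main__":
    ir, vis = torch.randn(1, 64, 32, 32), torch.randn(1, 64, 32, 32)
    print(CrossModalAttentionFusion(64)(ir, vis).shape)  # torch.Size([1, 64, 32, 32])
```

In the full network, such weighted fusion would be applied at each scale and iterated through the decoder so that the activity levels are refined rather than fixed after a single pass.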
