Abstract

In the field of deep learning vision, infrared and visible image fusion (IVIF) has received significant attention due to its ability to enhance scene comprehension by combining the complementary features of both types of images. Existing methods based on generative models are either unstable in training, such as generative adversarial network, or only have a single denoising object, such as diffusion models, resulting in poor fused results that cannot fully contain multi-modal features. To solve these problems, we propose a multi-attribute diffusion model to fuse infrared and visible images, termed MADMFuse. Specifically, to preserve the salient information in infrared images and the texture details in visible images, we have designed a forward diffusion process with shared noise for multiple independent attributes, ensuring that the denoising network learns complementary features of different attributes simultaneously during training. Subsequently, our reverse process with multi-attribute as conditions employs the denoising network that iteratively reparameterizes noise, gradually adjusting and achieving fine image fusion. Furthermore, to address the issue of feature degradation resulting from solely focusing on a single denoising object, we derive a multiple diffusion object loss function based on pixel-level fusion target, the fidelity and luminance term in this loss function aim to guide the fused results towards visual similarity with the source image while preserving its brightness. Extensive experiments indicate that MADMFuse is more effective than other state-of-the-art image fusion methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.