Abstract

In the realm of infrared and visible scene parsing, satisfactory performance has been achieved by leveraging the complementary nature of the infrared and visible imaging modalities. Existing methods employ various strategies to fuse cross-modality features. However, these strategies typically integrate features at the same level (i.e., the same network depth), neglecting potential interactions across different levels. To address this limitation, we introduce a novel concept called misalignment fusion, which merges multimodality feature maps drawn from distinct levels. Building on this concept, we propose a misalignment fusion network (MFNet) designed for infrared and visible urban scene parsing. Our network incorporates a misalignment-guided fusion module to integrate cross-modality features, as well as an adaptive refined selective fusion module to combine the segmentation maps predicted by two parallel-branch decoders. Extensive experiments demonstrate that the proposed MFNet consistently surpasses existing state-of-the-art methods in infrared and visible urban scene parsing.
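To make the misalignment-fusion idea concrete, the following is a minimal sketch of fusing an infrared feature map from one encoder level with a visible feature map from a different (deeper) level. It is not the authors' implementation; the module name, channel widths, concatenation-based fusion, and bilinear resampling are illustrative assumptions.

```python
# Minimal sketch of cross-level ("misaligned") fusion: NOT the paper's
# misalignment-guided fusion module. All names and sizes are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class CrossLevelFusion(nn.Module):
    """Fuse an infrared feature map from one encoder level with a visible
    feature map taken from a different level (here: one level deeper)."""

    def __init__(self, ir_channels: int, vis_channels: int, out_channels: int):
        super().__init__()
        # Project the concatenated cross-level features to a common width.
        self.project = nn.Sequential(
            nn.Conv2d(ir_channels + vis_channels, out_channels, kernel_size=1),
            nn.BatchNorm2d(out_channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, ir_feat: torch.Tensor, vis_feat: torch.Tensor) -> torch.Tensor:
        # Features from different depths have different spatial resolutions,
        # so resample the visible features to match the infrared ones.
        vis_feat = F.interpolate(
            vis_feat, size=ir_feat.shape[-2:], mode="bilinear", align_corners=False
        )
        return self.project(torch.cat([ir_feat, vis_feat], dim=1))


if __name__ == "__main__":
    # Example: infrared features from level 2 (64 ch, 1/4 resolution) fused
    # with visible features from level 3 (128 ch, 1/8 resolution).
    ir_l2 = torch.randn(1, 64, 64, 64)
    vis_l3 = torch.randn(1, 128, 32, 32)
    fused = CrossLevelFusion(64, 128, 128)(ir_l2, vis_l3)
    print(fused.shape)  # torch.Size([1, 128, 64, 64])
```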
