Abstract

Object imaging and recognition under difficult visual conditions is extremely challenging because the captured images are of low quality, and traditional optical-only recognition methods often fail at this task. In this paper, we propose to exploit visual-microwave image pairs, captured jointly by visual cameras and microwave sensors, for imaging and recognition. To cope with the heavy noise in the low-quality optical images, we retrieve physically quantitative images from the associated scattered-field data and enhance the visual features using both the optical and the retrieved images. We develop a cross-modal Enhanced Attentive Visual-Microwave Fusion (EAVMF) object recognition model that jointly learns a cross-modal generator and a multimodal recognizer. In addition, an attention module in the visual subnetwork highlights regions of interest. Two multimodal datasets with synthetic visual-microwave image pairs are built to simulate difficult visual conditions. Numerical results on these datasets demonstrate that: 1) the multimodal fusion, the cross-modal enhancement, and the visual attention module each improve performance; and 2) compared with existing methods, the proposed EAVMF not only achieves higher accuracy but also offers good scalability and one-shot learning ability.
