Abstract

The growing volume and complex content of aerial images mean that many recent deep-learning-based methods do not generalize well across different aerial image processing tasks: the coarse-grained feature representations they produce are not discriminative enough. Moreover, confounding factors in the datasets and the long-tailed distribution of the training data lead to biased and spurious associations among the objects in aerial images. This study proposes a confounder-free fusion network (CFF-NET) to address these challenges. Global and local feature extraction branches are designed to capture comprehensive and fine-grained deep features from the whole image. Specifically, to extract discriminative local features and exploit contextual information across regions, models based on gated recurrent units (GRUs) are constructed to extract the features of each image region and output an importance weight for each region. Further, a confounder-free object feature extraction branch is proposed to generate reasonable visual attention, provide additional multi-grained image information, and eliminate spurious and biased visual relationships at the object level. Finally, the outputs of the three branches are combined to obtain the fused feature representation. Extensive experiments are conducted on three popular aerial image processing tasks: image classification, image retrieval, and image captioning. The proposed CFF-NET achieves reasonable, state-of-the-art results, including on high-level tasks such as aerial image captioning.
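To make the three-branch design concrete, the following is a minimal PyTorch-style sketch, not the authors' implementation: the module name `ThreeBranchFusion`, the heads `region_score` and `object_proj`, and all dimensions and pooling choices are illustrative assumptions. It shows a global branch that pools the whole feature map, a local branch in which a GRU reads a sequence of region features and scores each region's importance, a placeholder projection standing in for the confounder-free object branch, and concatenation of the three outputs into a fused representation.

```python
import torch
import torch.nn as nn

class ThreeBranchFusion(nn.Module):
    """Illustrative sketch of the three-branch fusion described in the
    abstract. Names, dimensions, and pooling are assumptions, not the
    paper's actual architecture."""

    def __init__(self, region_dim=512, hidden_dim=256, fused_dim=1024):
        super().__init__()
        # Global branch: pool the whole feature map into one vector.
        self.global_pool = nn.AdaptiveAvgPool2d(1)
        # Local branch: a GRU reads the region sequence; a small head
        # outputs an importance weight for each region.
        self.gru = nn.GRU(region_dim, hidden_dim, batch_first=True)
        self.region_score = nn.Linear(hidden_dim, 1)
        # Placeholder for the confounder-free object branch.
        self.object_proj = nn.Linear(region_dim, hidden_dim)
        # Fusion layer over the concatenated branch outputs.
        self.fuse = nn.Linear(region_dim + 2 * hidden_dim, fused_dim)

    def forward(self, feat_map, region_feats, object_feats):
        # feat_map: (B, region_dim, H, W); region_feats: (B, R, region_dim);
        # object_feats: (B, O, region_dim)
        g = self.global_pool(feat_map).flatten(1)            # (B, region_dim)
        h, _ = self.gru(region_feats)                        # (B, R, hidden_dim)
        w = torch.softmax(self.region_score(h), dim=1)       # (B, R, 1) weights
        local = (w * h).sum(dim=1)                           # weighted local feature
        obj = self.object_proj(object_feats).mean(dim=1)     # pooled object feature
        return self.fuse(torch.cat([g, local, obj], dim=1))  # (B, fused_dim)

# Example: a 7x7 feature map with 512 channels, 49 region vectors,
# and 10 object vectors yields a (2, 1024) fused representation.
model = ThreeBranchFusion()
fused = model(torch.randn(2, 512, 7, 7),
              torch.randn(2, 49, 512),
              torch.randn(2, 10, 512))
print(fused.shape)  # torch.Size([2, 1024])
```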
