Abstract
In the wake of developments in remote sensing, target detection in remote sensing imagery has attracted increasing interest. Unfortunately, unlike natural image processing, remote sensing image processing involves large variations in object size, which poses a great challenge to researchers. Although traditional multi-scale detection networks have been successful in handling such variations, they still have certain limitations: (1) Traditional multi-scale detection methods attend to the scale of features but ignore the correlation between feature levels. Each feature map is taken from a single layer of the backbone network, so the extracted features are not comprehensive enough; for example, the SSD network detects directly on the features extracted from the backbone at different scales, losing a large amount of contextual information. (2) These methods pair an off-the-shelf backbone classification network with the detection task; RetinaNet, for instance, simply combines a ResNet-101 classification network with an FPN to perform detection. However, object classification and object detection are different tasks. To address these issues, a cross-scale feature fusion pyramid network (CF2PN) is proposed. First and foremost, a cross-scale fusion module (CSFM) is introduced to extract sufficiently comprehensive semantic information from the features for multi-scale fusion. Moreover, a feature pyramid built from thinned U-shaped modules (TUMs) performs multi-level fusion of the features. Eventually, a focal loss is used in the prediction stage to control the large number of negative samples generated during the feature fusion process. The proposed architecture is verified on the DIOR and RSOD datasets. The experimental results show that this method improves on current SOTA target detection methods by 2–12% on the DIOR and RSOD datasets.
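To make the prediction-stage loss concrete, the following is a minimal sketch of the standard binary focal loss of Lin et al., which down-weights the many easy negative anchors so training focuses on hard examples. The balancing factor alpha and focusing parameter gamma are the usual conventions from that paper; the exact formulation and hyperparameters used in CF2PN are not specified here, so treat this as an illustrative assumption.

```python
import torch
import torch.nn.functional as F

def focal_loss(logits, targets, alpha=0.25, gamma=2.0):
    """Binary focal loss sketch: suppresses well-classified (easy)
    examples so the abundant negatives do not dominate training."""
    # Per-anchor binary cross-entropy, kept unreduced.
    bce = F.binary_cross_entropy_with_logits(logits, targets, reduction="none")
    p = torch.sigmoid(logits)
    # p_t is the predicted probability of the true class for each anchor.
    p_t = p * targets + (1 - p) * (1 - targets)
    # alpha_t balances positives vs. negatives; (1 - p_t)^gamma
    # shrinks the loss of already well-classified examples.
    alpha_t = alpha * targets + (1 - alpha) * (1 - targets)
    return (alpha_t * (1 - p_t) ** gamma * bce).mean()

# Illustrative usage: 8 images, 1000 anchors each (hypothetical shapes).
logits = torch.randn(8, 1000)
targets = torch.randint(0, 2, (8, 1000)).float()
loss = focal_loss(logits, targets)
```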
Highlights
With the development of technology and the advent of the era of machine learning, deep learning is advancing by leaps and bounds and has driven progress in target detection technology.
Traditional target detection [1,2] extracts features from candidate regions within the image using techniques such as Haar [3], HOG [4], or sparse representation [5,6,7,8] and classifies them using an SVM [9] model.
The experimental results show that this method improves on current SOTA target detection methods by 2–12% on the DIOR and RSOD datasets.
There are two categories of deep learning-based target detection methods: the first involves two-stage target detection based on region proposals, whereas the second involves single-stage target detection based on regression.
Summary
Traditional target detection [1,2] extracts features from candidate regions within the image using techniques such as Haar [3], HOG [4], or sparse representation [5,6,7,8] and classifies them using an SVM [9] model. Deep learning-based target detection methods are now widely used and fall into two categories: two-stage detection based on region proposals and single-stage detection based on regression. In the two-stage family, early methods that split selective search and detection into two separate stages are slow and detect small targets poorly; replacing the selective search algorithm with an RPN improves detection speed. Single-stage methods convert the target detection task into a regression problem, greatly speeding up detection. The feature pyramid introduced here addresses the defect that, in a traditional feature pyramid, the feature map at each scale contains only a single level or few levels of features.
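As a concrete illustration of the cross-scale fusion idea behind the new feature pyramid, the sketch below resizes feature maps from several backbone levels to a common spatial size, concatenates them along channels, and compresses the result with a 1x1 convolution so that each pyramid scale carries information from multiple levels. The module name, channel counts, and ResNet-style stage shapes are illustrative assumptions, not the paper's exact CSFM design.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CrossScaleFusion(nn.Module):
    """Sketch of cross-scale fusion: resize multi-level backbone
    features to one spatial size, concatenate, and compress."""
    def __init__(self, in_channels, out_channels):
        super().__init__()
        # in_channels: list of channel counts, one per input level.
        self.compress = nn.Conv2d(sum(in_channels), out_channels, kernel_size=1)

    def forward(self, features):
        # Resize every level to the spatial size of the first (largest) map.
        target_size = features[0].shape[-2:]
        resized = [F.interpolate(f, size=target_size, mode="nearest")
                   for f in features]
        fused = torch.cat(resized, dim=1)
        return self.compress(fused)

# Example: fuse three ResNet-style stages into one 256-channel base feature.
c3 = torch.randn(1, 512, 80, 80)
c4 = torch.randn(1, 1024, 40, 40)
c5 = torch.randn(1, 2048, 20, 20)
fusion = CrossScaleFusion([512, 1024, 2048], 256)
base_feature = fusion([c3, c4, c5])  # -> shape (1, 256, 80, 80)
```

Fusing before the pyramid is what distinguishes this approach from detecting on isolated backbone layers: every downstream scale already mixes shallow spatial detail with deep semantics.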