MDCT: Multi-Kernel Dilated Convolution and Transformer for One-Stage Object Detection of Remote Sensing Images

Juanjuan Chen, Hansheng Hong, Bin Song, Jie Guo, Chen Chen, Junjie Xu

doi:10.3390/rs15020371

Abstract

Deep learning (DL)-based object detection algorithms have achieved impressive results on natural images and have gradually matured in recent years. Compared with natural images, however, remote sensing images pose severe challenges: backgrounds are complex, and small objects in dense scenes are difficult to detect. To address these problems, we propose MDCT, a novel one-stage object detection model built on a multi-kernel dilated convolution (MDC) block and a transformer block. First, a new feature enhancement module, the MDC block, is developed in the one-stage object detection model to enhance small objects' own (ontology) features and their adjacent spatial features. Second, we integrate a transformer block into the neck network of the one-stage object detection model to prevent the loss of object information in complex backgrounds and dense scenes. Finally, a depthwise separable convolution is introduced into each MDC block to reduce the computational cost. We conduct experiments on three datasets: DIOR, DOTA, and NWPU VHR-10. Compared with YOLOv5, our model improves object detection accuracy by 2.3%, 0.9%, and 2.9% on the DIOR, DOTA, and NWPU VHR-10 datasets, respectively.
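The two convolution ideas named in the abstract can be illustrated with simple arithmetic. The sketch below is not the authors' MDC block: the kernel size, dilation rates, and channel counts are assumptions chosen for illustration, and bias terms are ignored. It shows how dilation widens the spatial extent covered by a kernel without adding weights, and how a depthwise separable convolution (a per-channel k x k convolution followed by a 1 x 1 pointwise convolution) reduces the weight count relative to a standard convolution.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Spatial extent covered by a dilated kernel along one axis.

    A dilation of d inserts (d - 1) gaps between adjacent kernel taps,
    so the extent is k + (k - 1) * (d - 1).
    """
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def standard_conv_params(c_in: int, c_out: int, kernel_size: int) -> int:
    """Weight count of a standard 2D convolution (bias ignored)."""
    return kernel_size * kernel_size * c_in * c_out


def depthwise_separable_params(c_in: int, c_out: int, kernel_size: int) -> int:
    """Weight count of a depthwise k x k conv plus a 1 x 1 pointwise conv."""
    depthwise = kernel_size * kernel_size * c_in  # one k x k filter per input channel
    pointwise = c_in * c_out                      # 1 x 1 conv that mixes channels
    return depthwise + pointwise


if __name__ == "__main__":
    # Hypothetical settings, NOT taken from the paper.
    kernel_size = 3
    channels = 256
    for dilation in (1, 2, 3):
        extent = effective_kernel_size(kernel_size, dilation)
        print(f"dilation={dilation}: covers {extent} x {extent} pixels")

    std = standard_conv_params(channels, channels, kernel_size)
    sep = depthwise_separable_params(channels, channels, kernel_size)
    print(f"standard: {std} weights, depthwise separable: {sep} weights")
```

Under these assumed settings, a 3 x 3 kernel with dilation 3 spans a 7 x 7 window while still using nine weights per channel pair, and the depthwise separable variant needs 67,840 weights where the standard convolution needs 589,824.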

Journal: Remote Sensing
Publication Date: Jan 7, 2023
Citations: 19
License type: CC BY 4.0

Similar Papers

A Novel Adaptive Edge Aggregation and Multiscale Feature Interaction Detector for Object Detection in Remote Sensing Images
Wei Huang ... Yuwen Chen
Remote Sensing | VOL. 15
01 Nov 2023

ℱ3-Net: Feature Fusion and Filtration Network for Object Detection in Optical Remote Sensing Images
Xinhai Ye ... Fengchao Xiong
Remote Sensing | VOL. 12
09 Dec 2020

Multi-scale Dense Object Detection in Remote Sensing Imagery Based on Keypoints
Qingxiang Guo ... Chaohui Li
01 Jan 2020

A Multi-Feature Fusion and Attention Network for Multi-Scale Object Detection in Remote Sensing Images
Yong Cheng ... Ngoc Nguyen Tran
Remote Sensing | VOL. 15
16 Apr 2023
