DF-SSD: An Improved SSD Object Detection Algorithm Based on DenseNet and Feature Fusion

Sheping Zhai,Susu Dong,Dingrong Shang,Shuhuan Wang

doi:10.1109/access.2020.2971026

Sheping Zhai, Susu Dong + Show 2 more

Open Access

https://doi.org/10.1109/access.2020.2971026

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 123	License type: CC BY 4.0

Affiliation: Xi’an University of Posts and Telecommunications

Abstract

In view of the lack of feature complementarity between the feature layers of Single Shot MultiBox Detector (SSD) and the weak detection ability of SSD for small objects, we propose an improved SSD object detection algorithm based on Dense Convolutional Network (DenseNet) and feature fusion, which is called DF-SSD. On the basis of SSD, we design the feature extraction network DenseNet-S-32-1 with reference to the dense connection of DenseNet, and replace the original backbone network VGG-16 of SSD with DenseNet-S-32-1 to enhance the feature extraction ability of the model. In the part of multi-scale detection, a fusion mechanism of multi-scale feature layers is introduced to organically combine low-level visual features and high-level semantic features in the network structure. Finally, a residual block is established before the object prediction to further improve the model performance. We train the DF-SSD model from scratch. The experimental results show that our model DF-SSD with 300 × 300 input achieves 81.4% mAP, 79.0% mAP, and 29.5% mAP on PASCAL VOC 2007, VOC 2012, and MS COCO datasets, respectively. Compared with SSD, the detection accuracy of DF-SSD on VOC 2007 is improved by 3.1% mAP. DF-SSD requires only 1/2 parameters to SSD and 1/9 parameters to Faster RCNN. We inject more semantic information into DF-SSD, which makes it have advanced detection effect on small objects and objects with specific relationships.

Highlights

Object detection is one of the important research topics in the field of computer vision
Since AlexNet [6] proposed by Krizhevsky et al made a significant improvement on ImageNet [7] in 2012, various deep learning methods represented by convolutional neural networks (CNNs) have
As an object detection algorithm based on deep learning, Single Shot MultiBox Detector (SSD) [8] has high performance in both detection accuracy and detection speed

Summary

Introduction

Object detection is one of the important research topics in the field of computer vision. Since AlexNet [6] proposed by Krizhevsky et al made a significant improvement on ImageNet [7] in 2012, various deep learning methods represented by convolutional neural networks (CNNs) have. Been widely applied in many visual tasks, including object detection. As an object detection algorithm based on deep learning, Single Shot MultiBox Detector (SSD) [8] has high performance in both detection accuracy and detection speed. SSD algorithm is proposed by Liu W et al in 2016 to solve the problem of insufficient detection accuracy of YOLO [9] series algorithm in object positioning. Its main idea is to sample densely and evenly at different locations of the image.

Methods

Findings

Discussion

Conclusion