Abstract

In view of the lack of feature complementarity between the feature layers of Single Shot MultiBox Detector (SSD) and the weak detection ability of SSD for small objects, we propose an improved SSD object detection algorithm based on Dense Convolutional Network (DenseNet) and feature fusion, which is called DF-SSD. On the basis of SSD, we design the feature extraction network DenseNet-S-32-1 with reference to the dense connection of DenseNet, and replace the original backbone network VGG-16 of SSD with DenseNet-S-32-1 to enhance the feature extraction ability of the model. In the part of multi-scale detection, a fusion mechanism of multi-scale feature layers is introduced to organically combine low-level visual features and high-level semantic features in the network structure. Finally, a residual block is established before the object prediction to further improve the model performance. We train the DF-SSD model from scratch. The experimental results show that our model DF-SSD with 300 × 300 input achieves 81.4% mAP, 79.0% mAP, and 29.5% mAP on PASCAL VOC 2007, VOC 2012, and MS COCO datasets, respectively. Compared with SSD, the detection accuracy of DF-SSD on VOC 2007 is improved by 3.1% mAP. DF-SSD requires only 1/2 parameters to SSD and 1/9 parameters to Faster RCNN. We inject more semantic information into DF-SSD, which makes it have advanced detection effect on small objects and objects with specific relationships.

Highlights

  • Object detection is one of the important research topics in the field of computer vision

  • Since AlexNet [6] proposed by Krizhevsky et al made a significant improvement on ImageNet [7] in 2012, various deep learning methods represented by convolutional neural networks (CNNs) have

  • As an object detection algorithm based on deep learning, Single Shot MultiBox Detector (SSD) [8] has high performance in both detection accuracy and detection speed

Read more

Summary

Introduction

Object detection is one of the important research topics in the field of computer vision. Since AlexNet [6] proposed by Krizhevsky et al made a significant improvement on ImageNet [7] in 2012, various deep learning methods represented by convolutional neural networks (CNNs) have. Been widely applied in many visual tasks, including object detection. As an object detection algorithm based on deep learning, Single Shot MultiBox Detector (SSD) [8] has high performance in both detection accuracy and detection speed. SSD algorithm is proposed by Liu W et al in 2016 to solve the problem of insufficient detection accuracy of YOLO [9] series algorithm in object positioning. Its main idea is to sample densely and evenly at different locations of the image.

Methods
Findings
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call