Abstract

Convolutional neural network (CNN) has led to significant progress in object detection. In order to detect the objects in various sizes, the object detectors often exploit the hierarchy of the multi-scale feature maps called feature pyramid, which is readily obtained by the CNN architecture. However, such feature maps do not fully consider the supplementary effect of contextual information on semantics. In this work, we proposed a feature fusion method of residual attention based on the SSD benchmark network call Improved SSD to make full use of context information to improve the characterization ability of feature maps. Besides, we use the residual attention mechanism to reinforce the key features to further improve the detector performance. The experiment result on benchmark dataset PASCAL VOC shows that the map of the proposed method with input image sizes of 300×300 and 512×512 is 78.8% and 80.7%.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call