Abstract

In this paper, we will study the basic flow and the principle of state-of-the-art object detection technique (i.e. Faster R-CNN) and improve it further with the inclusion of two strategies into it. Firstly, we propose a multi-layer features merging strategy by using a concatenation layer. Secondly, we introduce a contextual learning scheme for Faster R-CNN. Previously, Faster R-CNN just uses regional features. Contextual features are added with the regional features for the classification and detection task. Our improvement on Faster R-CNN shows promising results. We call our improved Faster R-CNN network as ID-CNN (Intelligent Detection Using Convolutional Neural Network) as its detection accuracy is better. Therefore, we call it as an intelligent detector. We use a deep VGG-16 model as our base model, as Faster R-CNN did. We evaluated our ID-CNN on Pascal VOC public datasets. Experimental results show that ID-CNN can effectively improve the object detection average precision to some extent. On VOC 2007 and 2012, we achieved a mean average precision (mAP) of 74.7% and 71.9%, respectively. ID-CNN is also end-to-end trainable with the same alternating fine-tuning optimization scheme of Faster R-CNN. Finally, we compared ID-CNN with Faster R-CNN on ImageNet object detection dataset and we achieved mAP of 48.1% compared with 46.2% for Faster R-CNN.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call