Intelligent Detection Using Convolutional Neural Network (ID-CNN)

Amish Jahangir Kapoor,Hong Fan,Muhammad Sohail Sardar

doi:10.1088/1755-1315/234/1/012061

Amish Jahangir Kapoor, Hong Fan + Show 1 more

Open Access

https://doi.org/10.1088/1755-1315/234/1/012061

Copy DOI

Abstract

In this paper, we will study the basic flow and the principle of state-of-the-art object detection technique (i.e. Faster R-CNN) and improve it further with the inclusion of two strategies into it. Firstly, we propose a multi-layer features merging strategy by using a concatenation layer. Secondly, we introduce a contextual learning scheme for Faster R-CNN. Previously, Faster R-CNN just uses regional features. Contextual features are added with the regional features for the classification and detection task. Our improvement on Faster R-CNN shows promising results. We call our improved Faster R-CNN network as ID-CNN (Intelligent Detection Using Convolutional Neural Network) as its detection accuracy is better. Therefore, we call it as an intelligent detector. We use a deep VGG-16 model as our base model, as Faster R-CNN did. We evaluated our ID-CNN on Pascal VOC public datasets. Experimental results show that ID-CNN can effectively improve the object detection average precision to some extent. On VOC 2007 and 2012, we achieved a mean average precision (mAP) of 74.7% and 71.9%, respectively. ID-CNN is also end-to-end trainable with the same alternating fine-tuning optimization scheme of Faster R-CNN. Finally, we compared ID-CNN with Faster R-CNN on ImageNet object detection dataset and we achieved mAP of 48.1% compared with 46.2% for Faster R-CNN.

Full Text