Abstract

We propose a novel deep convolutional neural network (CNN) architecture able to perform the integrated object recognition and localization tasks. We propose the Focused Attention (FA) objective that aims to optimize the network to learn features only from objects of interest while suppress those features from the background. As a result, the features extracted by the learned models can be used to accurately predict both the object category and the bounding box of the recognized object in the input image. Experimental results show that the proposed CNN architecture trained with the FA objective achieves better performances than original AlexNet in both the object localization and recognition tasks.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call