Reinforcement Learning for Visual Object Detection

Stefan Mathe, Aleksis Pirinen, Cristian Sminchisescu

doi:10.1109/cvpr.2016.316

Abstract

One of the most widely used strategies for visual object detection is based on exhaustive spatial hypothesis search. While methods like sliding windows have been successful and effective for many years, they are still brute-force, independent of the image content and the visual category being searched. In this paper we present principled sequential models that accumulate evidence collected at a small set of image locations in order to detect visual objects effectively. By formulating sequential search as reinforcement learning of the search policy (including the stopping condition), our fully trainable model can explicitly balance, for each class, the conflicting goals of exploration (sampling more image regions for better accuracy) and exploitation (stopping the search efficiently when sufficiently confident about the target's location). The methodology is general and applicable to any detector response function. We report encouraging results on the PASCAL VOC 2012 object detection test set, showing that the proposed methodology achieves almost two orders of magnitude speed-up over sliding window methods.
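To make the idea of sequential search with a stopping condition concrete, the following is a minimal sketch and not the authors' model. It visits candidate regions in a fixed order and stops once the best detector response passes a hand-set threshold, whereas the paper learns both the search policy and the stopping rule through reinforcement learning. The names `sequential_search`, `stop_confidence`, `max_steps`, and the toy scores are all assumptions made for illustration.

```python
from typing import Callable, List, Tuple

Region = Tuple[int, int, int, int]  # (x, y, width, height)


def sequential_search(
    regions: List[Region],
    detector: Callable[[Region], float],
    stop_confidence: float = 0.9,
    max_steps: int = 10,
) -> Tuple[Region, float, int]:
    """Evaluate regions one at a time and stop once confident enough.

    Returns the best region seen, its score, and the number of detector calls.
    The fixed visiting order and the threshold are hypothetical stand-ins
    for a learned search policy and a learned stopping condition.
    """
    if not regions:
        raise ValueError("regions must not be empty")
    best_region, best_score, steps = regions[0], float("-inf"), 0
    for region in regions[:max_steps]:
        score = detector(region)  # any detector response function
        steps += 1
        if score > best_score:
            best_region, best_score = region, score
        # Stop as soon as the accumulated evidence is strong enough.
        if best_score >= stop_confidence:
            break
    return best_region, best_score, steps


# Toy detector responses (made-up values, not results from the paper).
toy_scores = {
    (0, 0, 10, 10): 0.10,
    (10, 0, 10, 10): 0.30,
    (20, 0, 10, 10): 0.95,
    (30, 0, 10, 10): 0.20,
}

best, score, calls = sequential_search(list(toy_scores), toy_scores.__getitem__)
print(best, score, calls)  # prints: (20, 0, 10, 10) 0.95 3
```

In this toy run the search stops after three detector calls instead of evaluating all four regions. Raising `stop_confidence` trades more calls for more evidence, which is the exploration versus exploitation balance the abstract describes.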

Similar Papers

Distinct Mechanisms Mediate Visual Detection and Identification
James M Hillis ... David H Brainard
Current Biology | VOL. 17
27 Sep 2007

Visual Object Detection with DETR to Support Video-Diagnosis Using Conference Tools
Attila Biró ... Sándor Miklós Szilágyi
Applied Sciences | VOL. 12
12 Jun 2022

ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
Zhuo Chen ... Ruizhou Ding
01 Mar 2020

Integrating Visual Context and Object Detection within a Probabilistic Framework
Roland Perko ... Bernt Schiele
01 Jan 2009
