HoughNet: Integrating Near and Long-Range Evidence for Visual Detection.

Nermin Samet,Emre Akbas,Samet Hicsonmez

doi:10.1109/tpami.2022.3200413

Abstract

This paper presents HoughNet, a one-stage, anchor-free, voting-based, bottom-up object detection method. Inspired by the Generalized Hough Transform, HoughNet determines the presence of an object at a certain location by the sum of the votes cast on that location. Votes are collected from both near and long-distance locations based on a log-polar vote field. Thanks to this voting mechanism, HoughNet is able to integrate both near and long-range, class-conditional evidence for visual recognition, thereby generalizing and enhancing current object detection methodology, which typically relies on only local evidence. On the COCO dataset, HoughNet's best model achieves 46.4 AP (and 65.1 AP50), performing on par with the state-of-the-art in bottom-up object detection and outperforming most major one-stage and two-stage methods. We further validate the effectiveness of our proposal in other visual detection tasks, namely, video object detection, instance segmentation, 3D object detection and keypoint detection for human pose estimation, and an additional "labels to photo'' image generation task, where the integration of our voting module consistently improves performance in all cases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

HoughNet: Integrating Near and Long-Range Evidence for Visual Detection.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Jan 1, 2022
Citations: 6

Similar Papers

HoughNet: Integrating Near and Long-Range Evidence for Bottom-Up Object Detection
Nermin Samet ... Emre Akbas
-
Nermin Samet, et. al.Nermin Samet ... Emre Akbas
01 Jan 2020
01 Jan 2020

Monocular 3D object detection for construction scene analysis
Jie Shen ... Cong Zhang
Computer-Aided Civil and Infrastructure Engineering | VOL. -
Jie Shen, et. al.Jie Shen ... Cong Zhang
20 Dec 2023
Computer-Aided Civil and Infrastructure Engineering | VOL. -

3D Object Detection and Instance Segmentation from 3D Range and 2D Color Images.
Xiaoke Shen ... Ioannis Stamos
Sensors | VOL. 21
Xiaoke Shen, et. al.Xiaoke Shen ... Ioannis Stamos
09 Feb 2021
Sensors | VOL. 21

Medical image analysis methods for anatomical surface reconstruction using tracked 3D ultrasound

-

01 Jan 2014
01 Jan 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

HoughNet: Integrating Near and Long-Range Evidence for Visual Detection.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence