MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization

Zengyi Qin,Jinglu Wang,Yan Lu

doi:10.1609/aaai.v33i01.33018851

Open Access

https://doi.org/10.1609/aaai.v33i01.33018851

Abstract

Localizing objects in real 3D space plays a crucial role in scene understanding, but it is particularly challenging given only a single RGB image, because geometric information is lost during image projection. We propose MonoGRNet for amodal 3D object localization from a monocular RGB image via geometric reasoning in both the observed 2D projection and the unobserved depth dimension. MonoGRNet is a single, unified network composed of four task-specific subnetworks, responsible for 2D object detection, instance depth estimation (IDE), 3D localization, and local corner regression. Unlike pixel-level depth estimation, which needs per-pixel annotations, the proposed IDE method directly predicts the depth of the target 3D bounding box’s center using sparse supervision. 3D localization is then achieved by estimating the position in the horizontal and vertical dimensions. Finally, MonoGRNet is jointly learned by optimizing the locations and poses of the 3D bounding boxes in the global context. We demonstrate that MonoGRNet achieves state-of-the-art performance on challenging datasets.
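
To illustrate how a depth estimate for a box center, combined with that center's 2D image position, could determine a 3D location, here is a minimal Python sketch under a standard pinhole camera model. The pinhole model, the function names, and the intrinsics values are assumptions made for illustration; the abstract does not specify them, and this is not the authors' implementation.

```python
from typing import Tuple


def back_project(u: float, v: float, depth: float,
                 fx: float, fy: float, cx: float, cy: float
                 ) -> Tuple[float, float, float]:
    """Lift a pixel (u, v) with known depth to camera coordinates.

    Assumes an ideal pinhole camera (no lens distortion) with focal
    lengths (fx, fy) and principal point (cx, cy), all in pixels.
    """
    x = (u - cx) * depth / fx  # horizontal position
    y = (v - cy) * depth / fy  # vertical position
    return (x, y, depth)


def project(x: float, y: float, z: float,
            fx: float, fy: float, cx: float, cy: float
            ) -> Tuple[float, float]:
    """Project a 3D camera-frame point back to pixel coordinates."""
    return (fx * x / z + cx, fy * y / z + cy)


# Hypothetical intrinsics and a hypothetical predicted center and depth.
FX, FY, CX, CY = 700.0, 700.0, 600.0, 180.0
center_3d = back_project(700.0, 200.0, 20.0, FX, FY, CX, CY)
center_2d = project(*center_3d, FX, FY, CX, CY)
```

Under this model, depth is the only quantity that cannot be read off the image, which is consistent with the abstract treating instance depth as its own estimation task and then recovering the horizontal and vertical position.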

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jul 17, 2019
Citations: 247

Similar Papers

MonoGRNet: A General Framework for Monocular 3D Object Detection.
Zengyi Qin ... Yan Lu
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 44
01 Jan 2020

The effect of horizontal and vertical dimensions of the interproximal space on the existence of interdental papillae – A clinical study
Ashwath B ... Kavitha P
IP International Journal of Periodontology and Implantology | VOL. 6
15 Oct 2021

Lower-right and upper-left biases within upper and lower visual fields in a circular array task.
Izabela Szelest ... Lorin J Elias
Perceptual and Motor Skills | VOL. 119
01 Dec 2014

Influence of short incompatible practice on the Simon effect: transfer along the vertical dimension and across vertical and horizontal dimensions.
Erick F Q Conde ... Allan Pablo Lameira
Experimental Brain Research | VOL. 233
12 Aug 2015
