6D-VNet: End-To-End 6DoF Vehicle Pose Estimation From Monocular RGB Images

Di Wu,Wenbin Zou,Xia Li,Canqun Xiang,Zhaoyong Zhuang

doi:10.1109/cvprw.2019.00163

Abstract

We present a conceptually simple framework for 6DoF object pose estimation, especially for autonomous driving scenario. Our approach efficiently detects traffic participants in a monocular RGB image while simultaneously regressing their 3D translation and rotation vectors. The method, called 6D-VNet, extends Mask R-CNN by adding customised heads for predicting vehicle's finer class, rotation and translation. The proposed 6D-VNet is trained end-to-end compared to previous methods. Furthermore, we show that the inclusion of translational regression in the joint losses is crucial for the 6DoF pose estimation task, where object translation distance along longitudinal axis varies significantly, e.g., in autonomous driving scenarios. Additionally, we incorporate the mutual information between traffic participants via a modified non-local block. As opposed to the original non-local block implementation, the proposed weighting modification takes the spatial neighbouring information into consideration whilst counteracting the effect of extreme gradient values. Our 6D-VNet reaches the 1 st place in ApolloScape challenge 3D Car Instance task. Code has been made available at: https://github.com/stevenwudi/6DVNET .

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

6D-VNet: End-To-End 6DoF Vehicle Pose Estimation From Monocular RGB Images

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

End-to-End 6DoF Pose Estimation From Monocular RGB Images
Wenbin Zou ... Xia Li
IEEE Transactions on Consumer Electronics | VOL. 67
Wenbin Zou, et. al.Wenbin Zou ... Xia Li
01 Feb 2021
IEEE Transactions on Consumer Electronics | VOL. 67

A Framework for 3D Object Detection and Pose Estimation in Unstructured Environment Using Single Shot Detector and Refined LineMOD Template Matching
Shili Chen ... Yisheng Guan
-
Shili Chen, et. al.Shili Chen ... Yisheng Guan
01 Sep 2019
01 Sep 2019

A probabilistic framework for joint head tracking and pose estimation
S.O Ba ... J.M Odobez
-
S.O Ba, et. al.S.O Ba ... J.M Odobez
01 Jan 2004
01 Jan 2004

HMD-EgoPose: head-mounted display-based egocentric marker-less tool and hand pose estimation for augmented surgical guidance.
Mitchell Doughty ... Nilesh R Ghugre
International journal of computer assisted radiology and surgery | VOL. 17
Mitchell Doughty, et. al.Mitchell Doughty ... Nilesh R Ghugre
14 Jun 2022
International journal of computer assisted radiology and surgery | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

6D-VNet: End-To-End 6DoF Vehicle Pose Estimation From Monocular RGB Images

Abstract

Talk to us

Similar Papers