Monocular 3D object detection with thermodynamic loss and decoupled instance depth

Gang Liu, Xiaoxiao Xie, Qingchen Yu

doi:10.1080/09540091.2024.2316022

Abstract

Monocular 3D object detection aims to recover the 3D information of objects from a single image. Mainstream methods use an L1 or L1-like loss to supervise instance depth prediction, but they have not achieved satisfactory results. One main reason is that an L1 or L1-like loss does not accurately reflect how well the predicted instance depth fits the corresponding ground truth. Another is that instance depth is hard for a network to learn directly from an RGB image. To address these problems, this paper proposes a novel thermodynamic loss based on the principle of free energy minimisation, together with a novel depth decoupling method. The proposed method is called the monocular 3D object detection network with thermodynamic loss and decoupled instance depth (TDN). TDN treats the optimisation of instance depth prediction as a thermodynamic process, and the thermodynamic loss is designed according to the principle of free energy minimisation. TDN also decouples the instance depth into three different depths. The final instance depth is obtained by combining the thermodynamic loss with these different types of depth.
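For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of an L1 loss and one common L1-like variant (smooth L1) applied to instance depth values. It illustrates only the conventional losses that the paper argues against, not the thermodynamic loss, whose formulation is not given in the abstract. The function names, the `beta` threshold, and the sample depth values are assumptions made for illustration.

```python
def l1_loss(pred, target):
    """Mean absolute error between predicted and ground-truth depths."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def smooth_l1_loss(pred, target, beta=1.0):
    """Smooth L1: quadratic for errors below beta, linear otherwise.

    beta is an assumed threshold chosen for illustration.
    """
    total = 0.0
    for p, t in zip(pred, target):
        err = abs(p - t)
        if err < beta:
            total += 0.5 * err * err / beta
        else:
            total += err - 0.5 * beta
    return total / len(pred)


# Hypothetical instance depths in metres (not taken from the paper).
pred_depth = [10.0, 25.5, 40.0]
gt_depth = [12.0, 25.0, 43.0]

print("L1 loss:", l1_loss(pred_depth, gt_depth))
print("Smooth L1 loss:", smooth_l1_loss(pred_depth, gt_depth))
```

Both losses depend only on the per-instance absolute error, which is the property the abstract identifies as a poor measure of fit between prediction and ground truth.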


Journal: Connection Science
Publication Date: Feb 13, 2024
License type: CC BY 4.0
