Abstract

Automatically estimating 3D human poses in video and inferring their meanings play an essential role in many human-centered automation systems. Existing research has made remarkable progress by first estimating 2D human joints in video and then reconstructing the 3D human pose from those 2D joints. However, mono-directionally reconstructing 3D pose from 2D joints ignores the interaction between information in 3D space and 2D space and loses rich information in the original video, thereby limiting the achievable estimation accuracy. To this end, this paper proposes a bidirectional 2D-3D transformation framework that bidirectionally exchanges 2D and 3D information and utilizes video information to estimate an offset for refining the 3D human pose. In addition, a bone-length stability loss is employed to exploit human body structure, making the estimated 3D pose more natural and further increasing the overall accuracy. In evaluation, the estimation error of the proposed method, measured by the mean per joint position error (MPJPE), is only 46.5 mm, which is much lower than that of state-of-the-art methods under the same experimental conditions. This improvement in accuracy will help machines better understand human poses for building superior human-centered automation systems. Note to Practitioners: This paper was motivated by the need of human-centered automation systems to accurately understand human poses. Existing approaches mainly infer 3D human pose from 2D joints mono-directionally. Although they have made remarkable contributions to estimating 3D human pose in this mono-directional way, we found that they ignore the 2D-3D interaction and do not use the original video when inferring 3D pose from 2D joints.
This paper therefore proposes a bidirectional 2D-3D transformation that exchanges 2D and 3D information and utilizes video information to estimate a more accurate 3D human pose for human-centered automation systems. This work is a pioneering attempt at interactively using 2D and 3D information for more accurate human pose estimation. Benefiting from its state-of-the-art accuracy, the proposed approach is expected to make significant contributions to many human-centered automation systems, such as human-machine interaction, biomimetic manipulation, and automatic surveillance.
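The abstract reports accuracy as the mean per joint position error (MPJPE), i.e., the average Euclidean distance between predicted and ground-truth 3D joint positions across all joints and frames. A minimal sketch of this standard metric is below; the array shapes and the toy data are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def mpjpe(pred, gt):
    """Mean per joint position error: the average Euclidean distance
    between predicted and ground-truth 3D joints, in the same units
    as the inputs (commonly millimetres).

    pred, gt: arrays of shape (num_frames, num_joints, 3).
    """
    # Per-joint Euclidean distance, then average over joints and frames.
    return np.linalg.norm(pred - gt, axis=-1).mean()

# Toy example with hypothetical data (17 joints, as in Human3.6M-style skeletons):
gt = np.zeros((2, 17, 3))
pred = gt + np.array([3.0, 0.0, 4.0])  # every joint offset by a 3-4-5 vector
print(mpjpe(pred, gt))  # 5.0
```

A reported MPJPE of 46.5 mm thus means each estimated joint lies, on average, 46.5 mm from its ground-truth position.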
