3D Human Pose Estimation Using Two-Stream Architecture with Joint Training

Jian Kang, Rui Liu, Dongsheng Zhou, Yijing Li, Wanshu Fan

doi:10.32604/cmes.2023.024420

Abstract

With the advancement of image sensing technology, estimating 3D human pose from monocular video has become an active research topic in computer vision. 3D human pose estimation is an essential prerequisite for subsequent action analysis and understanding, and it enables a wide range of applications in areas such as intelligent transportation, human-computer interaction, and medical rehabilitation. Some current methods for 3D human pose estimation in monocular video employ temporal convolutional networks (TCNs) to extract inter-frame feature relationships, but most of them extract these relationships insufficiently. In this paper, we decompose 3D joint location regression into bone direction estimation and bone length estimation. To estimate bone direction, we propose TCG, a temporal convolutional network incorporating Gaussian error linear units (GELU), which captures more inter-frame features and makes fuller use of the feature relationships in the data. To estimate bone length, we adopt kinematic structural information, which enhances the use of intra-frame joint features. Finally, we design a loss function for jointly training the bone direction estimation network and the bone length estimation network. The proposed method was evaluated extensively on the public benchmark dataset Human3.6M. Both quantitative and qualitative results show that the proposed method achieves more accurate 3D human pose estimation.
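The decomposition of joint locations into bone direction and bone length can be illustrated with a minimal sketch. This is not the authors' implementation: the five-joint skeleton, the `PARENTS` table, and the function names `decompose` and `recompose` are hypothetical and chosen only for illustration. The sketch assumes each joint has a single parent and that parents are listed before their children.

```python
import math

# Hypothetical 5-joint skeleton (an assumption, not the Human3.6M layout):
# PARENTS[j] is the parent of joint j, and -1 marks the root.
PARENTS = [-1, 0, 1, 0, 3]

def decompose(joints, parents):
    """Split joint positions into per-bone unit directions and lengths."""
    directions, lengths = {}, {}
    for j, p in enumerate(parents):
        if p < 0:
            continue  # the root has no incoming bone
        bone = [c - q for c, q in zip(joints[j], joints[p])]
        length = math.sqrt(sum(v * v for v in bone))
        lengths[j] = length
        # Guard against zero-length bones to avoid dividing by zero.
        directions[j] = [v / length for v in bone] if length > 0 else [0.0, 0.0, 0.0]
    return directions, lengths

def recompose(root, directions, lengths, parents):
    """Rebuild joint positions by walking from the root along each bone."""
    joints = [None] * len(parents)
    for j, p in enumerate(parents):
        if p < 0:
            joints[j] = list(root)
        else:
            # Relies on parents appearing before their children in the table.
            joints[j] = [q + lengths[j] * d for q, d in zip(joints[p], directions[j])]
    return joints

# Made-up 3D joint coordinates for one frame.
joints = [
    [0.0, 0.0, 0.0],
    [0.0, 0.5, 0.0],
    [0.0, 0.5, 0.3],
    [0.2, 0.0, 0.0],
    [0.2, -0.4, 0.0],
]
directions, lengths = decompose(joints, PARENTS)
rebuilt = recompose(joints[0], directions, lengths, PARENTS)
```

In a two-stream setting of the kind the abstract describes, one network would predict the directions and another the lengths; recombining them as in `recompose` yields joint positions on which a joint loss could be computed. The form of that loss is not given in the abstract, so it is left out here.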


Journal: Computer Modeling in Engineering & Sciences
Publication Date: Jan 1, 2023
License type: cc-by

Similar Papers

An Improved 3D Human Pose Estimation Model Based on Temporal Convolution with Gaussian Error Linear Units
Jian Kang ... Dongsheng Zhou
26 May 2022

Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos
Yu Cheng ... Robby T Tan
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
18 May 2021

Motion Projection Consistency-Based 3-D Human Pose Estimation With Virtual Bones From Monocular Videos
Guangming Wang ... Honghao Zeng
IEEE Transactions on Cognitive and Developmental Systems | VOL. 15
01 Jun 2023

Self-Supervised Learning of 3D Human Pose Using Multi-View Geometry
Muhammed Kocabas ... Emre Akbas
01 Jun 2019
