Semantic–Structural Graph Convolutional Networks for Whole-Body Human Pose Estimation

Weiwei Li,Shudong Chen,Rong Du

doi:10.3390/info13030109

Abstract

Existing whole-body human pose estimation methods mostly segment the parts of the body’s hands and feet for specific processing, which not only splits the overall semantics of the body, but also increases the amount of calculation and the complexity of the model. To address these drawbacks, we designed a novel semantic–structural graph convolutional network (SSGCN) for whole-body human pose estimation tasks, which leverages the whole-body graph structure to analyze the semantics of the whole-body keypoints through a graph convolutional network and improves the accuracy of pose estimation. Firstly, we introduced a novel heat-map-based keypoint embedding, which encodes the position information and feature information of the keypoints of the human body. Secondly, we propose a novel semantic–structural graph convolutional network consisting of several sets of cascaded structure-based graph layers and data-dependent whole-body non-local layers. Specifically, the proposed method extracts groups of keypoints and constructs a high-level abstract body graph to process the high-level semantic information of the whole-body keypoints. The experimental results showed that our method achieved very promising results on the challenging COCO whole-body dataset.

Highlights

Human pose estimation is a challenging computer vision task, which aims to locate the human body keypoints in images and videos
This work presents a novel graph convolutional network framework for whole-body human pose estimation tasks, which leverages the whole-body graph structure to analyze the semantics of each part of the body through the graph convolutional network; We propose a novel heat-map-based keypoint embedding module, which encodes the position information and feature information of the keypoints of the human body; The proposed semantic–structural graph convolutional network consists of a structurebased graph layer to capture skeleton structure information and a data-dependent non-local layer to analyze the long-range grouped joint features; We represent groups of keypoints and construct a high-level abstract body graph to process the high-level semantic information of the whole-body keypoints
We performed the semantic fusion of whole-body poses based on the whole-body skeleton and leveraged the heat-map-based graph convolutional network to calibrate human whole-body human pose estimation

Summary

Introduction

Human pose estimation is a challenging computer vision task, which aims to locate the human body keypoints in images and videos. Our main contributions are summarized as follows: This work presents a novel graph convolutional network framework for whole-body human pose estimation tasks, which leverages the whole-body graph structure to analyze the semantics of each part of the body through the graph convolutional network; We propose a novel heat-map-based keypoint embedding module, which encodes the position information and feature information of the keypoints of the human body; The proposed semantic–structural graph convolutional network consists of a structurebased graph layer to capture skeleton structure information and a data-dependent non-local layer to analyze the long-range grouped joint features; We represent groups of keypoints and construct a high-level abstract body graph to process the high-level semantic information of the whole-body keypoints.

Human Pose Estimation

Whole-Body Pose Estimation

Heat-Map-Based Skeletal–Structural Graph Convolutional Network

Heat-Map-Based Keypoint Position Embedding

Heat-Map-Based Keypoint Feature Embedding

Skeletal–Structural Graph Convolutional Network

Structure-Based Graph Layer

Data-Dependent Non-Local Layer

Keypoint Group Representations

Keypoint-Based Pose Estimation

Loss Functions

Datasets and Metrics

Implementation Details

Experimental Results

Method

Analysis

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information	Publication Date: Feb 25, 2022
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Semantic–Structural Graph Convolutional Networks for Whole-Body Human Pose Estimation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information

Lead the way for us

Similar Papers

Structure-aware human pose estimation with graph convolutional networks
Yanrui Bin ... Nong Sang
Pattern Recognition | VOL. 106
Yanrui Bin, et. al.Yanrui Bin ... Nong Sang
16 May 2020
Pattern Recognition | VOL. 106

Locally Connected Network for Monocular 3D Human Pose Estimation.
Hai Ci ... Yizhou Wang
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 44
Hai Ci, et. al.Hai Ci ... Yizhou Wang
24 Aug 2020
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 44

Motion Capture for Sporting Events Based on Graph Convolutional Neural Networks and Single Target Pose Estimation Algorithms
Chengpeng Duan ... Bingliang Hu
Applied Sciences | VOL. 13
Chengpeng Duan, et. al.Chengpeng Duan ... Bingliang Hu
27 Jun 2023
Applied Sciences | VOL. 13

Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos
Yu Cheng ... Bo Wang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
Yu Cheng, et. al.Yu Cheng ... Bo Wang
18 May 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semantic–Structural Graph Convolutional Networks for Whole-Body Human Pose Estimation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information