Abstract

An efficient method for simultaneous human body part segmentation and pose estimation is introduced. A conditional random field with a fully-connected graphical model is used. Possible node (image pixel) labels comprise of the human body parts and the background. In the human body skeleton model, the spatial dependencies among body parts are encoded in the definition of pairwise energy functions according to the conditional random fields. Proper pairwise edge potentials between image pixels are defined according to the presence or absence of human body parts that are near to each other. Various Gaussian kernels in position, color, and histogram of oriented gradients spaces are used for defining the pairwise energy terms. Shifted Gaussian kernels are defined between each two body parts that are connected to each other according to the human body skeleton model. As shifted Gaussian kernels impose a high computational cost to the inference, an efficient inference process is proposed by a mean field approximation method that uses high dimensional shifted Gaussian filtering. The experimental results evaluated on the challenging KTH Football, Leeds Sports Pose, HumanEva, and Penn-Fudan datasets show that the proposed method increases the per-pixel accuracy measure for human body part segmentation and also improves the probability of correct parts metric of human body joint locations.

Highlights

  • Human body part segmentation is the problem of segmenting a given image to human body (HB) parts and the background

  • A new and efficient method for simultaneous single-view human body part segmentation and pose estimation is introduced that opens a new approach to the problem of structured semantic segmentation

  • Despite the fact that shifted Gaussian kernels are used, it is shown that finding the minimum of the proposed energy function is possible by applying an efficient mean field approximation process

Read more

Summary

1- Introduction

Human body part segmentation is the problem of segmenting a given image to human body (HB) parts and the background. For semantic human segmentation, and on the HumanEva I [7], Extended Leeds Sports Pose [8], and KTH Football I [9] datasets show that the proposed method outperforms the method of Xia [10] that is the state-of-the-art in HB segmentation in terms of perpixel accuracy measure It achieves substantial improvement in finding the locations of corresponding joints according to the probability of correct pose (PCP) and probability of correct key points (PCK).

2- Related Work
4- Proposed Method
5- Experimental Results
6- Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.