Fashion Parsing With Video Context

Si Liu,Ke Lu,Liang Lin,Xiaochun Cao,Shuicheng Yan,Xiaodan Liang,Luoqi Liu

doi:10.1109/tmm.2015.2443559

Abstract

In this paper, we propose a novel semi- supervised learning strategy to address human parsing. Existing human parsing datasets are relatively small due to the required tedious human labeling. We present a general, affordable and scalable solution, which harnesses the rich contexts in those easily available web videos to boost any existing human parser. First, we crawl a large number of unlabeled videos from the web. Then for each video, the cross-frame contexts are utilized for human pose co- estimation , and then video co-parsing to obtain satisfactory human parsing results for all frames. More specifically, SIFT flow and super-pixel matching are used to build correspondences across different frames, and these correspondences then contextualize the pose estimation and human parsing in individual frames. Finally these parsed video frames are used as the reference corpus for the non-parametric human parsing component of the whole solution. To further improve the accuracy of video co-parsing, we propose an active learning method to incorporate human guidance, where the labelers are required to assess the accuracies of the pose estimation results of certain selected video frames. Then we take reliable frames as the seed frames to guide the video pose co-estimation. Our human parsing framework can then easily incorporate the human feedback to train a better fashion parser. Extensive experiments on two benchmark fashion datasets as well as a newly collected challenging Fashion Icon dataset well demonstrate the encouraging performance gain from our general pipeline for human parsing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fashion Parsing With Video Context

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia

Lead the way for us

Journal: IEEE Transactions on Multimedia	Publication Date: Aug 1, 2015
Citations: 83

Similar Papers

Fashion Parsing with Video Context
Si Liu ... Ke Lu
-
Si Liu, et. al.Si Liu ... Ke Lu
03 Nov 2014
03 Nov 2014

Mask-Guided Deformation Adaptive Network for Human Parsing
Aihua Mao ... Yuan Liang
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 18
Aihua Mao, et. al.Aihua Mao ... Yuan Liang
31 Jan 2022
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 18

Towards Unified Human Parsing and Pose Estimation
Jian Dong ... Shuicheng Yan
-
Jian Dong, et. al.Jian Dong ... Shuicheng Yan
01 Jun 2014
01 Jun 2014

Heterogeneous Interactive Attention Network for Human Parsing
Wenjia Wang ... Jiale Wang
-
Wenjia Wang, et. al.Wenjia Wang ... Jiale Wang
09 Oct 2022
09 Oct 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fashion Parsing With Video Context

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia