Slim DensePose: Thrifty Learning From Sparse Annotations and Motion Cues

Natalia Neverova,Iasonas Kokkinos,Riza Alp Guler,James Thewlis,Andrea Vedaldi

doi:10.1109/cvpr.2019.01117

Abstract

DensePose supersedes traditional landmark detectors by densely mapping image pixels to body surface coordinates. This power, however, comes at a greatly increased annotation cost, as supervising the model requires to manually label hundreds of points per pose instance. In this work, we thus seek methods to significantly slim down the DensePose annotations, proposing more efficient data collection strategies. In particular, we demonstrate that if annotations are collected in video frames, their efficacy can be multiplied for free by using motion cues. To explore this idea, we introduce DensePose-Track, a dataset of videos where selected frames are annotated in the traditional DensePose manner. Then, building on geometric properties of the DensePose mapping, we use the video dynamic to propagate ground-truth annotations in time as well as to learn from Siamese equivariance constraints. Having performed exhaustive empirical evaluation of various data annotation and learning strategies, we demonstrate that doing so can deliver significantly improved pose estimation results over strong baselines. However, despite what is suggested by some recent works, we show that merely synthesizing motion patterns by applying geometric transformations to isolated frames is significantly less effective, and that motion cues help much more when they are extracted from videos.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Slim DensePose: Thrifty Learning From Sparse Annotations and Motion Cues

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Click Carving: Interactive Object Segmentation in Images and Videos with Point Clicks
Suyog Dutt Jain ... Kristen Grauman
International Journal of Computer Vision | VOL. 127
Suyog Dutt Jain, et. al.Suyog Dutt Jain ... Kristen Grauman
27 May 2019
International Journal of Computer Vision | VOL. 127

Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks.
Ziyi Liu ... Qilin Zhang
IEEE Transactions on Image Processing | VOL. 27
Ziyi Liu, et. al.Ziyi Liu ... Qilin Zhang
30 Jul 2018
IEEE Transactions on Image Processing | VOL. 27

Data-centric annotation analysis for plant disease detection: Strategy, consistency, and performance.
Jiuqing Dong ... Alvaro Fuentes
Frontiers in Plant Science | VOL. 13
Jiuqing Dong, et. al.Jiuqing Dong ... Alvaro Fuentes
07 Dec 2022
Frontiers in Plant Science | VOL. 13

Discovering Primary Objects in Videos by Saliency Fusion and Iterative Appearance Estimation
Jiong Yang ... Xiaohui Shen
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 26
Jiong Yang, et. al.Jiong Yang ... Xiaohui Shen
01 Jun 2016
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Slim DensePose: Thrifty Learning From Sparse Annotations and Motion Cues

Abstract

Talk to us

Similar Papers