Reducing the Annotation Effort for Video Object Segmentation Datasets

Paul Voigtlaender, Yong Jiang, Bastian Leibe, Chun Yuan, Lishu Luo

doi:10.1109/wacv48630.2021.00310

Open Access

Abstract

For further progress in video object segmentation (VOS), larger, more diverse, and more challenging datasets will be necessary. However, densely labeling every frame with pixel masks does not scale to large datasets. We use a deep convolutional network to automatically create pseudo-labels on a pixel level from much cheaper bounding box annotations and investigate how far such pseudo-labels can carry us for training state-of-the-art VOS approaches. A very encouraging result of our study is that adding a manually annotated mask in only a single video frame for each object is sufficient to generate pseudo-labels which can be used to train a VOS method to reach almost the same performance level as when training with fully segmented videos. We use this workflow to create pixel pseudo-labels for the training set of the challenging tracking dataset TAO, and we manually annotate a subset of the validation set. Together, we obtain the new TAO-VOS benchmark, which we make publicly available at www.vision.rwth-aachen.de/page/taovos. While the performance of state-of-the-art methods on existing datasets starts to saturate, TAO-VOS remains very challenging for current algorithms and reveals their shortcomings.
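To make the idea of pseudo-label quality concrete, the following is a minimal sketch of how a mask derived from a bounding box could be compared against a manually annotated mask using intersection over union (the Jaccard index). It is not the method described in the abstract: the function `box_to_mask` is a hypothetical, deliberately naive stand-in that fills the whole box, whereas the abstract describes a deep convolutional network for this step. The toy 4x4 frame and all names are assumptions made for illustration.

```python
from typing import List, Tuple

Mask = List[List[int]]           # binary mask as rows of 0/1 values
Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1), with x1 and y1 exclusive


def box_to_mask(box: Box, height: int, width: int) -> Mask:
    """Hypothetical naive stand-in for a box-to-mask model: fill the whole box."""
    x0, y0, x1, y1 = box
    return [
        [1 if (x0 <= x < x1 and y0 <= y < y1) else 0 for x in range(width)]
        for y in range(height)
    ]


def mask_iou(a: Mask, b: Mask) -> float:
    """Intersection over union (Jaccard index) of two equally sized binary masks."""
    inter = sum(pa & pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    union = sum(pa | pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    # Two empty masks are treated as a perfect match.
    return inter / union if union else 1.0


# Toy 4x4 frame (assumed data): the object covers 3 of the 4 pixels in its box.
reference = [
    [0, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
pseudo = box_to_mask((1, 1, 3, 3), height=4, width=4)
score = mask_iou(pseudo, reference)  # 3 shared pixels / 4 pixels in the union
```

In this toy case the filled box overshoots the object by one pixel, so the overlap score drops below 1. A learned box-to-mask model would aim to close that gap, which is the kind of pseudo-label quality the abstract's study depends on.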
