Abstract

Unlabeled data is often used to improve the generalization ability of a segmentation model. However, this practice tends to neglect the inherent difficulty of unlabeled samples and can produce inaccurate pseudo masks in unseen scenes, resulting in severe confirmation bias and potential performance degradation. These observations motivate two unexplored questions for newly arriving data: (1) how many images do we need to annotate, and (2) how should we annotate them? In this paper, we successively propose two shadow detectors, SDTR and SDTR+, based on the Transformer architecture and a self-training scheme. The main difference between them is whether weak annotations are required for part of the unlabeled data. Specifically, in SDTR, we first introduce an image-level sample selection scheme that separates the unlabeled data into reliable and unreliable samples according to the stability of their holistic predictions. We then perform selective retraining to exploit the unlabeled images progressively, in a curriculum-learning manner. In SDTR+, we further provide various weak labels (i.e., point, box, and scribble) for the remaining unreliable samples and design corresponding loss functions, achieving a better trade-off between performance improvement and annotation cost. Experimental results on public benchmarks (i.e., SBU, UCF, and ISTD) show that both SDTR and SDTR+ compare favorably against state-of-the-art methods.
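To make the sample-selection step concrete, below is a minimal PyTorch-style sketch of stability-based selection, assuming the detector is snapshotted at several training stages and that stability is scored as the mean IoU between each earlier snapshot's pseudo mask and the latest one. The function name, the IoU criterion, and the keep_ratio split are illustrative assumptions rather than the paper's exact formulation.

    import torch

    def select_reliable(models, loader, keep_ratio=0.5, device="cuda"):
        # `models`: the same shadow detector saved at two or more training
        # stages (a hypothetical setup; the paper's stability measure may
        # differ). `loader` is assumed to yield (index, image) pairs of
        # unlabeled data.
        for model in models:
            model.eval().to(device)
        scores = []
        with torch.no_grad():
            for idx, image in loader:
                image = image.to(device)
                # Binarize each snapshot's predicted shadow mask.
                masks = [(m(image).sigmoid() > 0.5).float() for m in models]
                ref = masks[-1]  # latest snapshot as the reference prediction
                ious = []
                for mask in masks[:-1]:
                    inter = (mask * ref).sum()
                    union = ((mask + ref) > 0).float().sum().clamp(min=1)
                    ious.append((inter / union).item())
                # Higher mean IoU across stages = more stable = more reliable.
                scores.append((idx, sum(ious) / len(ious)))
        scores.sort(key=lambda s: s[1], reverse=True)
        cut = int(len(scores) * keep_ratio)
        reliable = [i for i, _ in scores[:cut]]
        unreliable = [i for i, _ in scores[cut:]]
        return reliable, unreliable

Under this reading, the reliable subset would be used for pseudo-label retraining first, with the unreliable subset deferred to a later curriculum stage (SDTR) or given weak annotations instead (SDTR+).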
