Zero-shot Scene Graph Generation via Triplet Calibration and Reduction

Jiankai Li, Weixin Li, Yunhong Wang

doi:10.1145/3604284

Abstract

Scene Graph Generation (SGG) plays a pivotal role in downstream vision-language tasks. Existing SGG methods typically generalize poorly to unseen triplet compositions. They are generally trained on incompletely annotated scene graphs that contain dominant triplets, and they tend to be biased toward these seen triplets during inference. To address this issue, we propose a Triplet Calibration and Reduction (T-CAR) framework in this article. In our framework, a triplet calibration loss is first presented to regularize the representations of diverse triplets and, simultaneously, to excavate the unseen triplets in incompletely annotated training scene graphs. Moreover, the unseen space of scene graphs is usually several times larger than the seen space, since it contains a huge number of unrealistic compositions. We therefore propose an unseen space reduction loss that shifts the attention of excavation to reasonable unseen compositions, which facilitates model training. Finally, we propose a contextual encoder that improves compositional generalization to unseen triplets by explicitly modeling the relative spatial relations between subjects and objects. Extensive experiments show that our approach achieves consistent improvements for zero-shot SGG over state-of-the-art methods. The code is available at https://github.com/jkli1998/T-CAR.
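
To make the seen/unseen distinction concrete, the sketch below enumerates every (subject, predicate, object) composition for a toy vocabulary and splits the result into seen and unseen triplets. The vocabulary, the annotations, and the helper name `split_triplet_space` are hypothetical and chosen only for illustration; they are not taken from the T-CAR code base, and the sketch does not implement either proposed loss.

```python
from itertools import product


def split_triplet_space(objects, predicates, seen):
    """Partition all (subject, predicate, object) compositions into seen and unseen."""
    # Every composition the label vocabulary allows, realistic or not.
    all_triplets = set(product(objects, predicates, objects))
    # Keep only annotated triplets that are valid under the vocabulary.
    seen_set = set(seen) & all_triplets
    unseen_set = all_triplets - seen_set
    return seen_set, unseen_set


# Hypothetical toy vocabulary and annotations, for illustration only.
objects = ["person", "horse", "hat"]
predicates = ["riding", "wearing"]
seen = [
    ("person", "riding", "horse"),
    ("person", "wearing", "hat"),
]

seen_set, unseen_set = split_triplet_space(objects, predicates, seen)
print(len(seen_set), len(unseen_set))  # prints: 2 16
```

Even with three object classes and two predicates, the unseen space (16 triplets) is far larger than the seen space (2), and most of it consists of unrealistic compositions such as ("hat", "riding", "person"). Under this reading, an unseen space reduction step would aim to focus training on the plausible remainder, for example ("person", "wearing", "hat")-like compositions that simply were not annotated.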

More From: ACM Transactions on Multimedia Computing, Communications, and Applications
