Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task

Stan Weixian Lei,Wei Liu,Mengmi Zhang,Mike Zheng Shou,Jay Zhangjie Wu,Yuxuan Wang,Difei Gao

doi:10.1609/aaai.v37i1.25208

Abstract

VQA is an ambitious task aiming to answer any image-related question. However, in reality, it is hard to build such a system once for all since the needs of users are continuously updated, and the system has to implement new functions. Thus, Continual Learning (CL) ability is a must in developing advanced VQA systems. Recently, a pioneer work split a VQA dataset into disjoint answer sets to study this topic. However, CL on VQA involves not only the expansion of label sets (new Answer sets). It is crucial to study how to answer questions when deploying VQA systems to new environments (new Visual scenes) and how to answer questions requiring new functions (new Question types). Thus, we propose CLOVE, a benchmark for Continual Learning On Visual quEstion answering, which contains scene- and function-incremental settings for the two aforementioned CL scenarios. In terms of methodology, the main difference between CL on VQA and classification is that the former additionally involves expanding and preventing forgetting of reasoning mechanisms, while the latter focusing on class representation. Thus, we propose a real-data-free replay-based method tailored for CL on VQA, named Scene Graph as Prompt for Symbolic Replay. Using a piece of scene graph as a prompt, it replays pseudo scene graphs to represent the past images, along with correlated QA pairs. A unified VQA model is also proposed to utilize the current and replayed data to enhance its QA ability. Finally, experimental results reveal challenges in CLOVE and demonstrate the effectiveness of our method. Code and data are available at https://github.com/showlab/CLVQA.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 7

Similar Papers

Lightweight Visual Question Answering using Scene Graphs
Sai Vidyaranya Nuthalapati ... Maxime Kayser
-
Sai Vidyaranya Nuthalapati, et. al.Sai Vidyaranya Nuthalapati ... Maxime Kayser
26 Oct 2021
26 Oct 2021

DSGEM: Dual scene graph enhancement module‐based visual question answering
Boyue Wang ... Baocai Yin
IET Computer Vision | VOL. 17
Boyue Wang, et. al.Boyue Wang ... Baocai Yin
07 Mar 2023
IET Computer Vision | VOL. 17

Visual Question Answering over Scene Graph
Soohyeong Lee ... Joo Hyuk Jeon
-
Soohyeong Lee, et. al.Soohyeong Lee ... Joo Hyuk Jeon
01 Sep 2019
01 Sep 2019

A Comprehensive Survey of Scene Graphs: Generation and Application.
Xiaojun Chang ... Xiaojiang Chen
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Xiaojun Chang, et. al.Xiaojun Chang ... Xiaojiang Chen
01 Jan 2023
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence