Abstract

The visual question generation task aims to generate high-quality questions about a given image. To make this task applicable to various scenarios, e.g., the growing demand for exams, it is important to generate diverse questions. Existing methods control diverse question generation through different question types, e.g., “what” and “when.” Although different question types lead to diversity in wording, they cannot guarantee semantic diversity when the questions ask about the same objects. Research in psychology shows that humans attend to different objects in an image according to their preferences, which is beneficial for constructing semantically diverse questions. Motivated by this finding, we propose a multi-selector visual question generation (MS-VQG) model that focuses on different objects to generate diverse questions. Specifically, our MS-VQG model employs multiple selectors that imitate different humans, each selecting different objects in a given image. Conditioned on these different selected objects, the model generates a distinct question for each selector. Extensive experiments on two datasets show that our proposed model outperforms the baselines in generating diverse questions.
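To make the multi-selector idea concrete, the sketch below shows one plausible way such selectors could be realized: each selector scores detected object features and keeps its own top-k objects, which would then condition a separate question decoder. This is a minimal illustration under our own assumptions; the class name MultiSelector, the linear scoring heads, the top-k selection scheme, and all dimensions are hypothetical, as the abstract does not specify the architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiSelector(nn.Module):
    """Illustrative multi-selector (hypothetical design): each selector
    scores the detected object features and keeps its own top-k objects,
    imitating different human preferences."""

    def __init__(self, feat_dim: int, num_selectors: int, top_k: int = 3):
        super().__init__()
        self.top_k = top_k
        # One scoring head per selector, so each selector can learn to
        # prefer different objects in the same image.
        self.scorers = nn.ModuleList(
            nn.Linear(feat_dim, 1) for _ in range(num_selectors)
        )

    def forward(self, obj_feats: torch.Tensor):
        # obj_feats: (batch, num_objects, feat_dim) region features,
        # e.g., from an off-the-shelf object detector.
        selected = []
        for scorer in self.scorers:
            scores = scorer(obj_feats).squeeze(-1)            # (batch, num_objects)
            weights = F.softmax(scores, dim=-1)
            topk = weights.topk(self.top_k, dim=-1).indices   # per-selector objects
            idx = topk.unsqueeze(-1).expand(-1, -1, obj_feats.size(-1))
            selected.append(obj_feats.gather(1, idx))         # (batch, top_k, feat_dim)
        return selected  # one object subset per selector

# Toy usage: 2 images, 36 detected objects, 3 selectors -> 3 object subsets,
# each of which would condition its own question decoder.
feats = torch.randn(2, 36, 512)
subsets = MultiSelector(feat_dim=512, num_selectors=3)(feats)
print([s.shape for s in subsets])  # three tensors of shape (2, 3, 512)
```

In this sketch, semantic diversity comes from the selectors attending to disjoint object subsets rather than from varying the question type, matching the intuition described in the abstract.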
