Abstract
Visual Question Answering (VQA) is an emerging AI research problem that combines computer vision, natural language processing, and knowledge representation and reasoning (KR). Given an image and a question about that image as input, it requires analyzing the visual components of the image, the type of question, and common-sense or general knowledge to predict the right answer. VQA is useful in various real-world applications, such as assisting visually impaired people, autonomous driving, and solving everyday tasks like spotting empty tables in restaurants, parks, or picnic spots. Since its introduction in 2014, many researchers have applied different techniques to Visual Question Answering, and various datasets have been introduced. This paper presents an overview of the available datasets and evaluation metrics used in the VQA area. The paper then surveys the different techniques used in the VQA domain, categorized by the mechanism they employ. Based on this detailed discussion and a performance comparison, we examine various challenges in the VQA domain and provide directions for future work.
International Journal of Next-Generation Computing