CS-VQA: Visual Question Answering with Compressively Sensed Images

Li-Chi Huang,Suhas Lohit,Suren Jayasuriya,Pavan Turaga,Kuldeep Kulkarni,Anik Jha

doi:10.1109/icip.2018.8451445

CS-VQA: Visual Question Answering with Compressively Sensed Images

Li-Chi Huang, Suhas Lohit + Show 4 more

Open Access

https://doi.org/10.1109/icip.2018.8451445

Copy DOI

Publication Date: Oct 1, 2018

Citations: 38

Affiliation: Arizona State University, Carnegie Mellon University

#Visual Question Answering #Visual Question Answering Performance + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Visual Question Answering (VQA) is a complex semantic task requiring both natural language processing and visual recognition. In this paper, we explore whether VQA is solvable when images are captured in a sub-Nyquist compressive paradigm. We develop a series of deep-network architectures that exploit available compressive data to increasing degrees of accuracy, and show that VQA is indeed solvable in the compressed domain. Our results show that there is nominal degradation in VQA performance when using compressive measurements, but that accuracy can be recovered when VQA pipelines are used in conjunction with state-of-the-art deep neural networks for CS reconstruction. The results presented yield important implications for resource-constrained VQA applications.

Full Text