Dataset bias: A case study for visual question answering

Anubrata Das,Samreen Anjum,Danna Gurari

doi:10.1002/pra2.7

Dataset bias: A case study for visual question answering

Anubrata Das, Samreen Anjum + Show 1 more

Open Access

https://doi.org/10.1002/pra2.7

Copy DOI

Journal: Proceedings of the Association for Information Science and Technology	Publication Date: Jan 1, 2019
Citations: 8	License type: publisher-specific, author manuscript

Affiliation: The University of Texas at Austin

#Visual Question #Visual Question Answering + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

ABSTRACTWe examine the issue of bias in datasets designed to train visual question answering (VQA) algorithms. These datasets include a collection of natural language questions about images (aka ‐ visual questions). We consider three popular datasets that are captured by people with sight, people who are blind, and generated by computers. We first demonstrate that machine learning algorithms can be trained to recognize each dataset's bias, and so determine the source of a novel visual question. We then discuss potential risks and benefits of biased VQA datasets and corresponding machine learning algorithms that can identify the source of a visual question; e.g., whether it comes from a person with sight, a person who is blind, or bot (aka ‐ computer). Our ultimate aim is to inspire the development of more inclusive VQA systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Proceedings of the Association for Information Science and Technology

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.