Hypergraph Transformer: Weakly-Supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering

Yu-Jung Heo,Eun-Sol Kim,Woo Suk Choi,Byoung-Tak Zhang

doi:10.18653/v1/2022.acl-long.29

Abstract

Knowledge-based visual question answering (QA) aims to answer a question which requires visually-grounded external knowledge beyond image content itself. Answering complex questions that require multi-hop reasoning under weak supervision is considered as a challenging problem since i) no supervision is given to the reasoning process and ii) high-order semantics of multi-hop knowledge facts need to be captured. In this paper, we introduce a concept of hypergraph to encode high-level semantics of a question and a knowledge base, and to learn high-order associations between them. The proposed model, Hypergraph Transformer, constructs a question hypergraph and a query-aware knowledge hypergraph, and infers an answer by encoding inter-associations between two hypergraphs and intra-associations in both hypergraph itself. Extensive experiments on two knowledge-based visual QA and two knowledge-based textual QA demonstrate the effectiveness of our method, especially for multi-hop reasoning problem. Our source code is available at https://github.com/yujungheo/kbvqa-public.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hypergraph Transformer: Weakly-Supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2022
Citations: 15	License type: cc-by

Similar Papers

Hypergraph Transformer: Weakly-Supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
...
-
, et. al. ...
07 May 2022
07 May 2022

Crake: Causal-Enhanced Table-Filler for Question Answering over Large Scale Knowledge Base
...
-
, et. al. ...
27 Jun 2022
27 Jun 2022

Multi-Task Learning with Multi-View Attention for Answer Selection and Knowledge Base Question Answering
Yang Deng ... Yuexiang Xie
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33
Yang Deng, et. al.Yang Deng ... Yuexiang Xie
17 Jul 2019
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33

BamnetTL: Bidirectional Attention Memory Network with Transfer Learning for Question Answering Matching
Lei Su ... Jiazhi Guo
International Journal of Intelligent Systems | VOL. 2023
Lei Su, et. al.Lei Su ... Jiazhi Guo
03 Aug 2023
International Journal of Intelligent Systems | VOL. 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hypergraph Transformer: Weakly-Supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering

Abstract

Talk to us

Similar Papers