CE-BART: Cause-and-Effect BART for Visual Commonsense Generation

Junyeong Kim,Chang D Yoo,Ji Woo Hong,Sunjae Yoon

doi:10.3390/s22239399

Junyeong Kim, Chang D Yoo + Show 2 more

Open Access

https://doi.org/10.3390/s22239399

Copy DOI

Abstract

"A Picture is worth a thousand words". Given an image, humans are able to deduce various cause-and-effect captions of past, current, and future events beyond the image. The task of visual commonsense generation has the aim of generating three cause-and-effect captions for a given image: (1) what needed to happen before, (2) what is the current intent, and (3) what will happen after. However, this task is challenging for machines, owing to two limitations: existing approaches (1) directly utilize conventional vision-language transformers to learn relationships between input modalities and (2) ignore relations among target cause-and-effect captions, but consider each caption independently. Herein, we propose Cause-and-Effect BART (CE-BART), which is based on (1) a structured graph reasoner that captures intra- and inter-modality relationships among visual and textual representations and (2) a cause-and-effect generator that generates cause-and-effect captions by considering the causal relations among inferences. We demonstrate the validity of CE-BART on the VisualCOMET and AVSD benchmarks. CE-BART achieved SOTA performance on both benchmarks, while an extensive ablation study and qualitative analysis demonstrated the performance gain and improved interpretability.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors	Publication Date: Dec 2, 2022
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

CE-BART: Cause-and-Effect BART for Visual Commonsense Generation

Abstract

Talk to us

Similar Papers

More From: Sensors

Lead the way for us

Similar Papers

The impact of group identity on the interaction between collective memory and collective future thinking negativity: Evidence from a Turkish sample.
Deniz Hacıbektaşoğlu ... Ali İ Tekcan
Memory & Cognition | VOL. 51
Deniz Hacıbektaşoğlu, et. al.Deniz Hacıbektaşoğlu ... Ali İ Tekcan
06 Jun 2022
Memory & Cognition | VOL. 51

Chemical understanding and graphing skills in an honors case‐based computerized chemistry laboratory environment: The value of bidirectional visual and textual representations
Yehudit J Dori ... Irit Sasson
Journal of Research in Science Teaching | VOL. 45
Yehudit J Dori, et. al.Yehudit J Dori ... Irit Sasson
15 Jan 2008
Journal of Research in Science Teaching | VOL. 45

Prescribed journeys through life: Cultural differences in mental time travel between Middle Easterners and Scandinavians.
Christina Lundsgaard Ottsen ... Dorthe Berntsen
Consciousness and Cognition | VOL. 37
Christina Lundsgaard Ottsen, et. al.Christina Lundsgaard Ottsen ... Dorthe Berntsen
29 Sep 2015
Consciousness and Cognition | VOL. 37

"I can see clearly now": the effect of cue imageability on mental time travel.
Katrine W Rasmussen ... Dorthe Berntsen
Memory & Cognition | VOL. 42
Katrine W Rasmussen, et. al.Katrine W Rasmussen ... Dorthe Berntsen
30 May 2014
Memory & Cognition | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CE-BART: Cause-and-Effect BART for Visual Commonsense Generation

Abstract

Talk to us

Similar Papers

More From: Sensors