SentiStory: A Multi-Layered Sentiment-Aware Generative Model for Visual Storytelling

Wei Chen,Xuefeng Liu,Jianwei Niu

doi:10.1109/tcsvt.2022.3183648

Abstract

The visual storytelling (VIST) task aims at generating reasonable, human-like and coherent stories with the image streams as input. Although many deep learning models have achieved promising results, most of them do not directly leverage the sentiment information of stories. In this paper, we propose a sentiment-aware generative model for VIST called SentiStory. The key of SentiStory is a multi-layered sentiment extraction module (MLSEM). For a given image stream, the higher layer gives coarse-grained but accurate sentiments, while the lower layer of the MLSEM extracts fine-grained but usually unreliable ones. The two layers are combined strategically to generate coherent and rich visual sentiment concepts for the VIST task. Results from both automatic and human evaluations demonstrate that with the help of the MLSEM, SentiStory achieves improvement in generating more coherent and human-like stories.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SentiStory: A Multi-Layered Sentiment-Aware Generative Model for Visual Storytelling

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Nov 1, 2022
Citations: 4

Similar Papers

Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling
Pengcheng Yang ... Zhiyi Yin
-
Pengcheng Yang, et. al.Pengcheng Yang ... Zhiyi Yin
01 Aug 2019
01 Aug 2019

Associative Learning Network for Coherent Visual Storytelling
Xin Li ... Chunping Liu
-
Xin Li, et. al.Xin Li ... Chunping Liu
04 Jun 2023
04 Jun 2023

Storytelling from an Image Stream Using Scene Graphs
Ruize Wang ... Piji Li
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
Ruize Wang, et. al.Ruize Wang ... Piji Li
03 Apr 2020
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34

Knowledge-Enriched Attention Network With Group-Wise Semantic for Visual Storytelling.
Tengpeng Li ... Chang Wen Chen
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Tengpeng Li, et. al.Tengpeng Li ... Chang Wen Chen
01 Jul 2023
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SentiStory: A Multi-Layered Sentiment-Aware Generative Model for Visual Storytelling

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology