A Multi-Modal Story Generation Framework with AI-Driven Storyline Guidance

Juntae Kim, Hogeon Yu, Yoonseok Heo, Jongho Nang

doi:10.3390/electronics12061289

Open Access

Journal: Electronics
Publication Date: Mar 8, 2023
Citations: 2
License type: CC BY 4.0

Affiliation: Sogang University

Abstract

An automatic story generation system continuously generates stories with a natural plot. The major challenge in automatic story generation is maintaining coherence between consecutively generated stories without human intervention. To address this, we propose a novel multi-modal story generation framework with automated storyline decision-making. The framework consists of three independent models: a transformer encoder-based storyline guidance model, which predicts a storyline by treating the decision as a multiple-choice question-answering problem; a transformer decoder-based story generation model, which writes a story describing the storyline chosen by the guidance model; and a diffusion-based story visualization model, which generates a representative image of a scene to help readers follow the story flow. We evaluated the framework extensively with both automatic and human evaluations. Both show that our model outperforms the previous approach, which suggests that the storyline guidance model is effective at making proper plans.
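The three-stage flow described in the abstract can be sketched in a few lines of Python. This is only an illustration of how the stages hand data to each other, not the authors' implementation: every name here (`StoryStep`, `overlap_score`, `choose_storyline`, `generate_story`, `visualize`, `run_step`) is hypothetical, the word-overlap score is a toy stand-in for the transformer encoder, and the generation and visualization functions are placeholders for the decoder and diffusion models. The abstract does not say where the candidate storylines come from, so the sketch assumes they are supplied by the caller.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class StoryStep:
    storyline: str  # storyline picked by the guidance stage
    text: str       # story text produced by the generation stage
    image: str      # placeholder for the image from the visualization stage


def overlap_score(context: str, candidate: str) -> float:
    """Toy stand-in for an encoder's per-choice score: count of shared words."""
    context_words = set(context.lower().split())
    candidate_words = set(candidate.lower().split())
    return float(len(context_words & candidate_words))


def choose_storyline(
    context: str,
    candidates: List[str],
    score: Callable[[str, str], float] = overlap_score,
) -> str:
    """Multiple-choice selection: score every candidate and keep the best one."""
    return max(candidates, key=lambda c: score(context, c))


def generate_story(context: str, storyline: str) -> str:
    """Placeholder for the decoder: continue the context along the storyline."""
    return f"{context} Then, {storyline}."


def visualize(text: str) -> str:
    """Placeholder for the diffusion model: return a tag instead of pixels."""
    return f"<image: {text}>"


def run_step(context: str, candidates: List[str]) -> StoryStep:
    """Run guidance, generation, and visualization once, in that order."""
    storyline = choose_storyline(context, candidates)
    text = generate_story(context, storyline)
    return StoryStep(storyline=storyline, text=text, image=visualize(text))


if __name__ == "__main__":
    step = run_step(
        "The knight entered the dark forest.",
        ["the knight drew a sword", "a baker sold bread"],
    )
    print(step.storyline)
    print(step.text)
```

Keeping the three stages as separate functions mirrors the abstract's description of three independent models: any one stand-in could be swapped for a real model without touching the other two.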

Full Text