Deepsing: Generating sentiment-aware visual stories using cross-modal music translation

Nikolaos Passalis, Stavros Doropoulos

doi:10.1016/j.eswa.2020.114059

Abstract

In this paper, we propose a deep learning method for attribute-based music-to-image translation. The proposed method is applied to synthesize visual stories according to the sentiment expressed by songs. The generated images aim to induce in viewers the same feelings as the original song, reinforcing the primary aim of music, i.e., communicating feelings. Music-to-image translation poses unique challenges, mainly due to the unstable mapping between the different modalities involved. We employ a trainable cross-modal translation method to overcome this limitation, leading to what is, to the best of our knowledge, the first deep learning method for generating sentiment-aware visual stories. The proposed method was evaluated both quantitatively and qualitatively using a collection of songs from 10 different genres, demonstrating that it is indeed possible to generate visual content that matches the sentiment expressed in songs. A user study was also conducted, further validating the ability of the proposed method to provide sentiment-enriched visualizations.

Journal: Expert Systems With Applications
Publication Date: Sep 25, 2020
Citations: 4
