Sound to expression: Using emotional sound to guide facial expression editing

Wenjin Liu,Shudong Zhang,Lijuan Zhou,Ning Luo,Qian Chen

doi:10.1016/j.jksuci.2024.101998

Wenjin Liu, Shudong Zhang + Show 3 more

Open Access

https://doi.org/10.1016/j.jksuci.2024.101998

Copy DOI

Abstract

Recently, image generation technology has demonstrated surprising effects. However, precisely recognizing the emotion in sound to accurately express it on the face of a designated person is a huge challenge. To address this challenge, a new framework, Sound to Expression (S2E), which can use the emotion in sound to guide facial expression image generation, is proposed. A speech dataset for emotion recognition is constructed. S2E can edit facial expressions with different emotions in sounds for different people. S2E consists of Continuous Wavelet Transform (CWT), YOLOv3, ChatGPT-3, and facial expression diffusion editing model (FEDEM). CWT is utilized to extract emotional features from different sounds. YOLOv3 is employed to identify the emotion categories. The emotion category and a specific person's name are input into ChatGPT-3 to randomly generate a description of the person and emotion. The description is input into FEDEM to generate a facial expression image. To generate more accurate images and address emotional semantic deviation, a new facial detail emotional preservation loss is proposed. The experimental results show that S2E can accurately recognize the emotion in the voice and use this emotion to guide the editing of the facial expression for the specified person to generate more accurate images.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Sound to expression: Using emotional sound to guide facial expression editing

Abstract

Talk to us

Similar Papers

More From: Journal of King Saud University - Computer and Information Sciences

Lead the way for us

Journal: Journal of King Saud University - Computer and Information Sciences	Publication Date: Feb 28, 2024
License type: cc-by-nc-nd

Similar Papers

Enhanced deep learning algorithm development to detect pain intensity from facial expression images
Ghazal Bargshady ... Hua Wang
Expert Systems with Applications | VOL. 149
Ghazal Bargshady, et. al.Ghazal Bargshady ... Hua Wang
16 Feb 2020
Expert Systems with Applications | VOL. 149

Facial Expression Guided Diagnosis of Parkinson's Disease via High-Quality Data Augmentation
Wei Huang ... Yiu-Ming Cheung
IEEE Transactions on Multimedia | VOL. 25
Wei Huang, et. al.Wei Huang ... Yiu-Ming Cheung
01 Jan 2023
IEEE Transactions on Multimedia | VOL. 25

Production of facial expressions using facial feature positioning and deformation
Jia-Shing Sheu ... Ying-Ming Wu
-
Jia-Shing Sheu, et. al. Jia-Shing Sheu ... Ying-Ming Wu
01 May 2012
01 May 2012

Design of Facial Expression Recognition Algorithm Based on CNN Model
Yue Luo ... Jiaxin Wu
-
Yue Luo, et. al.Yue Luo ... Jiaxin Wu
29 Jan 2023
29 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sound to expression: Using emotional sound to guide facial expression editing

Abstract

Talk to us

Similar Papers

More From: Journal of King Saud University - Computer and Information Sciences