An Empathy Evaluation System Using Spectrogram Image Features of Audio.

Jing Zhang,Ayoung Cho,Mincheol Whang,Xingyu Wen

doi:10.3390/s21217111

Abstract

Watching videos online has become part of a relaxed lifestyle. The music in videos has a sensitive influence on human emotions, perception, and imaginations, which can make people feel relaxed or sad, and so on. Therefore, it is particularly important for people who make advertising videos to understand the relationship between the physical elements of music and empathy characteristics. The purpose of this paper is to analyze the music features in an advertising video and extract the music features that make people empathize. This paper combines both methods of the power spectrum of MFCC and image RGB analysis to find the audio feature vector. In spectral analysis, the eigenvectors obtained in the analysis process range from blue (low range) to green (medium range) to red (high range). The machine learning random forest classifier is used to classify the data obtained by machine learning, and the trained model is used to monitor the development of an advertisement empathy system in real time. The result is that the optimal model is obtained with the training accuracy result of 99.173% and a test accuracy of 86.171%, which can be deemed as correct by comparing the three models of audio feature value analysis. The contribution of this study can be summarized as follows: (1) the low-frequency and high-amplitude audio in the video is more likely to resonate than the high-frequency and high-amplitude audio; (2) it is found that frequency and audio amplitude are important attributes for describing waveforms by observing the characteristics of the machine learning classifier; (3) a new audio extraction method is proposed to induce human empathy. That is, the feature value extracted by the method of spectrogram image features of audio has the most ability to arouse human empathy.

Highlights

Empathy, a phenomenon of characterizing our understanding and sharing of others’ feelings, is vital to our everyday communication and survival in a social environment [1]
The assumption we have made is that the extracted audio feature can distinguish between empathetic and non-empathetic videos
The mel-frequency cepstral coefficients (MFCCs) of a signal are a small set of features that concisely describe the overall shape of a spectral envelope

Summary

Introduction

A phenomenon of characterizing our understanding and sharing of others’ feelings, is vital to our everyday communication and survival in a social environment [1]. Empathy evaluation has many methods but can be mainly classified into three categories: subjective size evaluation, image processing, and bio-signals [2]. A four-camera vision system was established to sample a target from different perspectives. In this system, a global calibration technique was deployed to correlate each individual system. The subjective evaluation method usually uses a questionnaire but has limitations due to subjective ambiguity and individual differences. In order to compensate for its limitations, researchers use objective methods to evaluate empathy

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors	Publication Date: Oct 26, 2021
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An Empathy Evaluation System Using Spectrogram Image Features of Audio.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors

Lead the way for us

Similar Papers

Radiomics-Based Prediction of Long-Term Treatment Response of Vestibular Schwannomas Following Stereotactic Radiosurgery.
Patrick P. J. H. Langenhuizen ... Sieger Leenstra
Otology & Neurotology | VOL. 41
Patrick P. J. H. Langenhuizen, et. al.Patrick P. J. H. Langenhuizen ... Sieger Leenstra
01 Jan 2020
Otology & Neurotology | VOL. 41

Sound Identification Method for Gas and Coal Dust Explosions Based on MLP.
Xingchen Yu ... Xiaowei Li
Entropy (Basel, Switzerland) | VOL. 25
Xingchen Yu, et. al.Xingchen Yu ... Xiaowei Li
09 Aug 2023
Entropy (Basel, Switzerland) | VOL. 25

Human emotion recognition from videos using spatio-temporal and audio features
Munaf Rashid ... S A R Abu-Bakar
The Visual Computer | VOL. 29
Munaf Rashid, et. al.Munaf Rashid ... S A R Abu-Bakar
07 Dec 2012
The Visual Computer | VOL. 29

Machine Learning based Comparison of Different Emotional Dimensional Models for Tamil Cine Music
Pranav V V ... Supriya P
-
Pranav V V, et. al.Pranav V V ... Supriya P
21 Sep 2022
21 Sep 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Empathy Evaluation System Using Spectrogram Image Features of Audio.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors