Abstract

This study addresses Speech Emotion Recognition (SER) for human-computer interaction, focusing on the auditory attributes of speech such as tone, pitch, and rhythm. It introduces an approach that combines deep learning with LEAF (a learnable frontend for audio classification) and wav2vec 2.0 pre-trained on a large speech corpus, applied specifically to Korean voice samples. The aim is to show that these components can process complex vocal expressions and substantially improve emotion classification accuracy. By emphasizing auditory emotion cues over conventional visual and textual indicators, the work seeks to make machine interactions more intuitive and empathetic across applications such as healthcare and customer service. The results demonstrate the effectiveness of the transformer-based wav2vec 2.0 encoder, paired with the LEAF frontend, in capturing subtle emotional states expressed in speech. These findings suggest a promising direction for AI systems capable of nuanced emotion detection, supporting more natural, human-centric interaction between people and machines. Such capability is a prerequisite for empathetic AI that can integrate into daily life and respond to human emotions in a manner that approaches human understanding.
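The pipeline the abstract describes can be pictured as a pre-trained speech encoder feeding an utterance-level emotion classifier. The following is a minimal sketch under stated assumptions, not the authors' implementation: it shows only the wav2vec 2.0 branch via the Hugging Face transformers API, and the checkpoint name, the four-way emotion label set, the mean pooling, and the classification head are all illustrative choices (the LEAF frontend and any Korean-specific checkpoint are omitted).

```python
# Illustrative sketch of a wav2vec 2.0 based emotion classifier.
# NOT the paper's implementation: backbone checkpoint, label set,
# pooling, and head are assumptions; the LEAF branch is omitted.
import torch
import torch.nn as nn
from transformers import Wav2Vec2Model, Wav2Vec2FeatureExtractor

EMOTIONS = ["angry", "happy", "neutral", "sad"]  # assumed label set

class Wav2Vec2EmotionClassifier(nn.Module):
    def __init__(self, backbone: str = "facebook/wav2vec2-base",
                 n_classes: int = len(EMOTIONS)):
        super().__init__()
        # Pre-trained transformer encoder over raw 16 kHz waveforms
        self.encoder = Wav2Vec2Model.from_pretrained(backbone)
        hidden = self.encoder.config.hidden_size
        # Small classification head on top of pooled features
        self.head = nn.Sequential(
            nn.Linear(hidden, 256), nn.ReLU(), nn.Dropout(0.1),
            nn.Linear(256, n_classes),
        )

    def forward(self, input_values: torch.Tensor) -> torch.Tensor:
        # (batch, time) waveform -> (batch, frames, hidden) contextual features
        feats = self.encoder(input_values).last_hidden_state
        pooled = feats.mean(dim=1)  # simple mean pooling over time frames
        return self.head(pooled)    # (batch, n_classes) emotion logits

if __name__ == "__main__":
    extractor = Wav2Vec2FeatureExtractor.from_pretrained("facebook/wav2vec2-base")
    wave = torch.randn(16000)  # 1 s of dummy audio standing in for a speech sample
    inputs = extractor(wave.numpy(), sampling_rate=16000, return_tensors="pt")
    model = Wav2Vec2EmotionClassifier()
    logits = model(inputs.input_values)
    print(logits.shape)  # torch.Size([1, 4])
```

In a setup closer to the paper's, the LEAF frontend would replace or complement the encoder's fixed convolutional feature extractor with learnable filters, and the backbone would be fine-tuned on labeled Korean emotional speech.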
