Affective social anthropomorphic intelligent system

Md Adyelullahil Mamun,Hasnat Md Abdullah,Md Golam Rabiul Alam,Muhammad Mehedi Hassan,Md Zia Uddin

doi:10.1007/s11042-023-14597-6

Md Adyelullahil Mamun, Hasnat Md Abdullah + Show 3 more

Open Access

https://doi.org/10.1007/s11042-023-14597-6

Copy DOI

Abstract

Human conversational styles are measured by the sense of humor, personality, and tone of voice. These characteristics have become essential for conversational intelligent virtual assistants. However, most of the state-of-the-art intelligent virtual assistants (IVAs) are failed to interpret the affective semantics of human voices. This research proposes an anthropomorphic intelligent system that can hold a proper human-like conversation with emotion and personality. A voice style transfer method is also proposed to map the attributes of a specific emotion. Initially, the frequency domain data (Mel-Spectrogram) is created by converting the temporal audio wave data, which comprises discrete patterns for audio features such as notes, pitch, rhythm, and melody. A collateral CNN-Transformer-Encoder is used to predict seven different affective states from voice. The voice is also fed parallelly to the deep-speech, an RNN model that generates the text transcription from the spectrogram. Then the transcripted text is transferred to the multi-domain conversation agent using blended skill talk, transformer-based retrieve-and-generate generation strategy, and beam-search decoding, and an appropriate textual response is generated. The system learns an invertible mapping of data to a latent space that can be manipulated and generates a Mel-spectrogram frame based on previous Mel-spectrogram frames to voice synthesize and style transfer. Finally, the waveform is generated using WaveGlow from the spectrogram. The outcomes of the studies we conducted on individual models were auspicious. Furthermore, users who interacted with the system provided positive feedback, demonstrating the system’s effectiveness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Multimedia Tools and Applications	Publication Date: Mar 7, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Affective social anthropomorphic intelligent system

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Similar Papers

Investigating EFL learners’ humorous interactions with an intelligent personal assistant
Talip Gonulal
Interactive Learning Environments | VOL. 31
Talip GonulalTalip Gonulal
04 Sep 2021
Interactive Learning Environments | VOL. 31

A Mind in Intelligent Personal Assistants: An Empirical Study of Mind-Based Anthropomorphism, Fulfilled Motivations, and Exploratory Usage of Intelligent Personal Assistants
Cuicui Cao ... Haoxuan Xu
Frontiers in Psychology | VOL. 13
Cuicui Cao, et. al.Cuicui Cao ... Haoxuan Xu
29 Apr 2022
Frontiers in Psychology | VOL. 13

AI-Based Virtual Assistant Using Python: A Systematic Review
Patil Kavita Manojkumar ... Sakshi Shinde
International Journal for Research in Applied Science and Engineering Technology | VOL. 11
Patil Kavita Manojkumar, et. al.Patil Kavita Manojkumar ... Sakshi Shinde
31 Mar 2023
International Journal for Research in Applied Science and Engineering Technology | VOL. 11

PAIGE
Yilei Liang ... Nishanth Sastry
-
Yilei Liang, et. al.Yilei Liang ... Nishanth Sastry
27 Apr 2020
27 Apr 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Affective social anthropomorphic intelligent system

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications