Abstract

Multimodal robotic musical performance has attracted significant interest for its innovative potential. Conventional robots are limited in their ability to understand emotion and artistic expression in musical performance. This paper therefore explores multimodal robots that integrate visual and auditory perception to improve the quality and expressiveness of musical performance. Our approach combines GRU (Gated Recurrent Unit) and GoogLeNet models for sentiment analysis. The GRU model processes audio data, capturing the temporal dynamics of musical elements, including long-term dependencies, to extract emotional information. The GoogLeNet model handles image processing, extracting complex visual details and aesthetic features. Fusing the two deepens the understanding of musical and visual elements, with the aim of producing more emotionally resonant and interactive robot performances. Experimental results demonstrate the effectiveness of our approach, showing significant improvements in musical performance by multimodal robots: robots equipped with our method deliver high-quality, artistic performances that effectively evoke emotional engagement from the audience. Merging audio-visual perception in music performance enriches the art form and enables richer human-machine interaction. This research demonstrates the potential of multimodal robots in music performance, promotes the integration of technology and art, and opens new possibilities in the performing arts and human-robot interaction. Our findings provide valuable insights for the development of multimodal robots in the performing arts sector.
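To make the described architecture concrete, the sketch below shows one plausible way to fuse a GRU audio branch with a GoogLeNet visual branch for emotion classification, assuming PyTorch and torchvision. This is not the authors' released implementation; the class name `AudioVisualEmotionNet`, the feature dimensions, the number of emotion classes, and the use of MFCC-style audio sequences are illustrative assumptions.

```python
# Minimal sketch of GRU + GoogLeNet audio-visual fusion (assumptions noted above).
import torch
import torch.nn as nn
from torchvision import models


class AudioVisualEmotionNet(nn.Module):
    """Fuses GRU audio features with GoogLeNet visual features for sentiment analysis."""

    def __init__(self, num_emotions: int = 4, audio_feat_dim: int = 40):
        super().__init__()
        # GoogLeNet backbone for visual/aesthetic features; the classifier head is
        # replaced so the 1024-d pooled features are exposed.
        self.vision = models.googlenet(weights=None, aux_logits=False)
        self.vision.fc = nn.Identity()
        # GRU over per-frame audio features (e.g. MFCCs) to capture temporal
        # dynamics and long-term dependencies of the music.
        self.audio_gru = nn.GRU(input_size=audio_feat_dim, hidden_size=128,
                                num_layers=2, batch_first=True)
        # Fusion head: concatenated visual (1024-d) + audio (128-d) embeddings.
        self.classifier = nn.Linear(1024 + 128, num_emotions)

    def forward(self, frames: torch.Tensor, audio_seq: torch.Tensor) -> torch.Tensor:
        # frames:    (batch, 3, 224, 224) RGB stage images
        # audio_seq: (batch, time_steps, audio_feat_dim) audio feature sequence
        visual_emb = self.vision(frames)                   # (batch, 1024)
        _, hidden = self.audio_gru(audio_seq)              # hidden: (layers, batch, 128)
        audio_emb = hidden[-1]                             # last layer's final state
        fused = torch.cat([visual_emb, audio_emb], dim=1)  # (batch, 1152)
        return self.classifier(fused)                      # emotion logits


# Example: a batch of 2 stage frames paired with 100 audio time steps each.
model = AudioVisualEmotionNet()
logits = model(torch.randn(2, 3, 224, 224), torch.randn(2, 100, 40))
print(logits.shape)  # torch.Size([2, 4])
```

Late fusion by concatenating the two embeddings is only one design choice; attention-based or weighted fusion would follow the same overall structure.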
