Abstract

Research in creative robotics continues to expand across creative domains, including art, music, and language. Creative robots are primarily designed to be task-specific, with limited research into the implications of their design outside their core task. In the case of a musical robot, this includes the moments when a human sees and interacts with the robot before and after a performance, as well as between pieces. These non-musical interaction tasks, such as the robot's presence during musical equipment set-up, play a key role in human perception of the robot, yet have received only limited attention. In this paper, we describe a new audio system using emotional musical prosody, designed to match the creative process of a musical robot for use before, between, and after musical performances. Our generation system relies on a custom dataset created for musical prosody. The system is designed first and foremost to operate in real time, allowing rapid generation and dialogue exchange between human and robot. To this end, it combines symbolic deep learning, through a Conditional Convolutional Variational Autoencoder, with an emotion-tagged audio sampler. We then compare this approach to a state-of-the-art text-to-speech system on our robotic platform, Shimon the marimba player. We conducted a between-groups study with 100 participants, each watching a musician interact with Shimon for 30 s. We were able to increase user ratings for the key creativity metrics, novelty and coherence, while maintaining ratings for expressivity across each implementation. Our results also indicate that by communicating in a form that relates to the robot's core functionality, we can raise likeability and perceived intelligence without altering animacy or anthropomorphism. These findings show how the perception of a robot can vary based on the interactions surrounding a performance, such as initial meetings and the spaces between pieces, in addition to its core creative algorithms.
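To make the architecture concrete, the sketch below shows one way a conditional convolutional variational autoencoder could pair a symbolic prosody sequence with an emotion label. The layer sizes, the single-channel contour representation, and the one-hot emotion conditioning are illustrative assumptions for this sketch, not details drawn from the paper's system:

```python
import torch
import torch.nn as nn

class ConditionalConvVAE(nn.Module):
    """Minimal sketch: encode a symbolic prosody contour plus an emotion
    label into a latent code, and decode a new contour conditioned on
    the same emotion. All sizes are illustrative."""

    def __init__(self, seq_len=64, n_emotions=4, latent_dim=16):
        super().__init__()
        self.seq_len = seq_len
        # Encoder: 1-D convolutions over the sequence, with the emotion
        # label broadcast along time and appended as extra channels.
        self.encoder = nn.Sequential(
            nn.Conv1d(1 + n_emotions, 32, kernel_size=4, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv1d(32, 64, kernel_size=4, stride=2, padding=1),
            nn.ReLU(),
            nn.Flatten(),
        )
        enc_out = 64 * (seq_len // 4)
        self.fc_mu = nn.Linear(enc_out, latent_dim)
        self.fc_logvar = nn.Linear(enc_out, latent_dim)
        # Decoder: latent code plus emotion label back up to a sequence.
        self.fc_dec = nn.Linear(latent_dim + n_emotions, enc_out)
        self.decoder = nn.Sequential(
            nn.ConvTranspose1d(64, 32, kernel_size=4, stride=2, padding=1),
            nn.ReLU(),
            nn.ConvTranspose1d(32, 1, kernel_size=4, stride=2, padding=1),
        )

    def reparameterize(self, mu, logvar):
        std = torch.exp(0.5 * logvar)
        return mu + std * torch.randn_like(std)

    def forward(self, x, emotion):
        # x: (batch, 1, seq_len); emotion: (batch, n_emotions) one-hot.
        cond = emotion.unsqueeze(-1).expand(-1, -1, self.seq_len)
        h = self.encoder(torch.cat([x, cond], dim=1))
        mu, logvar = self.fc_mu(h), self.fc_logvar(h)
        z = self.reparameterize(mu, logvar)
        h = self.fc_dec(torch.cat([z, emotion], dim=1))
        h = h.view(-1, 64, self.seq_len // 4)
        return self.decoder(h), mu, logvar

    @torch.no_grad()
    def generate(self, emotion):
        # Real-time use: sample a latent vector and decode a phrase for
        # the requested emotion; no encoder pass is needed.
        z = torch.randn(emotion.size(0), self.fc_mu.out_features)
        h = self.fc_dec(torch.cat([z, emotion], dim=1))
        h = h.view(-1, 64, self.seq_len // 4)
        return self.decoder(h)
```

In real-time use only the decoder path is exercised: a latent vector is sampled and decoded into a symbolic phrase for the requested emotion, and an output of this kind would then be rendered to audio by the emotion-tagged sampler.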

Highlights

  • There is a growing body of work focusing on robots collaborating with humans on creative tasks such as art, language, and music

  • We present a model for generating emotional musical prosody in embedded platforms in real time for creative robots

  • The paper presents a new generative system for emotional musical prosody that is implemented in Shimon, a creative robot


Introduction

There is a growing body of work focusing on robots collaborating with humans on creative tasks such as art, language, and music. Endowing a creative robot with speech capabilities that do not reflect its creative abilities risks distancing collaborators and misrepresenting artistic opportunities. One under-explored approach for an artificial agent to convey emotion is through non-linguistic musical prosody (Savery et al., 2020a). We propose that such an approach could be effective in human-robot collaboration on creative tasks, where emotional expression is at the core of the activity, and where subtle background conveyance of mood can enhance, rather than distract from, the creative activity.
