Soundgen: An open-source tool for synthesizing nonverbal vocalizations

Andrey Anikin

doi:10.3758/s13428-018-1095-7

Abstract

Voice synthesis is a useful method for investigating the communicative role of different acoustic features. Although many text-to-speech systems are available, researchers of human nonverbal vocalizations and bioacousticians may profit from a dedicated simple tool for synthesizing and manipulating natural-sounding vocalizations. Soundgen (https://CRAN.R-project.org/package=soundgen) is an open-source R package that synthesizes nonverbal vocalizations based on meaningful acoustic parameters, which can be specified from the command line or in an interactive app. This tool was validated by comparing the perceived emotion, valence, arousal, and authenticity of 60 recorded human nonverbal vocalizations (screams, moans, laughs, and so on) and their approximate synthetic reproductions. Each synthetic sound was created by manually specifying only a small number of high-level control parameters, such as syllable length and a few anchors for the intonation contour. Nevertheless, the valence and arousal ratings of synthetic sounds were similar to those of the original recordings, and the authenticity ratings were comparable, maintaining parity with the originals for less complex vocalizations. Manipulating the precise acoustic characteristics of synthetic sounds may shed light on the salient predictors of emotion in the human voice. More generally, soundgen may prove useful for any studies that require precise control over the acoustic features of nonspeech sounds, including research on animal vocalizations and auditory perception.

Highlights

Voice synthesis is a useful method for investigating the communicative role of different acoustic features.
Speech synthesis is a diverse and mature field (Schröder, 2009), but fewer options are available to researchers who wish to synthesize or modify human nonverbal vocalizations, such as laughs and screams, or sounds produced by nonhuman animals.
This is the context in which soundgen was developed as an open-source tool designed for the manual, fully controlled synthesis and manipulation of nonverbal vocalizations.

Summary

Introduction

Voice synthesis is a useful method for investigating the communicative role of different acoustic features. In soundgen, the amount of stochastic variation in the generated sound is regulated by the temperature parameter. When the goal is to generate a sound with high precision (e.g., when synthesizing multiple modifications of the same basic vocalization for perceptual testing), stochastic behavior is not desirable, and temperature should be set to a small positive value. Setting it to exactly zero is not recommended, because this also disables the addition of new formants above the user-specified ones.
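
The effect of a small versus a large temperature can be illustrated with a minimal conceptual sketch. This is not soundgen's implementation (soundgen is an R package), and the function `jitter_anchors`, the anchor values, and the multiplicative Gaussian noise model are all assumptions made purely for illustration. The sketch only shows the general idea that a temperature-like setting scales random deviations from user-specified values; it does not model the side effect on formants described above.

```python
import random


def jitter_anchors(anchors_hz, temperature, seed=None):
    """Return a copy of the anchors with random variation scaled by temperature.

    Hypothetical helper: each anchor is multiplied by (1 + noise), where the
    noise is Gaussian with standard deviation equal to `temperature`.
    """
    rng = random.Random(seed)
    return [f * (1.0 + rng.gauss(0.0, temperature)) for f in anchors_hz]


# Hypothetical intonation anchors in Hz (start, peak, end of a syllable).
anchors = [300.0, 450.0, 280.0]

for temp in (0.001, 0.1):
    # Generate several "renditions" of the same specification.
    variants = [jitter_anchors(anchors, temp, seed=s) for s in range(5)]
    peaks = [v[1] for v in variants]
    spread = max(peaks) - min(peaks)
    print(f"temperature={temp}: spread of the peak anchor = {spread:.2f} Hz")
```

With a small temperature, repeated renditions stay close to the specified anchors, which is the behavior wanted when preparing controlled stimuli for perceptual testing. With a larger value, each rendition differs noticeably.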



Journal: Behavior Research Methods
Publication Date: Jul 27, 2018
Citations: 73
License type: open-access

Similar Papers

Human Non-linguistic Vocal Repertoire: Call Types and Their Meaning
Andrey Anikin ... Tomas Persson
Journal of Nonverbal Behavior | VOL. 42 | 30 Sep 2017

Humans rely on the same rules to assess emotional valence and intensity in conspecific and dog vocalizations
Tamás Faragó ... Anna Kis
Biology Letters | VOL. 10 | 01 Jan 2014

What is the Melody of That Voice? Probing Unbiased Recognition Accuracy with the Montreal Affective Voices
Margarida Vasconcelos ... Ana P Soares
Journal of Nonverbal Behavior | VOL. 41 | 06 Apr 2017

A Moan of Pleasure Should Be Breathy: The Effect of Voice Quality on the Meaning of Human Nonverbal Vocalizations
Andrey Anikin
Phonetica | VOL. 77 | 21 Jan 2020
