Non-speech voice for sonic interaction: a catalogue

Alan Del Piccolo,Davide Rocchesso

doi:10.1007/s12193-016-0227-6

Alan Del Piccolo, Davide Rocchesso

Open Access

https://doi.org/10.1007/s12193-016-0227-6

Copy DOI

Abstract

This paper surveys the uses of non-speech voice as an interaction modality within sonic applications. Three main contexts of use have been identified: sound retrieval, sound synthesis and control, and sound design. An overview of different choices and techniques regarding the style of interaction, the selection of vocal features and their mapping to sound features or controls is here displayed. A comprehensive collection of examples instantiates the use of non-speech voice in actual tools for sonic interaction. It is pointed out that while voice-based techniques are already being used proficiently in sound retrieval and sound synthesis, their use in sound design is still at an exploratory phase. An example of creation of a voice-driven sound design tool is here illustrated.

Highlights

The voice is the primary communication channel among humans
Such definition represents an issue in the first place: A semi-automatic classification, performed for instance by machine learning algorithms relying on generic audio features, is likely to produce different categories than a classification built on perceptual basis [4]
In a sound synthesizer controlled via singing voice [31], descriptors are extracted from the vocal signal via Shorttime Fourier Transform (STFT) and classified in four groups related to their use for control: Excitation (F0 and energy), Vocal Tract, Voice Quality and Context

Summary

Introduction

The voice is the primary communication channel among humans. While speech is considered to be the most important form of voice communication, non-speech voice as well is a means to convey a wide array of information. Mimicking and imitating sounds are typical actions that are intuitively performed by means of non-speech voice They require no production or recollection of verbal information and provided that adequate techniques to match the voice to the sounds are made available, vocal imitation is a potentially effective and immediate retrieval strategy. While a structured use of non-speech voice in such context is still missing, partly due to the lack of an engineered approach to the discipline, past and present research focus on exploiting non-speech voice to perform fast prototyping in sonic interaction and to facilitate the communication of audio concepts.

Motivations and related work

Sound retrieval

Category-dependent feature selection

Vocal query strategies

Matching strategies

Examples

Query by whistling

Query by beatboxing

Generic sound retrieval from voice imitation queries

Sound synthesis and control

Vocal features

Roles of the voice

Mapping strategies

Extending voice-driven synthesis to audio mosaicing

Auracle

Voice-controlled plucked bass guitar

Singing-driven interfaces for sound synthesizers

Making music through real-time voice timbre analysis

A voice interface for sound generators

Billaboop

Pitch-based commercial applications

The singing tree

4.4.10 Wahwactor

4.4.12 Synthassist

4.4.13 Intuitive sound design using vocal mimicking

Vocalization for sound design

Vocal sketching: a prototype tool for designing multimodal interaction

Using vocal sketching for designing sonic interactions

VOGST project

VocalSketch: vocally imitating audio concepts

SkAT-VG project

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal on Multimodal User Interfaces	Publication Date: Jul 22, 2016
Citations: 1	License type: open-access

R Discovery Prime

R Discovery Prime

Non-speech voice for sonic interaction: a catalogue

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal on Multimodal User Interfaces

Lead the way for us

Similar Papers

Nonlinear Dynamics in Physical Models: Simple Feedback-Loop Systems and Properties
Xavier Rodet ... Christophe Vergez
Computer Music Journal | VOL. 23
Xavier Rodet, et. al.Xavier Rodet ... Christophe Vergez
01 Sep 1999
Computer Music Journal | VOL. 23

Intelligent and perceptual-based approach to musical instruments sound design
Brahim Hamadicharef ... Emmanuel C Ifeachor
Expert Systems With Applications | VOL. 39
Brahim Hamadicharef, et. al.Brahim Hamadicharef ... Emmanuel C Ifeachor
29 Dec 2011
Expert Systems With Applications | VOL. 39

A prospect of the future of automotive sound quality development
Koo Tae Kang
The Journal of the Acoustical Society of America | VOL. 131
Koo Tae KangKoo Tae Kang
01 Apr 2012
The Journal of the Acoustical Society of America | VOL. 131

A Small Knowledge-Based System for Selecting Interaction Styles
Jean Vanderdonckt
-
Jean VanderdoncktJean Vanderdonckt
01 Jan 2001
01 Jan 2001

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Non-speech voice for sonic interaction: a catalogue

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal on Multimodal User Interfaces