A General Framework for Visualization of Sound Collections in Musical Interfaces

Gerard Roma, Anna Xambó, Pierre Alexandre Tremblay, Owen Green

doi:10.3390/app112411926


Open Access

https://doi.org/10.3390/app112411926


Abstract

While audio data play an increasingly central role in computer-based music production, interaction with large sound collections in most available music creation and production environments is very often still limited to scrolling long lists of file names. This paper describes a general framework for devising interactive applications based on the content-based visualization of sound collections. The proposed framework allows for a modular combination of different techniques for sound segmentation, analysis, and dimensionality reduction, using the reduced feature space for interactive applications. We analyze several prototypes presented in the literature and describe their limitations. We propose a more general framework that can be used flexibly to devise music creation interfaces. The proposed approach includes several novel contributions with respect to previously used pipelines, such as using unsupervised feature learning, content-based sound icons, and control of the output space layout. We present an implementation of the framework using the SuperCollider computer music language, and three example prototypes demonstrating its use for data-driven music interfaces. Our results demonstrate the potential of unsupervised machine learning and visualization for creative applications in computer music.

Highlights

Computers, in their many incarnations, are nowadays ubiquitous at different points in most music creation and production workflows.
The only exception is continuity, where all four algorithms score high. This shows that t-SNE and Uniform Manifold Approximation and Projection (UMAP) produce more clustered plots, which include only relevant neighbors for each point, whereas all four algorithms tend to preserve all neighbors in the feature space as neighbors in the reduced space.
This opens up many possibilities for novel interfaces, by making use of other features available in the SuperCollider environment.
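
The second highlight contrasts two properties of a reduced layout: whether the neighbors shown for each point are relevant ones, and whether neighbors in the feature space stay neighbors in the reduced space (continuity). The following is a minimal sketch of how such scores are commonly computed, assuming the standard rank-based definitions of trustworthiness and continuity; the page names only continuity, so the trustworthiness counterpart, the function names, and the toy data are illustrative assumptions rather than the authors' evaluation code.

```python
from math import dist


def rank_matrix(points):
    """ranks[i][j] is the rank of j among the neighbors of i (1 = nearest, 0 = i itself)."""
    n = len(points)
    ranks = []
    for i in range(n):
        # Sort so that i comes first, then the others by distance (ties broken by index).
        order = sorted(range(n), key=lambda j: (j != i, dist(points[i], points[j]), j))
        row = [0] * n
        for r, j in enumerate(order):
            row[j] = r
        ranks.append(row)
    return ranks


def _rank_score(rank_ref, rank_other, k):
    """Penalize points that are k-nearest neighbors in `other` but not in `ref`."""
    n = len(rank_ref)
    penalty = 0
    for i in range(n):
        for j in range(n):
            if j != i and rank_other[i][j] <= k and rank_ref[i][j] > k:
                penalty += rank_ref[i][j] - k
    # Normalization of the standard definition, valid for k < n / 2.
    return 1.0 - 2.0 * penalty / (n * k * (2 * n - 3 * k - 1))


def trustworthiness(high, low, k):
    """High when neighbors in the reduced space are also neighbors in the feature space."""
    return _rank_score(rank_matrix(high), rank_matrix(low), k)


def continuity(high, low, k):
    """High when neighbors in the feature space remain neighbors in the reduced space."""
    return _rank_score(rank_matrix(low), rank_matrix(high), k)


# Hypothetical toy data: two rows of points, "reduced" by dropping the second coordinate.
high = [(0, 0), (1, 0), (2, 0), (0, 10), (1, 10), (2, 10)]
low = [(x,) for x, _ in high]
print(trustworthiness(high, low, k=1), continuity(high, low, k=1))
```

Both scores lie between 0 and 1, and a layout that keeps every neighbor ranking unchanged scores 1 on both. In this toy projection the two rows collapse onto each other, so points that were far apart become neighbors in the reduced space, which is what trustworthiness penalizes.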

Summary

Introduction

Computers, in their many incarnations, are nowadays ubiquitous at different points in most music creation and production workflows. One reason for this prevalence is the convenience of digital storage: compared with analog storage media such as magnetic tape, digital storage makes it much easier to access and manipulate large quantities of audio. Many software samplers feature skeuomorphic user interfaces that emulate, in surprising detail, the interfaces of early hardware samplers and sampling synthesizers. Computer music languages such as Max, Pure Data or SuperCollider [1,2], mostly based on the Music N paradigm [3] [...] Creative stages of music production are typically driven by musical intuitions and auditory cues. In this context, dealing with labels and file systems can be disruptive, which hinders the use of large collections of sounds.



Journal: Applied Sciences
Publication Date: Dec 15, 2021
Citations: 3
License type: CC BY 4.0


