A framework for generating large-scale microphone array data for machine learning

Adam Kujawski, Art J R Pelling, Simon Jekosch, Ennes Sarradj

doi:10.1007/s11042-023-16947-w

Open Access

https://doi.org/10.1007/s11042-023-16947-w

Abstract

The use of machine learning for localization of sound sources from microphone array data has increased rapidly in recent years. Newly developed methods are of great value for hearing aids, speech technologies, smart home systems, and engineering acoustics. Openly available data is crucial for the comparability and development of new data-driven methods. However, a review of the literature reveals a lack of openly available datasets, especially for large microphone arrays. This contribution introduces a framework for generating acoustic data for machine learning. It implements tools for the reproducible random sampling of virtual measurement scenarios. The framework allows computations on multiple machines, which significantly speeds up data generation. As an example, the framework is used to generate a development dataset for sound source characterization with a 64-channel array. A containerized environment running the simulation source code is openly available. The presented approach enables the user to calculate large datasets, to store only the features necessary for training, and to share the source code needed to reproduce datasets instead of sharing the data itself. This avoids the problem of distributing large datasets and enables reproducible research.
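
The idea of reproducible random sampling across multiple machines can be illustrated with a minimal sketch. It is not the framework's actual API: the function names, the scenario fields (`x`, `y`, `strength`), and the value ranges are hypothetical and chosen only for illustration. The sketch assumes each scenario gets its own random generator, seeded from a base seed and the scenario index, so that any worker computing any subset of indices produces the same samples.

```python
import random


def sample_scenario(base_seed, index, max_sources=3):
    """Draw one hypothetical virtual measurement scenario.

    The generator is seeded from (base_seed, index) only, so the result
    does not depend on which machine computes it or in which order.
    """
    rng = random.Random(f"{base_seed}-{index}")
    n_sources = rng.randint(1, max_sources)
    sources = [
        {
            "x": rng.uniform(-0.5, 0.5),         # hypothetical source position
            "y": rng.uniform(-0.5, 0.5),
            "strength": rng.uniform(0.1, 1.0),   # hypothetical source strength
        }
        for _ in range(n_sources)
    ]
    return {"index": index, "sources": sources}


def generate(base_seed, indices):
    """Generate the scenarios for the given indices."""
    return [sample_scenario(base_seed, i) for i in indices]


# One machine computes everything ...
full = generate(42, range(8))

# ... or two workers split the indices and the results are merged.
worker_a = generate(42, range(0, 8, 2))
worker_b = generate(42, range(1, 8, 2))
merged = sorted(worker_a + worker_b, key=lambda s: s["index"])

print(merged == full)  # True
```

Because only the base seed and the index range are needed to regenerate every sample, sharing the code and those two values is enough to reproduce the dataset, which mirrors the motivation given in the abstract.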

Journal: Multimedia Tools and Applications
Publication Date: Sep 25, 2023
Citations: 2
License type: CC BY 4.0

Similar Papers

Predictive performance of machine and statistical learning methods: Impact of data-generating processes on external validity in the "large N, small p" setting.
Peter C Austin ... Ewout W Steyerberg
Statistical Methods in Medical Research | VOL. 30
13 Apr 2021

Automatic 3D scanning surface generation for microphone array acoustic imaging
Mathew Legg ... Stuart Bradley
Applied Acoustics | VOL. 76
14 Sep 2013

Fast grid-free strength mapping of multiple sound sources from microphone array data using a Transformer architecture.
Adam Kujawski ... Ennes Sarradj
The Journal of the Acoustical Society of America | VOL. 152
01 Nov 2022

Orthogonal circular microphone array system and method for detecting three-dimensional direction of sound source using the same
Sun-Do June
The Journal of the Acoustical Society of America | VOL. 121
01 Jan 2007
