Audiogmenter: a MATLAB toolbox for audio data augmentation

Gianluca Maguolo,Ludovico Bonan,Michelangelo Paci,Loris Nanni

doi:10.1108/aci-03-2021-0064

Gianluca Maguolo, Ludovico Bonan + Show 2 more

Open Access

https://doi.org/10.1108/aci-03-2021-0064

Copy DOI

Abstract

Purpose Create and share a MATLAB library that performs data augmentation algorithms for audio data. This study aims to help machine learning researchers to improve their models using the algorithms proposed by the authors. Design/methodology/approach The authors structured our library into methods to augment raw audio data and spectrograms. In the paper, the authors describe the structure of the library and give a brief explanation of how every function works. The authors then perform experiments to show that the library is effective. Findings The authors prove that the library is efficient using a competitive dataset. The authors try multiple data augmentation approaches proposed by them and show that they improve the performance. Originality/value A MATLAB library specifically designed for data augmentation was not available before. The authors are the first to provide an efficient and parallel implementation of a large number of algorithms.

Highlights

Deep neural networks achieved state of the art performances in many artificial intelligence fields, such as image classification [1], object detection [2] and audio classification [3]
X 1⁄4 fx1;1; . . . xn1;1; x1;2; . . . xn2;2; . . . ; x1;M ; . . . xnM ;Mg, where xi;j represents a generic audio sample i from the class j, we propose to augment xi;j with techniques working on raw audio signals and to augment the spectrogram Sðxi;jÞ produced by the same raw audio signals
We used the function sgram included in the large time-frequency analysis toolbox (LTFAT) [21] to convert raw audios into spectrograms

Summary

Introduction

Deep neural networks achieved state of the art performances in many artificial intelligence fields, such as image classification [1], object detection [2] and audio classification [3] They usually need a very large amount of labeled data to obtain good results and these data might not be available due to high labeling costs or due to the scarcity of the samples. Data augmentation is a powerful tool to improve the performance of neural networks It consists in modifying the original samples to create new ones, without changing their labels [4]. In case of limited memory availability, one CNN can be trained with the H AugSA spectrograms, another with the K AugSS spectrograms and the scores can be combined by a fusion rule

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Computing and Informatics	Publication Date: Sep 22, 2021
Citations: 5	License type: cc-by

R Discovery Prime

R Discovery Prime

Audiogmenter: a MATLAB toolbox for audio data augmentation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Computing and Informatics

Lead the way for us

Similar Papers

EEG data augmentation: towards class imbalance problem in sleep staging tasks
Jiahao Fan ... Xinyu Jiang
Journal of Neural Engineering | VOL. 17
Jiahao Fan, et. al.Jiahao Fan ... Xinyu Jiang
01 Oct 2020
Journal of Neural Engineering | VOL. 17

An Evolutionary-based Generative Approach for Audio Data Augmentation
Silvan Mertes ... Alice Baird
-
Silvan Mertes, et. al.Silvan Mertes ... Alice Baird
21 Sep 2020
21 Sep 2020

A spectral analytic comparison of trace-class data augmentation algorithms and their sandwich variants
Kshitij Khare ... James P Hobert
The Annals of Statistics | VOL. 39
Kshitij Khare, et. al.Kshitij Khare ... James P Hobert
01 Oct 2011
The Annals of Statistics | VOL. 39

A comparison theorem for data augmentation algorithms with applications
Hee Min Choi ... James P Hobert
Electronic Journal of Statistics | VOL. 10
Hee Min Choi, et. al.Hee Min Choi ... James P Hobert
01 Jan 2015
Electronic Journal of Statistics | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Audiogmenter: a MATLAB toolbox for audio data augmentation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Computing and Informatics