Abstract

Although deep learning models are state of the art in audio classification, they fall short when applied in developmental robotics and human–robot interaction (HRI) settings. Their major drawback is the reliance on supervised training with large amounts of annotated data. In contrast, developmental learning strategies in HRI often deal with small-scale data acquired from HRI experiments and require the incremental addition of novel classes. Alternatively, shallow architectures that enable fast yet robust learning are provided by simple distance-metric-based learning and by neural architectures implementing the reservoir computing paradigm. Similarly, continual learning algorithms have received increasing attention in recent years, as they can combine stable perceptual feature extraction from pre-trained deep learning models with open-set classification. As our research centers on reenacting the incremental learning of audio cues, we conducted a study on environmental sound classification using the iCaRL and GDumb continual learning algorithms, comparing them with a classifier popular in this domain, kNN, and with an Echo State Network. We contrast our results with those of a VGGish network, which serves here as the performance upper bound, allowing us to quantify the performance differences and to discuss current issues with continual learning in the audio domain. Because little is known about using shallow models or continual learning in the audio domain, we forgo additional techniques such as data augmentation and build a simple experimental pipeline that is easy to reproduce. Although our selected algorithms partially fall short of the upper bound in performance, our evaluation on three environmental sound datasets shows promising performance of continual learning on a subset of the DCASE2019 challenge dataset and on the ESC-10 dataset. As we do not address benchmarking in this paper, our study provides a good foundation for further research and computational improvements on shallow and continual learning models for robotic applications in the audio domain.
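The abstract's core idea, combining a fixed pre-trained feature extractor with a shallow, incrementally extensible classifier, can be illustrated with a minimal sketch. The synthetic 128-dimensional vectors below stand in for embeddings such as those a pre-trained VGGish model would produce; the class means and scikit-learn kNN setup are illustrative assumptions, not the paper's actual pipeline.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical stand-in for pre-trained audio embeddings (e.g. VGGish-like
# 128-dimensional feature vectors, one per audio clip).
rng = np.random.default_rng(0)
n_classes, per_class, dim = 3, 20, 128
X = np.concatenate([
    rng.normal(loc=c * 3.0, scale=1.0, size=(per_class, dim))
    for c in range(n_classes)
])
y = np.repeat(np.arange(n_classes), per_class)

# Shallow distance-metric classifier trained on the fixed features.
knn = KNeighborsClassifier(n_neighbors=5).fit(X, y)
print("accuracy on seen classes:", knn.score(X, y))

# A novel class is added incrementally by refitting on the enlarged set --
# no gradient-based retraining of the feature extractor is needed.
X_new = rng.normal(loc=9.0, scale=1.0, size=(per_class, dim))
X_all = np.concatenate([X, X_new])
y_all = np.concatenate([y, np.full(per_class, n_classes)])
knn = KNeighborsClassifier(n_neighbors=5).fit(X_all, y_all)
print("accuracy after adding a class:", knn.score(X_all, y_all))
```

This cheap refit-on-new-data loop is what makes distance-based shallow learners attractive for small-scale HRI data, in contrast to deep models that would require full retraining.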
