Singing Voice Detection in Opera Recordings: A Case Study on Robustness and Generalization

Michael Krause,Christof Weiß,Meinard Müller

doi:10.3390/electronics10101214

Michael Krause, Christof Weiß + Show 1 more

Open Access

https://doi.org/10.3390/electronics10101214

Copy DOI

Journal: Electronics	Publication Date: May 20, 2021
Citations: 10	License type: CC BY 4.0

Affiliation: International Audio Laboratories Erlangen

Abstract

Automatically detecting the presence of singing in music audio recordings is a central task within music information retrieval. While modern machine-learning systems produce high-quality results on this task, the reported experiments are usually limited to popular music and the trained systems often overfit to confounding factors. In this paper, we aim to gain a deeper understanding of such machine-learning methods and investigate their robustness in a challenging opera scenario. To this end, we compare two state-of-the-art methods for singing voice detection based on supervised learning: A traditional approach relying on hand-crafted features with a random forest classifier, as well as a deep-learning approach relying on convolutional neural networks. To evaluate these algorithms, we make use of a cross-version dataset comprising 16 recorded performances (versions) of Richard Wagner’s four-opera cycle Der Ring des Nibelungen. This scenario allows us to systematically investigate generalization to unseen versions, musical works, or both. In particular, we study the trained systems’ robustness depending on the acoustic and musical variety, as well as the overall size of the training dataset. Our experiments show that both systems can robustly detect singing voice in opera recordings even when trained on relatively small datasets with little variety.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Singing Voice Detection in Opera Recordings: A Case Study on Robustness and Generalization

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Music Information Retrieval using Deep Learning Techniques
Vignesh Subramanian
INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT | VOL. 08
Vignesh SubramanianVignesh Subramanian
12 May 2024
INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT | VOL. 08

Evolution and Emerging Trends in Musical Information Retrieval: A Comprehensive Review and Future Prospects
Yuxin Ding
Highlights in Science, Engineering and Technology | VOL. 85
Yuxin DingYuxin Ding
13 Mar 2024
Highlights in Science, Engineering and Technology | VOL. 85

Evaluation of deep learning models for quality control of MR spectra.
Sana Vaziri ... Duan Xu
Frontiers in neuroscience | VOL. 17
Sana Vaziri, et. al.Sana Vaziri ... Duan Xu
29 Aug 2023
Frontiers in neuroscience | VOL. 17

Deep convolutional neural networks with transfer learning for automated brain image classification
Taranjit Kaur ... Tapan Kumar Gandhi
Machine Vision and Applications | VOL. 31
Taranjit Kaur, et. al.Taranjit Kaur ... Tapan Kumar Gandhi
01 Mar 2020
Machine Vision and Applications | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Singing Voice Detection in Opera Recordings: A Case Study on Robustness and Generalization

Abstract

Talk to us

Similar Papers

More From: Electronics