Automatic Segmentation of Ethnomusicological Field Recordings

Matija Marolt,Matevž Pesek,Ciril Bohak,Alenka Kavčič

doi:10.3390/app9030439

Abstract

The article presents a method for segmentation of ethnomusicological field recordings. Field recordings are integral documents of folk music performances captured in the field, and typically contain performances, intertwined with interviews and commentaries. As these are live recordings, captured in non-ideal conditions, they usually contain significant background noise. We present a segmentation method that segments field recordings into individual units labelled as speech, solo singing, choir singing, and instrumentals. Classification is based on convolutional deep networks, and is augmented with a probabilistic approach for segmentation. We describe the dataset gathered for the task and the tools developed for gathering the reference annotations. We outline a deep network architecture based on residual modules for labelling short audio segments and compare it to the more standard feature based approaches, where an improvement in classification accuracy of over 10% was obtained. We also present the SeFiRe segmentation tool that incorporates the presented segmentation method.

Highlights

Ethnomusicological field recordings are recordings gathered in the field, capturing folk music performances, usually intertwined with interviews with performers or commentaries by the folklorist
While there is a large number of published works dedicated to speech/music classification, they mostly deal with broadcast recordings, where their per-frame classification accuracies reach over 90%
We presented a method for segmentation of field recordings into individual units

Summary

Introduction

Ethnomusicological field recordings are recordings gathered in the field, capturing folk music performances, usually intertwined with interviews with performers (informants) or commentaries by the folklorist. As their role is to document the legacy of folk musicians in their actual environment, the recordings are usually taken in non-ideal spaces (e.g., a performer’s home) and may contain environmental noises (e.g., people entering and leaving the room, background talking) or interruptions. One of the first tasks that ethnomusicologists (or algorithms) face when studying a field recording, is its segmentation into smaller coherent units, such as units containing speech or individual folk song performances. Segmentation approaches relied on hand-crafted features, tuned to discriminate between speech and music, which were the two categories most works aimed to

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Jan 28, 2019
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Automatic Segmentation of Ethnomusicological Field Recordings

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Choir versus Solo Singing: Effects on Mood, and Salivary Oxytocin and Cortisol Concentrations.
T Moritz Schladt ... Brigitte M Kudielka
Frontiers in Human Neuroscience | VOL. 11
T Moritz Schladt, et. al.T Moritz Schladt ... Brigitte M Kudielka
14 Sep 2017
Frontiers in Human Neuroscience | VOL. 11

Probabilistic Segmentation and Labeling of Ethnomusicological Field Recordings.

-

26 Oct 2009
26 Oct 2009

Knowledge graph learning algorithm based on deep convolutional networks
Yuzhong Zhou ... Jiahao Shi
Intelligent Systems with Applications | VOL. 22
Yuzhong Zhou, et. al.Yuzhong Zhou ... Jiahao Shi
07 May 2024
Intelligent Systems with Applications | VOL. 22

Deep Convolutional Network Based on Interleaved Fusion Group
Enhui Lv ... C L Philip Chen
IEEE Transactions on Cognitive and Developmental Systems | VOL. 13
Enhui Lv, et. al.Enhui Lv ... C L Philip Chen
18 Feb 2020
IEEE Transactions on Cognitive and Developmental Systems | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic Segmentation of Ethnomusicological Field Recordings

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences