A dataset for multimodal music information retrieval of Sotho-Tswana musical videos

Osondu Oguike, Mpho Primus

doi:10.1016/j.dib.2024.110672

Abstract

Diverse traditional machine learning and deep learning models have been designed for multimodal music information retrieval (MIR) applications such as multimodal music sentiment analysis, genre classification, recommender systems, and emotion recognition, making such models indispensable for MIR tasks. However, solving these tasks in a data-driven manner depends on the availability of high-quality benchmark datasets, so datasets tailored to multimodal MIR applications are essential. While a handful of multimodal datasets exist for distinct MIR applications, none are available for low-resource languages such as the Sotho-Tswana languages. In response to this gap, we introduce a novel multimodal dataset for various MIR applications. The dataset centres on Sotho-Tswana musical videos and encompasses the textual, visual, and audio modalities specific to Sotho-Tswana musical content. The musical videos were downloaded from YouTube, and Python programs were written to process them and extract relevant spectral-based acoustic features using different Python libraries. The dataset was annotated manually by native speakers of Sotho-Tswana languages who understand the culture and traditions of the Sotho-Tswana people. To our knowledge, no such dataset has been established until now.
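
The abstract does not say which spectral-based features were extracted or which Python libraries were used. As a rough illustration only, the sketch below computes one common spectral feature, the spectral centroid, for a single audio frame using the standard library alone. The choice of feature, the function names, and the synthetic 1 kHz test tone are all assumptions made for this example and are not taken from the dataset's actual processing code.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (O(N^2), fine for one short frame)."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mags.append(abs(acc))
    return mags


def spectral_centroid(frame, sample_rate):
    """Magnitude-weighted mean frequency of one frame, in Hz."""
    mags = magnitude_spectrum(frame)
    total = sum(mags)
    if total == 0:
        return 0.0  # silent frame: no meaningful centroid
    n = len(frame)
    freqs = [k * sample_rate / n for k in range(len(mags))]
    return sum(f * m for f, m in zip(freqs, mags)) / total


# Hypothetical input: a synthetic 1 kHz tone standing in for one audio frame.
SAMPLE_RATE = 8000
FRAME = [math.sin(2 * math.pi * 1000 * t / SAMPLE_RATE) for t in range(256)]
CENTROID = spectral_centroid(FRAME, SAMPLE_RATE)
```

For a pure tone that falls exactly on a DFT bin, as it does here, the centroid lands close to the tone's frequency. A real pipeline would more likely rely on an FFT-based audio library and process many overlapping frames per track.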

Journal: Data in Brief	Publication Date: Jun 26, 2024
License type: cc-by

Similar Papers

Ranking Tag Pairs for Music Recommendation Using Acoustic Similarity
Jaesung Lee ... Dae-Won Kim
The International Journal of Fuzzy Logic and Intelligent Systems | VOL. 15
30 Sep 2015

Acoustic features from the recording studio for Music Information Retrieval Tasks
Tim Ziemer ... Pattararat Kiattipadungkul
01 Jan 2020

Music recommendation based on acoustic features from the recording studio
Tim Ziemer ... Tanyarin Karuchit
The Journal of the Acoustical Society of America | VOL. 148
01 Oct 2020

MPEG-7—Standardized tools for music information retrieval
Jürgen Herre
The Journal of the Acoustical Society of America | VOL. 118
01 Sep 2005
