Vocal92: Audio Dataset With a Cappella Solo Singing and Speech

Zhuo Deng,Ruohua Zhou

doi:10.1109/access.2023.3253207

Zhuo Deng, Ruohua Zhou

Open Access

https://doi.org/10.1109/access.2023.3253207

Copy DOI

Abstract

Singer recognition plays a vital role in music information retrieval systems. Most songs in the singer recognition system are mixed audios of music and voice. In contrast, there is a lack of labeled a cappella solo singing data suitable for singer recognition. Text-independent singer recognition systems successfully encode audio features such as voice pitch, intensity, and timbre to achieve good performance. Most such systems are trained and evaluated using data from music with accompaniment. However, due to the influence of background music, the performance of the singer recognition model was limited. Contrarily, a powerful singer identification system can be trained and evaluated using a cappella solo singing voices with a clear and broad range of qualities. There needs to be labeled clear singing data suitable for singer recognition research. To address this issue, we present Vocal92, a multivariate a cappella solo singing and speech audio dataset spanning around 146.73 hours sourced from volunteers. Furthermore, we use two current state-of-the-art models to construct the singer recognition baseline system. In experiments, the singer recognition model developed by a cappella solo singing data performs well in both single-mode and cross-modal verification data, significantly improving related works. The dataset is accessible to everyone at https://pan.baidu.com/s/1Pn62DHfal2OOZ_5JqgGBdQ with jnz5 as the validation code. For non-commercial use, the dataset will be available free of charge at the IEEE DataPort (after acceptance).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Vocal92: Audio Dataset With a Cappella Solo Singing and Speech

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Younger Adults Derive Pleasure and Utilitarian Benefits from Browsing for Music Information Seeking in Physical and Digital Spaces
Diana K Wakimoto
Evidence Based Library and Information Practice | VOL. 7
Diana K WakimotoDiana K Wakimoto
12 Sep 2012
Evidence Based Library and Information Practice | VOL. 7

Analysis of user needs and information features in natural language queries seeking music information
Jin Ha Lee
Journal of the American Society for Information Science and Technology | VOL. 61
Jin Ha LeeJin Ha Lee
29 Jan 2010
Journal of the American Society for Information Science and Technology | VOL. 61

The Music Information Retrieval Evaluation eXchange: Some Observations and Insights
J Stephen Downie ... Mert Bay
-
J Stephen Downie, et. al.J Stephen Downie ... Mert Bay
01 Jan 2009
01 Jan 2009

Evaluation of a simple and effective music information retrieval method
Stephen Downie ... Michael Nelson
-
Stephen Downie, et. al.Stephen Downie ... Michael Nelson
01 Jul 2000
01 Jul 2000

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Vocal92: Audio Dataset With a Cappella Solo Singing and Speech

Abstract

Talk to us

Similar Papers

More From: IEEE Access