NHSS: A speech and singing parallel database

Bidisha Sharma,Xiaoxue Gao,Karthika Vijayan,Xiaohai Tian,Haizhou Li

doi:10.1016/j.specom.2021.07.002

Bidisha Sharma, Xiaoxue Gao + Show 3 more

Open Access

https://doi.org/10.1016/j.specom.2021.07.002

Copy DOI

Abstract

We present a database of parallel recordings of speech and singing, collected and released by the Human Language Technology (HLT) laboratory at the National University of Singapore (NUS), that is called NUS-HLT Speak–Sing (NHSS) database. We release this database11https://hltnus.github.io/NHSSDatabase/. to the public to support research activities, that include, but not limited to comparative studies of acoustic attributes of speech and singing signals, cooperative synthesis of speech and singing voices, and speech-to-singing conversion. This database consists of recordings of sung vocals of English pop songs, the spoken counterpart of lyrics of the songs read by the singers in their natural reading manner, and manually prepared utterance-level and word-level annotations. The audio recordings in the NHSS database correspond to 100 songs sung and spoken by 10 singers, resulting in a total of 7 h of audio data. There are 5 male and 5 female singers, singing and reading the lyrics of 10 songs each. In this paper, we discuss the design methodology of the database, analyze the similarities and dissimilarities in characteristics of speech and singing voices, and provide some strategies to address relationships between these characteristics for converting one to another. We develop benchmark systems, which can be used as reference for speech-to-singing alignment, spectral mapping, and conversion using the NHSS database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

NHSS: A speech and singing parallel database

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Jul 12, 2021
Citations: 13

Similar Papers

Multiple brain atlas database and atlas-based neuroimaging system
Wieslaw L Nowinski ... Anthony Fang
Computer Aided Surgery | VOL. 2
Wieslaw L Nowinski, et. al.Wieslaw L Nowinski ... Anthony Fang
01 Jan 1997
Computer Aided Surgery | VOL. 2

Structural basis for RNA‐silencing suppression by Tomato aspermy virus protein 2b
Hong‐Ying Chen ... Y Adam Yuan
EMBO reports | VOL. 9
Hong‐Ying Chen, et. al.Hong‐Ying Chen ... Y Adam Yuan
04 Jul 2008
EMBO reports | VOL. 9

Neuromuscular compartments in the long head of triceps: a morphological study in rabbits.
Jie Liu ... Barry P Pereira
Muscle & nerve | VOL. 20
Jie Liu, et. al.Jie Liu ... Barry P Pereira
01 Jul 1997
Muscle & nerve | VOL. 20

Listeria Septicaemia in a Young Healthy Pregnant Woman
A Kurup ... S Arulkumaran
Australian and New Zealand Journal of Obstetrics and Gynaecology | VOL. 35
A Kurup, et. al.A Kurup ... S Arulkumaran
01 Aug 1995
Australian and New Zealand Journal of Obstetrics and Gynaecology | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

NHSS: A speech and singing parallel database

Abstract

Talk to us

Similar Papers

More From: Speech Communication