The KTH speech database

Rolf Carlson, Björn Granström, Lennart Nord

doi:10.1016/0167-6393(90)90013-y

Abstract

Current speech research needs large databases for testing production and perception models at different linguistic levels. Administering such databases poses considerable problems, both in labeling the speech and in accessing stored material easily. To alleviate some of these problems, we have created a speech analysis system. Speech data are stored in sentence-sized files. These files are segmented and transcribed semi-automatically, given a phonetic transcription of the utterance. This transcription is generated by the letter-to-sound rules of our text-to-speech system. The database is designed primarily for acoustic-phonetic research rather than for, e.g., the evaluation of speech recognizers. This calls for flexible and linguistically specified retrieval patterns. Our unorthodox solution is to use the synthesis rule structure, similar to the notation used in generative phonology, for accessing the data. With a brief rule statement, speech segments meeting the specified contextual conditions can be identified. Durational data can be collected directly during the database search. Spectral analysis programs operating on a variety of spectral representations have also been created; they display the result, typically as a mean/standard deviation spectrum or as a contour histogram spectrum.
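
The abstract does not give the actual rule syntax, so the following is only a minimal sketch of the general idea: a context-conditioned search over labeled segments that collects durations as it goes. It assumes a simplified rule of the form "target / left _ right", loosely in the style of generative phonology. Every name in it (`Segment`, `CLASSES`, `search`, `duration_stats`), the class symbol `V`, the boundary symbol `#`, and the toy data are hypothetical and are not taken from the KTH system.

```python
from dataclasses import dataclass
from statistics import mean, pstdev


@dataclass(frozen=True)
class Segment:
    """One labeled stretch of speech (hypothetical structure)."""
    label: str     # phonetic symbol assigned during transcription
    start_ms: int  # segment onset in milliseconds
    end_ms: int    # segment offset in milliseconds

    @property
    def duration_ms(self) -> int:
        return self.end_ms - self.start_ms


# Assumed symbol classes; "V" stands for any vowel in this toy inventory.
CLASSES = {"V": {"a", "e", "i", "o", "u"}}


def matches(spec, label):
    """True if label satisfies spec (a class name, a literal symbol, or None)."""
    if spec is None:  # no condition on this position
        return True
    return label in CLASSES.get(spec, {spec})


def search(utterances, target, left=None, right=None):
    """Return every segment matching target in the context left _ right."""
    hits = []
    for segs in utterances:
        for i, seg in enumerate(segs):
            # "#" marks the utterance boundary on either side.
            prev_label = segs[i - 1].label if i > 0 else "#"
            next_label = segs[i + 1].label if i + 1 < len(segs) else "#"
            if (matches(target, seg.label)
                    and matches(left, prev_label)
                    and matches(right, next_label)):
                hits.append(seg)
    return hits


def duration_stats(hits):
    """Count, mean, and population standard deviation of durations (ms)."""
    durations = [h.duration_ms for h in hits]
    return len(durations), mean(durations), pstdev(durations)


# Toy data: two sentence-sized utterances with invented timings.
utterances = [
    [Segment("a", 0, 100), Segment("s", 100, 200), Segment("a", 200, 300)],
    [Segment("s", 0, 120), Segment("a", 120, 200),
     Segment("s", 200, 260), Segment("i", 260, 350)],
]

# Rule "s / V _ V": an [s] between two vowels.
hits = search(utterances, target="s", left="V", right="V")
count, mean_ms, sd_ms = duration_stats(hits)
print(count, mean_ms, sd_ms)
```

The utterance-initial [s] in the second utterance is skipped because its left context is the boundary rather than a vowel, which is the kind of contextual filtering the rule statement is meant to express.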

Journal: Speech Communication
Publication Date: Aug 1, 1990
Citations: 2
