Abstract

By its nature, spontaneous human speech is rarely fluent. Among its characteristic features are speech variation and speech disfluencies such as hesitations, fillers, and artefacts. Such elements are an obstacle both for automatic speech processing and for the processing of its transcriptions. To detect these elements automatically, a corpus of spontaneous Russian speech was collected using a task-based methodology. The corpus was annotated for several types of disfluencies (hesitations, repairs, and sound lengthening) as well as for artefacts. Hesitations and artefacts were detected using parameters such as duration, energy, fundamental frequency, and other spectral characteristics.

