Transcription and forced alignment of the digital archive of southern speech

Margaret E Renwick,Joseph A Stanley,Rachel M Olsen,Michael Olsen

doi:10.1121/1.4989090

Abstract

We describe transcription and forced alignment of the Digital Archive of Southern Speech (DASS), a project that will provide a large corpus of historical, semi-spontaneous Southern speech for acoustic analysis. 372 hours of recordings (64 interviews) comprise a subset of the Linguistic Atlas of the Gulf States, an extensive dialect study of 1121 speakers conducted across eight southern U.S. states from 1968 to 1983. Manual orthographic transcription of full DASS interviews is carried out according to in-house guidelines that ensure consistency across files and transcribers. Separate codes are used for the interviewee, interviewer, non-speech, overlapping, and unintelligible speech. Transcriber output is converted to Praat TextGrids using LaBB-CAT, a tool for maintaining large speech corpora. TextGrids containing only the interviewee’s speech are generated, and subjected to forced alignment by DARLA, which accommodates the levels of variation and noise in the DASS files with a high degree of success. Toward acoustic analysis, we evaluate three methods for vowel formant extraction: the native output of DARLA, a local implementation of FAVE-Extract, and a Praat-based extractor that incorporates separate formant tracks for different regions of the vowel space. We present this workflow of transcription and analysis to benefit other projects of similar size and scope.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Transcription and forced alignment of the digital archive of southern speech

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Journal: The Journal of the Acoustical Society of America	Publication Date: May 1, 2017
Citations: 2

Similar Papers

Methods for transcription and forced alignment of a legacy speech corpus
Rachel M. Olsen ... Michael L. Olsen
-
Rachel M. Olsen, et. al.Rachel M. Olsen ... Michael L. Olsen
01 Jan 2017
01 Jan 2017

Deficits in fine motor coordination in children with unintelligible speech.
H Amorosa ... M Dames
European archives of psychiatry and neurological sciences | VOL. 236
H Amorosa, et. al.H Amorosa ... M Dames
01 Jan 1986
European archives of psychiatry and neurological sciences | VOL. 236

Using automatic alignment on child speech: Directions for improvement
Thea Knowles ... Meghan Clayards
The Journal of the Acoustical Society of America | VOL. 138
Thea Knowles, et. al.Thea Knowles ... Meghan Clayards
01 Sep 2015
The Journal of the Acoustical Society of America | VOL. 138

Interior structure-borne noise reduction by controlling the automotive body panel vibration
Rong Guo ... Jing Zhao
Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering | VOL. 226
Rong Guo, et. al.Rong Guo ... Jing Zhao
25 Jan 2012
Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering | VOL. 226

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Transcription and forced alignment of the digital archive of southern speech

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America