Accessing Phonetic Variation in Spoken Language Corpora through Non-standard Orthography

Andrea C Schalley,Simon Musgrave,Michael Haugh

doi:10.1080/07268602.2014.875459

Andrea C Schalley, Simon Musgrave + Show 1 more

https://doi.org/10.1080/07268602.2014.875459

Copy DOI

Abstract

Much of the sociolinguistic and stylistic variation which is of interest to linguists is phonetic in nature, but the access route to corpus data is typically via a textual transcription. This poses a significant problem for a researcher who wishes to access the original recordings of speech in order to analyse variation: how can they search for relevant data? Many transcription traditions allow for the representation of such variation through non-standard orthography, and such conventions should therefore allow access to data relevant to the study of variation. However, the specific conventions used vary between traditions (and indeed may not be applied consistently by individual transcribers). This then creates another problem where the researcher wishes to access data across an aggregated collection, which is a practical necessity given the relatively limited size of most corpora of spoken language. In this paper, we analyse the conventions used in two of the component collections in the Australian National Corpus, the Australian Radio Talkback Corpus and the Monash Corpus of Spoken English. On the basis of this analysis, we develop a fragment of an ontology which gives an explicit account of the phenomena related to non-standard pronunciation represented in the transcripts and which can therefore act as the basis for better searching of the collections and better access to relevant data for analysing sociolinguistic and stylistic variation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Accessing Phonetic Variation in Spoken Language Corpora through Non-standard Orthography

Abstract

Talk to us

Similar Papers

More From: Australian Journal of Linguistics

Lead the way for us

Journal: Australian Journal of Linguistics	Publication Date: Jan 2, 2014
Citations: 2

Similar Papers

Spiral construction of syntactically annotated spoken language corpus
T Ohno ... S Matsuhara
-
T Ohno, et. al.T Ohno ... S Matsuhara
26 Oct 2003
26 Oct 2003

Style and Sociolinguistic Variation
John R Rickford
-
John R RickfordJohn R Rickford
03 Jan 2002
03 Jan 2002

Sociolinguistic Variation in Contemporary French
-
-
--
14 Oct 2009
14 Oct 2009

Modality in Contemporary English (review)
Rong Chen
Language | VOL. 82
Rong ChenRong Chen
01 Mar 2006
Language | VOL. 82

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Accessing Phonetic Variation in Spoken Language Corpora through Non-standard Orthography

Abstract

Talk to us

Similar Papers

More From: Australian Journal of Linguistics