The Sociolinguistic Speech Corpus of Chilean Spanish (COSCACH)

Scott Sadowsky

doi:10.1075/ijcl.19103.sad

Abstract

This paper presents the Sociolinguistic Speech Corpus of Chilean Spanish (COSCACH) v1.0, a 9.3-million-word corpus containing transcribed, lemmatized and morphologically tagged text, audio recordings and videos from 1,237 L1 speakers of Chilean Spanish, as well as a control sample of 21 non-Chilean L1 Spanish speakers. The COSCACH is the first freely available corpus of spoken Chilean Spanish of substantial size, as well as one of the largest speech corpora of any variety of Spanish. Following a review of other Chilean speech corpora, I describe how the COSCACH was constructed, covering corpus design, speaker recruitment and metadata collection, speech elicitation and recording, transcription, lemmatization and morphological tagging, and corpus compilation. I thereby aim to provide a blueprint for creating modern, large-scale speech corpora suitable for phonetic, sociophonetic and sociolinguistic research, in addition to traditional inquiry into semantics, lexis, grammar, pragmatics and discourse.

Journal: International Journal of Corpus Linguistics	Publication Date: Jan 31, 2022
Citations: 3

Similar Papers

Interlocutor accommodation on the variation of /tɾ/ in Chilean radio
Tanya L Flores
Discourse, Context & Media | VOL. 16
09 Feb 2017

Forms of address in interaction: Evidence from Chilean Spanish
Víctor Fernández-Mallat
Journal of Pragmatics | VOL. 161
16 Apr 2020

Evidence for Incomplete Neutralization in Chilean Spanish
Mariška A Bolyanatz
Phonetica | VOL. 77
04 Dec 2018

Voiceless stop lenition and reduction as linguistic and social phenomena in Concepción, Chile
Brandon M.A. Rogers ... Christina A. Mirisis
Borealis – An International Journal of Hispanic Linguistics | VOL. 7
03 Dec 2018
