Acquiring Speech Transcriptions Using Mismatched Crowdsourcing

Preethi Jyothi,Mark Hasegawa-Johnson

doi:10.1609/aaai.v29i1.9343

Abstract

Transcribed speech is a critical resource for building statistical speech recognition systems. Recent work has looked towards soliciting transcriptions for large speech corpora from native speakers of the language using crowdsourcing techniques. However, native speakers of the target language may not be readily available for crowdsourcing. We examine the following question: can humans unfamiliar with the target language help transcribe? We follow an information-theoretic approach to this problem: (1) We learn the characteristics of a noisy channel that models the transcribers' systematic perception biases. (2) We use an error-correcting code, specifically a repetition code, to encode the inputs to this channel, in conjunction with a maximum-likelihood decoding rule. To demonstrate the feasibility of this approach, we transcribe isolated Hindi words with the help of Mechanical Turk workers unfamiliar with Hindi. We successfully recover Hindi words with an accuracy of over 85% (and 94% in a 4-best list) using a 15-fold repetition code. We also estimate the conditional entropy of the input to this channel (Hindi words) given the channel output (transcripts from crowdsourced workers) to be less than 2 bits; this serves as a theoretical estimate of the average number of bits of auxiliary information required for errorless recovery.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Acquiring Speech Transcriptions Using Mismatched Crowdsourcing

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Feb 16, 2015
Citations: 23

Similar Papers

Perceptual Benefits of Linguistic Diversity and Language Background: Evidence from Auditory Free Classification of English Dialect Accents and Asian-Accented English
Kristen Syrett ... Joy Lu
Glossa Psycholinguistics | VOL. 3
Kristen Syrett, et. al.Kristen Syrett ... Joy Lu
26 Aug 2024
Glossa Psycholinguistics | VOL. 3

RESEARCH OF ORAL SPEECH WITH SIGNS OF INTERLINGUAL INTERFERENCE, VERNACULAR WITH FOREIGN LANGUAGE ELEMENTS, IMITATION OF SPEECH IN NON-NATIVE LANGUAGE
A Ovannisian
Criminalistics and Forensics | VOL. -
A OvannisianA Ovannisian
01 Jan 2020
Criminalistics and Forensics | VOL. -

Does learning a foreign language affect object categorization in native speakers of a language with grammatical gender? The case of Lithuanian speakers learning three languages with different types of gender systems (Italian, Russian and German).
Luca At Vernich
International Journal of Bilingualism | VOL. 23
Luca At VernichLuca At Vernich
23 Sep 2017
International Journal of Bilingualism | VOL. 23

Speech acts and politeness across cultures
Heather Bowe ... Kylie Martin
-
Heather Bowe, et. al.Heather Bowe ... Kylie Martin
12 Apr 2007
12 Apr 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Acquiring Speech Transcriptions Using Mismatched Crowdsourcing

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence