Who's talking and what are they saying: Phonetic cue distributions link speech and talker recognition

Dave F Kleinschmidt

doi:10.1121/1.5101194

Abstract

On the one hand, talker variability is one of the fundamental challenges for speech recognition: each talker has their own mapping from linguistic units to sounds, which means that an effective listener must use a different recognition function for each talker. On the other hand, talker variability means that speech is a source of rich information about who the talker is. This dual nature of talker variability means that speech and talker recognition are inextricably linked: knowing something about who is talking makes it easier to understand what they are saying, and knowing something about how someone talks unlocks the rich social meaning of speech. I argue that the concept of a talker's generative model, or the probabilistic distributions of sounds associated with each phonetic/linguistic category, is a useful general purpose conceptual tool for understanding the link between talker variability, speech recognition, and social identity. With such phonetic cue distributions, we can use information theoretic tools to quantify both the extent and structure of talker variability across different phonetic systems and establish in-principle consequences of talker variability for both speech recognition and socio-indexical inferences from speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Who's talking and what are they saying: Phonetic cue distributions link speech and talker recognition

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

Assessment of High-variability Speech Recognition in Adult Cochlear Implant Users using PRESTO.
David Pisoni ... Terrin N Tamati
Journal of the American Academy of Audiology | VOL. -
David Pisoni, et. al.David Pisoni ... Terrin N Tamati
25 Sep 2023
Journal of the American Academy of Audiology | VOL. -

Speech perception in children with cochlear implants: effects of lexical difficulty, talker variability, and word length.
Karen Iler Kirk ... Susan Todd Sehgal
Annals of Otology, Rhinology & Laryngology | VOL. 185
Karen Iler Kirk, et. al.Karen Iler Kirk ... Susan Todd Sehgal
01 Dec 2000
Annals of Otology, Rhinology & Laryngology | VOL. 185

Talker variability in real-life speech recognition by cochlear implant users
Terrin N Tamati ... Deniz Baskent
The Journal of the Acoustical Society of America | VOL. 141
Terrin N Tamati, et. al.Terrin N Tamati ... Deniz Baskent
01 May 2017
The Journal of the Acoustical Society of America | VOL. 141

Talker variability in spoken word recognition: Evidence from repetition priming
Yu Zhang ... Chao-Yang Lee
The Journal of the Acoustical Society of America | VOL. 136
Yu Zhang, et. al.Yu Zhang ... Chao-Yang Lee
01 Oct 2014
The Journal of the Acoustical Society of America | VOL. 136

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Who's talking and what are they saying: Phonetic cue distributions link speech and talker recognition

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America