Abstract
One of the central computational challenges for speech perception is that talkers differ in pronunciation, i.e., in how they map linguistic categories and meanings onto the acoustic signal. Yet listeners typically overcome these differences within minutes (Clarke & Garrett, 2004; Xie et al., 2018). The mechanisms that underlie these adaptive abilities remain unclear. One influential hypothesis holds that listeners achieve robust speech perception across talkers through low-level pre-linguistic normalization. We investigate the role of normalization in the perception of L1-US English vowels. We train ideal observers (IOs) on unnormalized or normalized acoustic cues from a phonetic database of 8 /h-VOWEL-d/ words of US English (N = 1240 recordings from 16 talkers; Xie & Jaeger, 2020). All IOs had zero degrees of freedom in predicting perception; that is, their predictions are completely determined by the pronunciation statistics. We compare the IOs' predictions against L1-US English listeners' 8-way categorization responses for /h-VOWEL-d/ words in a web-based experiment. We find that (1) pre-linguistic normalization substantially improves the fit to human responses, from 74% to 90% of best-possible performance (chance = 12.5%); (2) the best-performing normalization accounts centered and/or scaled formants by talker; and (3) general-purpose normalization (C-CuRE; McMurray & Jongman, 2011) performed as well as vowel-specific normalization.
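For concreteness, the sketch below illustrates, under stated assumptions, the kind of pipeline the abstract describes: formants are centered and/or scaled within talker (Lobanov-style z-scoring), a Gaussian ideal observer is estimated per vowel category from the normalized cues, and categorization probabilities follow from Bayes' rule with uniform priors. This is not the authors' code; the function names (normalize_by_talker, fit_io, categorize), the diagonal-covariance simplification, and the toy F1/F2 values are hypothetical.

```python
# Minimal sketch (not the authors' implementation): talker-based formant
# normalization and a simple Gaussian ideal observer (IO) whose predictions
# are fully determined by the category statistics of the training data.
import numpy as np

def normalize_by_talker(cues, talkers, center=True, scale=True):
    """Center and/or scale each cue dimension within talker (Lobanov-style)."""
    cues = np.asarray(cues, dtype=float)
    talkers = np.asarray(talkers)
    out = cues.copy()
    for t in np.unique(talkers):
        idx = talkers == t
        if center:
            out[idx] -= cues[idx].mean(axis=0)
        if scale:
            sd = cues[idx].std(axis=0)
            out[idx] /= np.where(sd > 0, sd, 1.0)
    return out

def fit_io(cues, labels):
    """Per-category cue means and SDs (diagonal-covariance Gaussian IO);
    no free parameters beyond the pronunciation statistics themselves."""
    labels = np.asarray(labels)
    return {c: (cues[labels == c].mean(axis=0),
                np.maximum(cues[labels == c].std(axis=0), 1e-6))
            for c in np.unique(labels)}

def categorize(io, x):
    """Posterior over categories for cue vector x, assuming uniform priors."""
    def loglik(mu, sd):
        return np.sum(-0.5 * ((x - mu) / sd) ** 2 - np.log(sd))
    logp = np.array([loglik(mu, sd) for mu, sd in io.values()])
    p = np.exp(logp - logp.max())
    return dict(zip(io.keys(), p / p.sum()))

# Toy usage with made-up F1/F2 values (Hz) for two of the eight words:
cues = np.array([[300, 2300], [310, 2250], [700, 1200], [690, 1250],
                 [280, 2100], [290, 2050], [650, 1100], [640, 1150]])
talkers = ["t1"] * 4 + ["t2"] * 4
labels = ["heed", "heed", "hod", "hod"] * 2

norm = normalize_by_talker(cues, talkers)
io = fit_io(norm, labels)
print(categorize(io, norm[0]))  # posterior should strongly favor "heed"
```

In this sketch, comparing IOs trained on `cues` versus `norm` corresponds to the unnormalized versus normalized comparison in the abstract; the `center` and `scale` flags correspond to the centering and/or scaling variants.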