Male Talkers Research Articles

The relative contributions of superior temporal vs. inferior frontal and parietal networks to recognition of speech in a background of competing speech remain unclear, although the contributions themselves are well established. Here, we use fMRI with spectrotemporal modulation transfer function (ST-MTF) modeling to examine the speech information represented in temporal vs. frontoparietal networks for two speech recognition tasks with and without a competing talker. Specifically, 31 listeners completed two versions of a three-alternative forced choice competing speech task: “Unison” and “Competing”, in which a female (target) and a male (competing) talker uttered identical or different phrases, respectively. Spectrotemporal modulation filtering (i.e., acoustic distortion) was applied to the two-talker mixtures and ST-MTF models were generated to predict brain activation from differences in spectrotemporal-modulation distortion on each trial. Three cortical networks were identified based on differential patterns of ST-MTF predictions and the resultant ST-MTF weights across conditions (Unison, Competing): a bilateral superior temporal (S-T) network, a frontoparietal (F-P) network, and a network distributed across cortical midline regions and the angular gyrus (M-AG). The S-T network and the M-AG network responded primarily to spectrotemporal cues associated with speech intelligibility, regardless of condition, but the S-T network responded to a greater range of temporal modulations suggesting a more acoustically driven response. The F-P network responded to the absence of intelligibility-related cues in both conditions, but also to the absence (presence) of target-talker (competing-talker) vocal pitch in the Competing condition, suggesting a generalized response to signal degradation. Task performance was best predicted by activation in the S-T and F-P networks, but in opposite directions (S-T: more activation = better performance; F-P: vice versa). Moreover, S-T network predictions were entirely ST-MTF mediated while F-P network predictions were ST-MTF mediated only in the Unison condition, suggesting an influence from non-acoustic sources (e.g., informational masking) in the Competing condition. Activation in the M-AG network was weakly positively correlated with performance and this relation was entirely superseded by those in the S-T and F-P networks. Regarding contributions to speech recognition, we conclude: (a) superior temporal regions play a bottom-up, perceptual role that is not qualitatively dependent on the presence of competing speech; (b) frontoparietal regions play a top-down role that is modulated by competing speech and scales with listening effort; and (c) performance ultimately relies on dynamic interactions between these networks, with ancillary contributions from networks not involved in speech processing per se (e.g., the M-AG network).

Read full abstract

Talker sex and spatial cues can facilitate segregation of competing speech. However, the spectrotemporal degradation associated with cochlear implants (CIs) can limit the benefit of talker sex and spatial cues. Acoustic hearing in the nonimplanted ear can improve access to talker sex cues in CI users. However, it's unclear whether the CI can improve segregation of competing speech when maskers are symmetrically placed around the target (i.e., when spatial cues are available), compared with acoustic hearing alone. The aim of this study was to investigate whether a CI can improve segregation of competing speech by individuals with unilateral hearing loss. Speech recognition thresholds (SRTs) for competing speech were measured in 16 normal-hearing (NH) adults and 16 unilaterally deaf CI users. All participants were native speakers of Mandarin Chinese. CI users were divided into two groups according to thresholds in the nonimplanted ear: (1) single-sided deaf (SSD); pure-tone thresholds <25 dB HL at all audiometric frequencies, and (2) Asymmetric hearing loss (AHL; one or more thresholds > 25 dB HL). SRTs were measured for target sentences produced by a male talker in the presence of two masker talkers (different male or female talkers). The target sentence was always presented via loudspeaker directly in front of the listener (0°), and the maskers were either colocated with the target (0°) or spatially separated from the target at ±90°. Three segregation cue conditions were tested to measure masking release (MR) relative to the baseline condition: (1) Talker sex, (2) Spatial, and (3) Talker sex + Spatial. For CI users, SRTs were measured with the CI on or off. Binaural MR was significantly better for the NH group than for the AHL or SSD groups ( P < 0.001 in all cases). For the NH group, mean MR was largest with the Talker sex + spatial cues (18.8 dB) and smallest for the Talker sex cues (10.7 dB). In contrast, mean MR for the SSD group was largest with the Talker sex + spatial cues (14.7 dB), and smallest with the Spatial cues (4.8 dB). For the AHL group, mean MR was largest with the Talker sex + spatial cues (7.8 dB) and smallest with the Talker sex (4.8 dB) and the Spatial cues (4.8 dB). MR was significantly better with the CI on than off for both the AHL ( P = 0.014) and SSD groups ( P < 0.001). Across all unilaterally deaf CI users, monaural (acoustic ear alone) and binaural MR were significantly correlated with unaided pure-tone average thresholds in the nonimplanted ear for the Talker sex and Talker sex + spatial conditions ( P < 0.001 in both cases) but not for the Spatial condition. Although the CI benefitted unilaterally deaf listeners' segregation of competing speech, MR was much poorer than that observed in NH listeners. Different from previous findings with steady noise maskers, the CI benefit for segregation of competing speech from a different talker sex was greater in the SSD group than in the AHL group.

Read full abstract

Male Talkers Research Articles

Related Topics

Articles published on Male Talkers

Fundamental frequency predominantly drives talker differences in auditory brainstem responses to continuous speech.

A corpus of audio-visual recordings of linguistically balanced, Danish sentences for speech-in-noise experiments

Development of Persian Monosyllabic and Disyllabic Words for Auditory Test of Adults and Evaluation of Their Face Validity Using Psychometric Function

Evaluating Hearing Status and Word Recognition Ability in the Hmong Population Using Four Validated Monosyllabic White Hmong Dialect Word Recognition Tests.

Intelligibility of British and American English in different listening conditions

Learning to identify talkers: Do 4.5-month-old infants distinguish between unfamiliar males?

Factors that can affect divided speech intelligibility

Statistical learning at a virtual cocktail party

Cortical networks for recognition of speech with simultaneous talkers

Tracking talker-specific cues to lexical stress: Evidence from perceptual learning.

Right Posterior Temporal Cortex Supports Integration of Phonetic and Talker Information.

Speech recognition in the presence of speech maskers in children

Analyses of speech recordings from an acoustic head and torso simulator with and without face coverings using both spectrogram and transcription tools

Validating Four Hmong Word Recognition Tests With Normal-Hearing Bilingual Hmong Individuals

An investigation of interference between electromagnetic articulography and electroglottography

Role of superior temporal gyrus and planum temporale in talker segregation

Validation of Male Talker Recordings of the Spanish Pediatric Speech Recognition Threshold Test and the Spanish Pediatric Picture Identification Test.

The Influence of Male- and Female-Spoken Vowel Acoustics on Envelope-Following Responses.

Effects of tonotopic matching and spatial cues on segregation of competing speech in simulations of bilateral cochlear implants.

Cochlear Implant Facilitates the Use of Talker Sex and Spatial Cues to Segregate Competing Speech in Unilaterally Deaf Listeners.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Male Talkers Research Articles

Related Topics

Articles published on Male Talkers

Fundamental frequency predominantly drives talker differences in auditory brainstem responses to continuous speech.

A corpus of audio-visual recordings of linguistically balanced, Danish sentences for speech-in-noise experiments

Development of Persian Monosyllabic and Disyllabic Words for Auditory Test of Adults and Evaluation of Their Face Validity Using Psychometric Function

Evaluating Hearing Status and Word Recognition Ability in the Hmong Population Using Four Validated Monosyllabic White Hmong Dialect Word Recognition Tests.

Intelligibility of British and American English in different listening conditions

Learning to identify talkers: Do 4.5-month-old infants distinguish between unfamiliar males?

Factors that can affect divided speech intelligibility

Statistical learning at a virtual cocktail party

Cortical networks for recognition of speech with simultaneous talkers

Tracking talker-specific cues to lexical stress: Evidence from perceptual learning.

Right Posterior Temporal Cortex Supports Integration of Phonetic and Talker Information.

Speech recognition in the presence of speech maskers in children

Analyses of speech recordings from an acoustic head and torso simulator with and without face coverings using both spectrogram and transcription tools

Validating Four Hmong Word Recognition Tests With Normal-Hearing Bilingual Hmong Individuals

An investigation of interference between electromagnetic articulography and electroglottography

Role of superior temporal gyrus and planum temporale in talker segregation

Validation of Male Talker Recordings of the Spanish Pediatric Speech Recognition Threshold Test and the Spanish Pediatric Picture Identification Test.

The Influence of Male- and Female-Spoken Vowel Acoustics on Envelope-Following Responses.

Effects of tonotopic matching and spatial cues on segregation of competing speech in simulations of bilateral cochlear implants.

Cochlear Implant Facilitates the Use of Talker Sex and Spatial Cues to Segregate Competing Speech in Unilaterally Deaf Listeners.