Multi-talker Babble Research Articles

Little is known about the influence of vocal emotions on speech understanding. Word recognition accuracy for stimuli spoken to portray seven emotions (anger, disgust, fear, sadness, neutral, happiness, and pleasant surprise) was tested in younger and older listeners. Emotions were presented in either mixed (heterogeneous emotions mixed in a list) or blocked (homogeneous emotion blocked in a list) conditions. Three main hypotheses were tested. First, vocal emotion affects word recognition accuracy; specifically, portrayals of fear enhance word recognition accuracy because listeners orient to threatening information and/or distinctive acoustical cues such as high pitch mean and variation. Second, older listeners recognize words less accurately than younger listeners, but the effects of different emotions on intelligibility are similar across age groups. Third, blocking emotions in list results in better word recognition accuracy, especially for older listeners, and reduces the effect of emotion on intelligibility because as listeners develop expectations about vocal emotion, the allocation of processing resources can shift from emotional to lexical processing. Emotion was the within-subjects variable: all participants heard speech stimuli consisting of a carrier phrase followed by a target word spoken by either a younger or an older talker, with an equal number of stimuli portraying each of seven vocal emotions. The speech was presented in multi-talker babble at signal to noise ratios adjusted for each talker and each listener age group. Listener age (younger, older), condition (mixed, blocked), and talker (younger, older) were the main between-subjects variables. Fifty-six students (Mage= 18.3 years) were recruited from an undergraduate psychology course; 56 older adults (Mage= 72.3 years) were recruited from a volunteer pool. All participants had clinically normal pure-tone audiometric thresholds at frequencies ≤3000 Hz. There were significant main effects of emotion, listener age group, and condition on the accuracy of word recognition in noise. Stimuli spoken in a fearful voice were the most intelligible, while those spoken in a sad voice were the least intelligible. Overall, word recognition accuracy was poorer for older than younger adults, but there was no main effect of talker, and the pattern of the effects of different emotions on intelligibility did not differ significantly across age groups. Acoustical analyses helped elucidate the effect of emotion and some intertalker differences. Finally, all participants performed better when emotions were blocked. For both groups, performance improved over repeated presentations of each emotion in both blocked and mixed conditions. These results are the first to demonstrate a relationship between vocal emotion and word recognition accuracy in noise for younger and older listeners. In particular, the enhancement of intelligibility by emotion is greatest for words spoken to portray fear and presented heterogeneously with other emotions. Fear may have a specialized role in orienting attention to words heard in noise. This finding may be an auditory counterpart to the enhanced detection of threat information in visual displays. The effect of vocal emotion on word recognition accuracy is preserved in older listeners with good audiograms and both age groups benefit from blocking and the repetition of emotions.

Natural variability in speech is a significant challenge to robust successful spoken word recognition. In everyday listening environments, listeners must quickly adapt and adjust to multiple sources of variability in both the signal and listening environments. High-variability speech may be particularly difficult to understand for non-native listeners, who have less experience with the second language (L2) phonological system and less detailed knowledge of sociolinguistic variation of the L2. The purpose of this study was to investigate the effects of high-variability sentences on non-native speech recognition and to explore the underlying sources of individual differences in speech recognition abilities of non-native listeners. Participants completed two sentence recognition tasks involving high-variability and low-variability sentences. They also completed a battery of behavioral tasks and self-report questionnaires designed to assess their indexical processing skills, vocabulary knowledge, and several core neurocognitive abilities. Native speakers of Mandarin (n = 25) living in the United States recruited from the Indiana University community participated in the current study. A native comparison group consisted of scores obtained from native speakers of English (n = 21) in the Indiana University community taken from an earlier study. Speech recognition in high-variability listening conditions was assessed with a sentence recognition task using sentences from PRESTO (Perceptually Robust English Sentence Test Open-Set) mixed in 6-talker multitalker babble. Speech recognition in low-variability listening conditions was assessed using sentences from HINT (Hearing In Noise Test) mixed in 6-talker multitalker babble. Indexical processing skills were measured using a talker discrimination task, a gender discrimination task, and a forced-choice regional dialect categorization task. Vocabulary knowledge was assessed with the WordFam word familiarity test, and executive functioning was assessed with the BRIEF-A (Behavioral Rating Inventory of Executive Function - Adult Version) self-report questionnaire. Scores from the non-native listeners on behavioral tasks and self-report questionnaires were compared with scores obtained from native listeners tested in a previous study and were examined for individual differences. Non-native keyword recognition scores were significantly lower on PRESTO sentences than on HINT sentences. Non-native listeners' keyword recognition scores were also lower than native listeners' scores on both sentence recognition tasks. Differences in performance on the sentence recognition tasks between non-native and native listeners were larger on PRESTO than on HINT, although group differences varied by signal-to-noise ratio. The non-native and native groups also differed in the ability to categorize talkers by region of origin and in vocabulary knowledge. Individual non-native word recognition accuracy on PRESTO sentences in multitalker babble at more favorable signal-to-noise ratios was found to be related to several BRIEF-A subscales and composite scores. However, non-native performance on PRESTO was not related to regional dialect categorization, talker and gender discrimination, or vocabulary knowledge. High-variability sentences in multitalker babble were particularly challenging for non-native listeners. Difficulty under high-variability testing conditions was related to lack of experience with the L2, especially L2 sociolinguistic information, compared with native listeners. Individual differences among the non-native listeners were related to weaknesses in core neurocognitive abilities affecting behavioral control in everyday life.

Multi-talker Babble Research Articles

Related Topics

Articles published on Multi-talker Babble

An algorithm that generalizes to novel noise segments to improve speech intelligibility for hearing-impaired listeners

Segmental contribution to the intelligibility of noise-suppressed speech

Talker variation in the perception of speech in noise

Task demands and cognitive abilities impact listening effort for older adult hearing aid users

Speech recognition and listening effort across various conditions in adults with aphasia

Relationship Between Speech Intelligibility and Speech Comprehension in Babble Noise.

Cortical activity predicts which older adults recognize speech in noise and when.

Availability of binaural cues for pediatric bilateral cochlear implant recipients.

Masked sentence recognition assessed at ascending target-to-masker ratios: modest effects of repeating stimuli.

How "proficient" is proficient? Bilingual listeners' recognition of English words in noise.

Functional neuroanatomy of auditory scene analysis in Alzheimer's disease.

An analysis of error patterns in children's backward digit recall in noise.

Raspberry, not a car: context predictability and a phonological advantage in early and late learners' processing of speech in noise.

Intelligibility of emotional speech in younger and older adults.

The effect of increased vocal intensity on interarticulator timing in speakers with Parkinson's disease: A preliminary analysis

Non-native listeners' recognition of high-variability speech using PRESTO.

English vowel identification in quiet and noise: effects of listeners' native language background.

Speaker's voice as a memory cue

Ideal time-frequency masking algorithms lead to different speech intelligibility and quality in normal-hearing and cochlear implant listeners.

How difficult is difficult? Speech perception in noise in the elderly hearing impaired.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-talker Babble Research Articles

Related Topics

Articles published on Multi-talker Babble

An algorithm that generalizes to novel noise segments to improve speech intelligibility for hearing-impaired listeners

Segmental contribution to the intelligibility of noise-suppressed speech

Talker variation in the perception of speech in noise

Task demands and cognitive abilities impact listening effort for older adult hearing aid users

Speech recognition and listening effort across various conditions in adults with aphasia

Relationship Between Speech Intelligibility and Speech Comprehension in Babble Noise.

Cortical activity predicts which older adults recognize speech in noise and when.

Availability of binaural cues for pediatric bilateral cochlear implant recipients.

Masked sentence recognition assessed at ascending target-to-masker ratios: modest effects of repeating stimuli.

How "proficient" is proficient? Bilingual listeners' recognition of English words in noise.

Functional neuroanatomy of auditory scene analysis in Alzheimer's disease.

An analysis of error patterns in children's backward digit recall in noise.

Raspberry, not a car: context predictability and a phonological advantage in early and late learners' processing of speech in noise.

Intelligibility of emotional speech in younger and older adults.

The effect of increased vocal intensity on interarticulator timing in speakers with Parkinson's disease: A preliminary analysis

Non-native listeners' recognition of high-variability speech using PRESTO.

English vowel identification in quiet and noise: effects of listeners' native language background.

Speaker's voice as a memory cue

Ideal time-frequency masking algorithms lead to different speech intelligibility and quality in normal-hearing and cochlear implant listeners.

How difficult is difficult? Speech perception in noise in the elderly hearing impaired.