Multiple levels of linguistic and paralinguistic features contribute to voice recognition.

Jean Mary Zarate,Xing Tian,Kevin J P Woods,David Poeppel

doi:10.1038/srep11475

Abstract

Voice or speaker recognition is critical in a wide variety of social contexts. In this study, we investigated the contributions of acoustic, phonological, lexical, and semantic information toward voice recognition. Native English speaking participants were trained to recognize five speakers in five conditions: non-speech, Mandarin, German, pseudo-English, and English. We showed that voice recognition significantly improved as more information became available, from purely acoustic features in non-speech to additional phonological information varying in familiarity. Moreover, we found that the recognition performance is transferable between training and testing in phonologically familiar conditions (German, pseudo-English, and English), but not in unfamiliar (Mandarin) or non-speech conditions. These results provide evidence suggesting that bottom-up acoustic analysis and top-down influence from phonological processing collaboratively govern voice recognition.

Highlights

Such evidence suggests that both linguistic and paralinguistic characteristics underlie voice recognition, this is based mostly on studies that isolate only one class of characteristics
While it can be argued that word strings reduce prosody that may help participants distinguish between languages, we utilized word strings for all speech conditions to isolate the features of interest – acoustic, phonological, lexical, and semantic cues – rather than focusing on additional cues, such as prosody
While the previous work could have introduced performance bias because of partial overlaps between stimuli used in training and testing, the stimuli used for training completely differed from those used for testing in our study, such that we could examine generalization of voice recognition performance in a more unbiased fashion and assess the contributions of learned acoustic, phonological, lexical, and semantic cues towards recognizing voices in new stimuli

Summary

Introduction

Such evidence suggests that both linguistic and paralinguistic characteristics underlie voice recognition, this is based mostly on studies that isolate only one class of characteristics. In an experimental design modeled closely on Perrachione’s (2007, 2011) studies of voice recognition, monolingual English speakers were trained to associate five voices with avatars in five conditions (non-speech, Mandarin, German, pseudo-English, and English) — rather than just two conditions as in previous work (Mandarin and English). Three factors distinguish this from Perrachione’s earlier design. If each type of information has a distinctive contribution, voice recognition performance should improve systematically as a function of the amount of information available: from acoustic features (non-speech) to the availability of unknown/unfamiliar phonological information (Mandarin) to increasingly familiar phonological content (from German to pseudo-English) to full lexical and semantic access (English). If lexical-semantic access further contributes to voice recognition in the current experimental context, we might see increased voice recognition progressing from German to pseudo-English, and to English; otherwise, we expected to see similar performance among these three conditions

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Jun 19, 2015
Citations: 43	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Multiple levels of linguistic and paralinguistic features contribute to voice recognition.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

How Different Types of Linguistic Information Impact Voice Perception: Evidence From the Language-Familiarity Effect.
Keke Yu ... Linjun Zhang
Language and Speech | VOL. 66
Keke Yu, et. al.Keke Yu ... Linjun Zhang
21 Jan 2023
Language and Speech | VOL. 66

The processing of voice identity in developmental prosopagnosia
Ran R Liu ... Jason J.S Barton
Cortex | VOL. 71
Ran R Liu, et. al.Ran R Liu ... Jason J.S Barton
04 Aug 2015
Cortex | VOL. 71

The meter of nursery rhymes: An application of the Halle-Keyser theory of meter
Jacqueline Guéron
Poetics | VOL. 3
Jacqueline GuéronJacqueline Guéron
01 Jan 1974
Poetics | VOL. 3

Accessing short-term memory with semantic and phonological information: a time-course analysis.
Brian Mcelree
Memory & Cognition | VOL. 24
Brian McelreeBrian Mcelree
01 Mar 1996
Memory & Cognition | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiple levels of linguistic and paralinguistic features contribute to voice recognition.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports