How the human brain recognizes speech in the context of changing speakers.

Katharina Von Kriegstein,Stefan J Kiebel,Timothy D Griffiths,David R R Smith,Roy D Patterson

doi:10.1523/jneurosci.2742-09.2010

Abstract

We understand speech from different speakers with ease, whereas artificial speech recognition systems struggle with this task. It is unclear how the human brain solves this problem. The conventional view is that speech message recognition and speaker identification are two separate functions and that message processing takes place predominantly in the left hemisphere, whereas processing of speaker-specific information is located in the right hemisphere. Here, we distinguish the contribution of specific cortical regions, to speech recognition and speaker information processing, by controlled manipulation of task and resynthesized speaker parameters. Two functional magnetic resonance imaging studies provide evidence for a dynamic speech-processing network that questions the conventional view. We found that speech recognition regions in left posterior superior temporal gyrus/superior temporal sulcus (STG/STS) also encode speaker-related vocal tract parameters, which are reflected in the amplitude peaks of the speech spectrum, along with the speech message. Right posterior STG/STS activated specifically more to a speaker-related vocal tract parameter change during a speech recognition task compared with a voice recognition task. Left and right posterior STG/STS were functionally connected. Additionally, we found that speaker-related glottal fold parameters (e.g., pitch), which are not reflected in the amplitude peaks of the speech spectrum, are processed in areas immediately adjacent to primary auditory cortex, i.e., in areas in the auditory hierarchy earlier than STG/STS. Our results point to a network account of speech recognition, in which information about the speech message and the speaker's vocal tract are combined to solve the difficult task of understanding speech from different speakers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Journal of Neuroscience	Publication Date: Jan 13, 2010
Citations: 88	License type: CC BY-NC-SA 4.0

R Discovery Prime

R Discovery Prime

How the human brain recognizes speech in the context of changing speakers.

Abstract

Talk to us

Similar Papers

More From: The Journal of Neuroscience

Lead the way for us

Similar Papers

Hemispheric lateralization of linguistic prosody recognition in comparison to speech and speaker recognition
Jens Kreitewolf ... Katharina Von Kriegstein
NeuroImage | VOL. 102
Jens Kreitewolf, et. al.Jens Kreitewolf ... Katharina Von Kriegstein
01 Aug 2014
NeuroImage | VOL. 102

A neural mechanism for recognizing speech spoken by different speakers
Jens Kreitewolf ... Katharina Von Kriegstein
NeuroImage | VOL. 91
Jens Kreitewolf, et. al.Jens Kreitewolf ... Katharina Von Kriegstein
13 Jan 2014
NeuroImage | VOL. 91

Genetic Algorithm for Combined Speaker and Speech Recognition using Deep Neural Networks
Gurpreet Kaur ... Mohit Srivastava
Journal of Telecommunications and Information Technology | VOL. 2
Gurpreet Kaur, et. al.Gurpreet Kaur ... Mohit Srivastava
29 Jun 2018
Journal of Telecommunications and Information Technology | VOL. 2

Fifty years of progress in speech and speaker recognition
Sadaoki Furui
The Journal of the Acoustical Society of America | VOL. 116
Sadaoki FuruiSadaoki Furui
01 Oct 2004
The Journal of the Acoustical Society of America | VOL. 116

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

How the human brain recognizes speech in the context of changing speakers.

Abstract

Talk to us

Similar Papers

More From: The Journal of Neuroscience