Shared functional specialization in transformer-based language models and the human brain

Sreejan Kumar,Theodore R Sumers,Takateru Yamakoshi,Ariel Goldstein,Uri Hasson,Kenneth A Norman,Thomas L Griffiths,Robert D Hawkins,Samuel A Nastase

doi:10.1038/s41467-024-49173-5

Abstract

When processing language, the brain is thought to deploy specialized computations to construct meaning from complex linguistic structures. Recently, artificial neural networks based on the Transformer architecture have revolutionized the field of natural language processing. Transformers integrate contextual information across words via structured circuit computations. Prior work has focused on the internal representations (“embeddings”) generated by these circuits. In this paper, we instead analyze the circuit computations directly: we deconstruct these computations into the functionally-specialized “transformations” that integrate contextual information across words. Using functional MRI data acquired while participants listened to naturalistic stories, we first verify that the transformations account for considerable variance in brain activity across the cortical language network. We then demonstrate that the emergent computations performed by individual, functionally-specialized “attention heads” differentially predict brain activity in specific cortical regions. These heads fall along gradients corresponding to different layers and context lengths in a low-dimensional cortical space.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature Communications	Publication Date: Jun 29, 2024
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Shared functional specialization in transformer-based language models and the human brain

Abstract

Talk to us

Similar Papers

More From: Nature Communications

Lead the way for us

Similar Papers

Topic-Controlled Text Generation
Cansen Caglayan ... Murat Karakaya
-
Cansen Caglayan, et. al.Cansen Caglayan ... Murat Karakaya
15 Sep 2021
15 Sep 2021

Applications of transformer-based language models in bioinformatics: a survey.
Shuang Zhang ... Wanwen Zeng
Bioinformatics Advances | VOL. 3
Shuang Zhang, et. al.Shuang Zhang ... Wanwen Zeng
05 Jan 2023
Bioinformatics Advances | VOL. 3

Incorporating Residual and Normalization Layers into Analysis of Masked Language Models
...
-
, et. al. ...
21 Oct 2021
21 Oct 2021

Incorporating Residual and Normalization Layers into Analysis of Masked Language Models
Goro Kobayashi ... Sho Yokoi
-
Goro Kobayashi, et. al.Goro Kobayashi ... Sho Yokoi
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Shared functional specialization in transformer-based language models and the human brain

Abstract

Talk to us

Similar Papers

More From: Nature Communications