ADT Network: A Novel Nonlinear Method for Decoding Speech Envelopes From EEG Signals.

Ruixiang Liu,Chang Liu,Dan Cui,Huan Zhang,Xinmeng Xu,Yuxin Duan,Yihu Chao,Xianzheng Sha,Limin Sun,Xiulan Ma,Shuo Li,Shijie Chang

doi:10.1177/23312165241282872

Abstract

Decoding speech envelopes from electroencephalogram (EEG) signals holds potential as a research tool for objectively assessing auditory processing, which could contribute to future developments in hearing loss diagnosis. However, current methods struggle to meet both high accuracy and interpretability. We propose a deep learning model called the auditory decoding transformer (ADT) network for speech envelope reconstruction from EEG signals to address these issues. The ADT network uses spatio-temporal convolution for feature extraction, followed by a transformer decoder to decode the speech envelopes. Through anticausal masking, the ADT considers only the current and future EEG features to match the natural relationship of speech and EEG. Performance evaluation shows that the ADT network achieves average reconstruction scores of 0.168 and 0.167 on the SparrKULee and DTU datasets, respectively, rivaling those of other nonlinear models. Furthermore, by visualizing the weights of the spatio-temporal convolution layer as time-domain filters and brain topographies, combined with an ablation study of the temporal convolution kernels, we analyze the behavioral patterns of the ADT network in decoding speech envelopes. The results indicate that low- (0.5-8 Hz) and high-frequency (14-32 Hz) EEG signals are more critical for envelope reconstruction and that the active brain regions are primarily distributed bilaterally in the auditory cortex, consistent with previous research. Visualization of attention scores further validated previous research. In summary, the ADT network balances high performance and interpretability, making it a promising tool for studying neural speech envelope tracking.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ADT Network: A Novel Nonlinear Method for Decoding Speech Envelopes From EEG Signals.

Abstract

Talk to us

Similar Papers

More From: Trends in hearing

Lead the way for us

Journal: Trends in hearing	Publication Date: Jan 1, 2024
License type: CC BY-NC 4.0

Similar Papers

Continuous speech with pauses inserted between words increases cortical tracking of speech envelope.
Suwijak Deoisres ... Steven L Bell
PLOS ONE | VOL. 18
Suwijak Deoisres, et. al.Suwijak Deoisres ... Steven L Bell
27 Jul 2023
PLOS ONE | VOL. 18

Human Cortical Responses to the Speech Envelope
Steven J Aiken ... Terence W Picton
Ear & Hearing | VOL. 29
Steven J Aiken, et. al.Steven J Aiken ... Terence W Picton
01 Apr 2008
Ear & Hearing | VOL. 29

Synchronization patterns reveal neuronal coding of working memory content.
Fahimeh Mamashli ... Aapo Nummenmaa
Cell Reports | VOL. 36
Fahimeh Mamashli, et. al.Fahimeh Mamashli ... Aapo Nummenmaa
01 Aug 2021
Cell Reports | VOL. 36

The Tracking of Speech Envelope in the Human Cortex
Jan Kubanek ... Gerwin Schalk
PLoS ONE | VOL. 8
Jan Kubanek, et. al.Jan Kubanek ... Gerwin Schalk
10 Jan 2013
PLoS ONE | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ADT Network: A Novel Nonlinear Method for Decoding Speech Envelopes From EEG Signals.

Abstract

Talk to us

Similar Papers

More From: Trends in hearing