Vowel speech recognition from rat electroencephalography using long short-term memory neural network.

Jinsil Ham,Hyun-Joon Yoo,Jongin Kim,Boreom Lee

doi:10.1371/journal.pone.0270405

Jinsil Ham, Hyun-Joon Yoo + Show 2 more

Open Access

https://doi.org/10.1371/journal.pone.0270405

Copy DOI

Abstract

Over the years, considerable research has been conducted to investigate the mechanisms of speech perception and recognition. Electroencephalography (EEG) is a powerful tool for identifying brain activity; therefore, it has been widely used to determine the neural basis of speech recognition. In particular, for the classification of speech recognition, deep learning-based approaches are in the spotlight because they can automatically learn and extract representative features through end-to-end learning. This study aimed to identify particular components that are potentially related to phoneme representation in the rat brain and to discriminate brain activity for each vowel stimulus on a single-trial basis using a bidirectional long short-term memory (BiLSTM) network and classical machine learning methods. Nineteen male Sprague-Dawley rats subjected to microelectrode implantation surgery to record EEG signals from the bilateral anterior auditory fields were used. Five different vowel speech stimuli were chosen, /a/, /e/, /i/, /o/, and /u/, which have highly different formant frequencies. EEG recorded under randomly given vowel stimuli was minimally preprocessed and normalized by a z-score transformation to be used as input for the classification of speech recognition. The BiLSTM network showed the best performance among the classifiers by achieving an overall accuracy, f1-score, and Cohen’s κ values of 75.18%, 0.75, and 0.68, respectively, using a 10-fold cross-validation approach. These results indicate that LSTM layers can effectively model sequential data, such as EEG; hence, informative features can be derived through BiLSTM trained with end-to-end learning without any additional hand-crafted feature extraction methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PloS one	Publication Date: Jun 23, 2022
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Vowel speech recognition from rat electroencephalography using long short-term memory neural network.

Abstract

Talk to us

Similar Papers

More From: PloS one

Lead the way for us

Similar Papers

An ensemble method to forecast 24-h ahead solar irradiance using wavelet decomposition and BiLSTM deep learning network.
Pardeep Singla ... Manoj Duhan
Earth Science Informatics | VOL. 15
Pardeep Singla, et. al.Pardeep Singla ... Manoj Duhan
17 Nov 2021
Earth Science Informatics | VOL. 15

Post Text Processing of Chinese Speech Recognition Based on Bidirectional LSTM Networks and CRF
Li Yang ... Ying Li
Electronics | VOL. 8
Li Yang, et. al.Li Yang ... Ying Li
31 Oct 2019
Electronics | VOL. 8

Water quality assessment using Bi-LSTM and computational fluid dynamics (CFD) techniques
Wafa F Alfwzan ... Ibrahim Saleem Alharbi
Alexandria Engineering Journal | VOL. 97
Wafa F Alfwzan, et. al.Wafa F Alfwzan ... Ibrahim Saleem Alharbi
25 Apr 2024
Alexandria Engineering Journal | VOL. 97

Automatic gear shift strategy for manual transmission of mine truck based on Bi-LSTM network
Liyong Wang ... Min Xie
Expert Systems With Applications | VOL. 209
Liyong Wang, et. al.Liyong Wang ... Min Xie
03 Aug 2022
Expert Systems With Applications | VOL. 209

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Vowel speech recognition from rat electroencephalography using long short-term memory neural network.

Abstract

Talk to us

Similar Papers

More From: PloS one