Training Deep Neural Networks Research Articles

Decoding speech from brain activity can enable communication for individuals with speech disorders. Deep neural networks have shown great potential for speech decoding applications. However, the limited availability of large datasets containing neural recordings from speech-impaired subjects poses a challenge. Leveraging data from healthy participants can mitigate this limitation and expedite the development of speech neuroprostheses while minimizing the need for patient-specific training data. Approach. In this study, we collected a substantial dataset consisting of recordings from 56 healthy participants using 64 EEG channels. Multiple neural networks were trained to classify perceived sentences in the Spanish language using subject-independent, mixed-subjects, and fine-tuning approaches. The dataset has been made publicly available to foster further research in this area.Main results. Our results demonstrate a remarkable level of accuracy in distinguishing sentence identity across 30 classes, showcasing the feasibility of training Deep Neural Networks (DNNs) to decode sentence identity from perceived speech using EEG. Notably, the subject-independent approach rendered accuracy comparable to the mixed-subjects approach, although with higher variability among subjects. Additionally, our fine-tuning approach yielded even higher accuracy, indicating an improved capability to adapt to individual subject characteristics, which enhances performance. This suggests that DNNs have effectively learned to decode universal features of brain activity across individuals while also being adaptable to specific participant data. Furthermore, our analyses indicate that EEGNet and DeepConvNet exhibit comparable performance, outperforming ShallowConvNet for sentence identity decoding. Finally, our Grad-CAM visualization analysis identifies key areas influencing the network's predictions, offering valuable insights into the neural processes underlying language perception and comprehension.Significance. These findings advance our understanding of EEG-based speech perception decoding and hold promise for the development of speech neuroprostheses, particularly in scenarios where subjects cannot provide their own training data.

Despite significant success of deep learning in object detection tasks, the standard training of deep neural networks requires access to a substantial quantity of annotated images across all classes. Data annotation is an arduous and time-consuming endeavor, particularly when dealing with infrequent objects. Few-shot object detection (FSOD) methods have emerged as a solution to the limitations of classic object detection approaches based on deep learning. FSOD methods demonstrate remarkable performance by achieving robust object detection using a significantly smaller amount of training data. A challenge for FSOD is that instances from novel classes that do not belong to the fixed set of training classes appear in the background and the base model may pick them up as potential objects. These objects behave similarly to label noise because they are classified as one of the training dataset classes, leading to FSOD performance degradation. We develop a semi-supervised algorithm to detect and then utilize these unlabeled novel objects as positive samples during the FSOD training stage to improve FSOD performance. Specifically, we develop a hierarchical ternary classification region proposal network (HTRPN) to localize the potential unlabeled novel objects and assign them new objectness labels to distinguish these objects from the base training dataset classes. Our improved hierarchical sampling strategy for the region proposal network (RPN) also boosts the perception ability of the object detection model for large objects. We test our approach and COCO and PASCAL VOC baselines that are commonly used in FSOD literature. Our experimental results indicate that our method is effective and outperforms the existing state-of-the-art (SOTA) FSOD methods. Our implementation is provided as a supplement to support reproducibility of the results https://github.com/zshanggu/HTRPN.11Early partial results of this work is presented in the 2023 ICCV Workshop on Visual Continual Learning (Shangguan and Rostami, 2023).

Training Deep Neural Networks Research Articles

Related Topics

Articles published on Training Deep Neural Networks

Stochastic collapse: how gradient noise attracts SGD dynamics towards simpler subnetworks*

Identification of perceived sentences using deep neural networks in EEG.

Exploiting Compress Sensing in Training of Deep Neural Network for Self-Noise Cancellation in Underwater Acoustics

Impact of Mask Type as Training Target for Speech Intelligibility and Quality in Cochlear-Implant Noise Reduction

Simple integrated circuit reverse-engineering with deep learning: A proof of concept for automating die-polygon-capturing

Deep learning as Ricci flow.

Multi-fault diagnosis of industrial rotating machines using advanced sliding window and WSST-CNN

Sentinel-1 SAR and deep learning for peatland fire detection in Ireland

Soft sensor for melting flow rate prediction based on data-enhanced classification method

Development of AI-assisted microscopy frameworks through realistic simulation with pySTED

Constructed encoded data based coded distributed DNN training for edge computing scenario

Activated biochar production from young coconut waste (Cocos nucifera) as bioadsorbent: a pathway through Artificial Neural Network (ANN) optimization.

Deep-neural-network model for predicting ground motion parameters using earthquake horizontal-to-vertical spectral ratios

Tuning parameters of deep neural network training algorithms pays off: a computational study

Nondeterministic Features in Deep neural network design, training and inference

An approach of 2D convolutional neural network–based seismic data fault interpretation with linear annotation and pixel thinking

Enhancing early Parkinson’s disease detection through multimodal deep learning and explainable AI: insights from the PPMI database

MixTrain: accelerating DNN training via input mixing.

Improved region proposal network for enhanced few-shot object detection

Synthetic ground motions in heterogeneous geologies from various sources: the HEMEWS-3D database

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Training Deep Neural Networks Research Articles

Related Topics

Articles published on Training Deep Neural Networks

Stochastic collapse: how gradient noise attracts SGD dynamics towards simpler subnetworks*

Identification of perceived sentences using deep neural networks in EEG.

Exploiting Compress Sensing in Training of Deep Neural Network for Self-Noise Cancellation in Underwater Acoustics

Impact of Mask Type as Training Target for Speech Intelligibility and Quality in Cochlear-Implant Noise Reduction

Simple integrated circuit reverse-engineering with deep learning: A proof of concept for automating die-polygon-capturing

Deep learning as Ricci flow.

Multi-fault diagnosis of industrial rotating machines using advanced sliding window and WSST-CNN

Sentinel-1 SAR and deep learning for peatland fire detection in Ireland

Soft sensor for melting flow rate prediction based on data-enhanced classification method

Development of AI-assisted microscopy frameworks through realistic simulation with pySTED

Constructed encoded data based coded distributed DNN training for edge computing scenario

Activated biochar production from young coconut waste (Cocos nucifera) as bioadsorbent: a pathway through Artificial Neural Network (ANN) optimization.

Deep-neural-network model for predicting ground motion parameters using earthquake horizontal-to-vertical spectral ratios

Tuning parameters of deep neural network training algorithms pays off: a computational study

Nondeterministic Features in Deep neural network design, training and inference

An approach of 2D convolutional neural network–based seismic data fault interpretation with linear annotation and pixel thinking

Enhancing early Parkinson’s disease detection through multimodal deep learning and explainable AI: insights from the PPMI database

MixTrain: accelerating DNN training via input mixing.

Improved region proposal network for enhanced few-shot object detection

Synthetic ground motions in heterogeneous geologies from various sources: the HEMEWS-3D database