Training Recurrent Neural Networks Research Articles

Depression is a common illness worldwide with potentially severe implications. Early identification of depressive symptoms is a crucial first step towards assessment, intervention, and relapse prevention. With an increase in data sets with relevance for depression, and the advancement of machine learning, there is a potential to develop intelligent systems to detect symptoms of depression in written material. This work proposes an efficient approach using Long Short-Term Memory (LSTM)-based Recurrent Neural Network (RNN) to identify texts describing self-perceived symptoms of depression. The approach is applied on a large dataset from a public online information channel for young people in Norway. The dataset consists of youth’s own text-based questions on this information channel. Features are then provided from a one-hot process on robust features extracted from the reflection of possible symptoms of depression pre-defined by medical and psychological experts. The features are better than conventional approaches, which are mostly based on the word frequencies (i.e., some topmost frequent words are chosen as features from the whole text dataset and applied to model the underlying events in any text message) rather than symptoms. Then, a deep learning approach is applied (i.e., RNN) to train the time-sequential features discriminating texts describing depression symptoms from posts with no such descriptions (non-depression posts). Finally, the trained RNN is used to automatically predict depression posts. The system is compared against conventional approaches where it achieved superior performance than others. The linear discriminant space clearly reveals the robustness of the features by generating better clustering than other traditional features. Besides, since the features are based on the possible symptoms of depression, the system may generate meaningful explanations of the decision from machine learning models using an explainable Artificial Intelligence (XAI) algorithm called Local Interpretable Model-Agnostic Explanations (LIME). The proposed depression symptom feature-based approach shows superior performance compared to the traditional general word frequency-based approaches where frequency of the features gets more importance than the specific symptoms of depression. Although the proposed approach is applied on a Norwegian dataset, a similar robust approach can be applied on other depression datasets developed in other languages with proper annotations and symptom-based feature extraction. Thus, the depression prediction approach can be adopted to contribute to develop better mental health care technologies such as intelligent chatbots.

Read full abstract

Many practical materials are multicrystalline to contain complex microstructures and crystal defects. Owing to its complexity, universal guideline regarding ideal microstructures to maximize macroscopic performance has not been fully established. Therefore, we attempt to pioneer “multicrystalline informatics” through collaboration of experiments, theory, computation, and machine learning to show how we can obtain high-performance multicrystalline materials. We employ silicon as a model material, and prepare various useful machine learning tools for efficient materials development.One example is to predict distribution of crystal orientations in a large sample (e.g. whole 6-inch wafer) from optical images. We started with surface treatment of multicrystalline silicon wafers so that the reflection become sensitive to the crystal orientations. Multiple optical images were captured from each wafer with white-light illumination from different orientations. In addition, orientations of each crystal grain for a couple of wafers were measured by X-ray Laue scanner. The obtained data sets of reflection patterns and crystal orientations were used for training, validation, and test of a long short-term memory recurrent neural network. The trained model was applied to a test wafer to contain ~1,000 crystal grains, and the median of the estimated errors was ~3 degree.Another example is to predict probability of generation of dislocation clusters. For this purpose, a large quantity of photoluminescence (PL) images was collected from wafers obtained from a same ingot. Three-dimensional reconstruction of PL images after extracting dislocation clusters by image processing successfully visualized distribution of dislocation clusters. Therefore, we can collect many small PL images as positive samples to act as the source of dislocations as well as negative samples. The obtained data sets were used for transfer learning of pre-trained convolutional neural network for image classification. The final output layer was changed to contain two outputs if the input image acts as the source of dislocations or not, and weights for the last edges were determined using the data sets. The network was applied to test wafers so that we can visualize probability distribution of generation of dislocation clusters. A part of sources was used for multiscale structural characterizations to clarify microscopic origins.In addition, we are working on various machine learning models and statistical analysis for multicrystalline materials. Integration of such models is believed to be beneficial to show how we can maximize macroscopic performance of multicrystalline materials.This work was supported by JST/CREST, Grant No. JPMJCR17J1 (2017-2023).

Read full abstract

Training Recurrent Neural Networks Research Articles

Related Topics

Articles published on Training Recurrent Neural Networks

Task-Aware Verifiable RNN-Based Policies for Partially Observable Markov Decision Processes

Machine learning astrophysics from 21 cm lightcones: impact of network architectures and signal contamination

Multitask neural networks for predicting bladder pressure with time series data

Evaluation of Recurrent Neural Network Models for Parkinson's Disease Classification Using Drawing Data.

Imaging subsurface orebodies with airborne electromagnetic data using a recurrent neural network

Efficient Kinect Sensor-based Kurdish Sign Language Recognition Using Echo System Network

Multi-time-scale input approaches for hourly-scale rainfall–runoff modeling based on recurrent neural networks

Learning various length dependence by dual recurrent neural networks

Emergence of prefrontal neuron maturation properties by training recurrent neural networks in cognitive tasks

Diagnosis of Heart Disease Using Feature Selection Methods Based On Recurrent Fuzzy Neural Networks

Faster Region-Convolutional Neural network oriented feature learning with optimal trained Recurrent Neural Network for bone age assessment for pediatrics

Deep learning for prediction of depressive symptoms in a large textual dataset

Short-term memory by transient oscillatory dynamics in recurrent neural networks

Statistical Machine Learning in Model Predictive Control of Nonlinear Processes

RNN-Based Counterfactual Prediction, With an Application to Homestead Policy and Public Schooling

Online Fall Detection Using Recurrent Neural Networks on Smart Wearable Devices

(Invited) Application of Machine Learning for High-Performance Multicrystalline Materials

Optimization of a Fuzzy Automatic Voltage Controller Using Real-Time Recurrent Learning

A recurrent neural network framework with an adaptive training strategy for long-time predictive modeling of nonlinear dynamical systems

Encoding-based memory for recurrent neural networks

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Training Recurrent Neural Networks Research Articles

Related Topics

Articles published on Training Recurrent Neural Networks

Task-Aware Verifiable RNN-Based Policies for Partially Observable Markov Decision Processes

Machine learning astrophysics from 21 cm lightcones: impact of network architectures and signal contamination

Multitask neural networks for predicting bladder pressure with time series data

Evaluation of Recurrent Neural Network Models for Parkinson's Disease Classification Using Drawing Data.

Imaging subsurface orebodies with airborne electromagnetic data using a recurrent neural network

Efficient Kinect Sensor-based Kurdish Sign Language Recognition Using Echo System Network

Multi-time-scale input approaches for hourly-scale rainfall–runoff modeling based on recurrent neural networks

Learning various length dependence by dual recurrent neural networks

Emergence of prefrontal neuron maturation properties by training recurrent neural networks in cognitive tasks

Diagnosis of Heart Disease Using Feature Selection Methods Based On Recurrent Fuzzy Neural Networks

Faster Region-Convolutional Neural network oriented feature learning with optimal trained Recurrent Neural Network for bone age assessment for pediatrics

Deep learning for prediction of depressive symptoms in a large textual dataset

Short-term memory by transient oscillatory dynamics in recurrent neural networks

Statistical Machine Learning in Model Predictive Control of Nonlinear Processes

RNN-Based Counterfactual Prediction, With an Application to Homestead Policy and Public Schooling

Online Fall Detection Using Recurrent Neural Networks on Smart Wearable Devices

(Invited) Application of Machine Learning for High-Performance Multicrystalline Materials

Optimization of a Fuzzy Automatic Voltage Controller Using Real-Time Recurrent Learning

A recurrent neural network framework with an adaptive training strategy for long-time predictive modeling of nonlinear dynamical systems

Encoding-based memory for recurrent neural networks