Deep Learning Community Research Articles

Species descriptions are stored in textual form in corpora such as in floras and faunas, but this large amount of information cannot be used directly by algorithms, nor can it be linked to other data sources. The production of knowledge bases expressing structured data can benefit from collaborative and easy-to-use platforms like Xper3 (Vignes-Lebbe et al. 2017, Kerner and Vignes 2019, Saucède et al. 2021) but is very time-consuming at the human level. It is therefore mandatory for this task to make the information contained in species descriptions measurable and compatible with computer techniques. One of the most used data structures on the web and by the deep learning community is the triplet structure. Each piece of information is represented by a set of 3 elements (subject, predicate, object). One of the first steps towards species information accessibility is developing a text-to-triplet model, also known as text-to-graph, for monograph descriptions. In this work, we developed NEARSIDE, a text-to-graph model adapted to biology corpora to create normalized morphological characteristic knowledge bases for species descriptions. In Natural Language Processing, deep learning models have proven to be effective in extracting knowledge from open domain corpora (Lample et al. 2016, Sutskever et al. 2014), especially since the emergence of attention-based models (Devlin et al. 2019b, Devlin et al. 2019a). Several works have been made also on biomedical corpora (Fries et al. 2017,Cho and Lee 2019). In our case, we propose a model adapted to floras. Fully supervised deep learning models require a large amount of annotated data for training, nevertheless, the annotation process for the text-to-triplet task implies an expensive human intervention. Distant supervision is a technique that can be used to reduce this cost. This paradigm uses a small annotated glossary to project classes at the word level on a new complex and longer text (see Fig. 1). Named Entity Recognition (NER) is an Natural Language Processing (NLP) task that consists of extracting and classifying words of interest from a text (Sutskever et al. 2014, Devlin et al. 2019b, Lample et al. 2016), while triplet extraction can be compared to the Relation Extraction task (RE) which consists of extracting the words and the semantic relations between pairs of words. Distantly supervised NER is an often studied subject in the literature in comparison to distantly supervised RE (Liang et al. 2020, Meng et al. 2021) simply because NER is a subtask to RE and distant annotations generation is less expensive for the NER task (see Fig. 2). Our first contribution is creating a distantly annotated species description dataset for Named Entity Recognition with a well-balanced test set that allows us to bypass several biases that can be induced by the distant annotation and that are often observed in NER datasets (Taillé et al. 2021). In this dataset, each word of interest will be classified into one of 15 classes, each class being a specific kind of organ or descriptor. Our second contribution is proposing a distantly supervised model trained on our dataset, since fauna and flora corpora are particularly long and use a very specific technical vocabulary. We develop a context-oriented model adapted to this data by pretraining the language model. Thus the encoder of our model provides contextualized vectors for each extracted word that can be used to measure description similarities between different species. Our model reaches 96% accuracy in named entity classification on the test set. Our third contribution is the triplet construction module that can directly be applied to our model's outputs. This module is based on class dependency rules that are inspired by Xper3’s data representation format (see Fig. 3). Finally, NEARSIDE is an end-to-end structured knowledge extraction framework from unstructured species description corpora, that can be applied to several data sources. Thus making species descriptions from different corpora easily linked, compared and measured.

As the training process of deep neural networks involves expensive computational cost, speeding up the convergence is of great importance. Nesterov’s accelerated gradient (NAG) is one of the most popular accelerated optimizers in the deep learning community, which often exhibits improved convergence performance over gradient descent (GD) in practice. However, theoretical investigations of NAG mainly focus on the convex setting. Since the optimization landscape of the neural network is non-convex, little is known about the convergence and acceleration of NAG. Nowadays, some works make progress towards understanding the convergence of NAG in training over-parameterized neural networks, where the number of the parameters exceeds that of the training instances. Nonetheless, previous studies are limited to the two-layer neural network, which are far from explaining the remarkable success of NAG in optimizing deep neural networks. In this paper, we investigate the convergence of NAG in training two architectures of deep linear networks: deep fully-connected linear neural networks and deep linear ResNets. Based on the over-parameterization regime, we first analyze the residual dynamics induced by the training trajectory of NAG for a deep fully-connected linear neural network under random Gaussian initialization. Our results show that NAG can converge to the global minimum at a (1-O(1/κ))t rate when the width is near-linear in the depth of the network, where t is the number of iterations and κ>1 is a constant depending on the condition number of the feature matrix. Compared to the (1-O(1/κ))t rate of GD, NAG achieves an acceleration over GD. For deep linear ResNets, we utilize the same analytical approach and obtain a similar convergence result, while the width requirement is independent of the depth. To the best of our knowledge, these are the first theoretical guarantees for the convergence and acceleration of NAG in training deep neural networks. Numerical results show the acceleration of NAG compared to GD in terms of iterations. In addition, we conduct experiments to evaluate the effect of the depth on the convergence rate of NAG, which validate our derived conditions of the width. We hope our results may shed light on understanding the optimization behavior of NAG for modern deep neural networks.

Deep Learning Community Research Articles

Related Topics

Articles published on Deep Learning Community

NEARSIDE: Structured kNowledge Extraction frAmework from SpecIes DEscriptions

A convergence analysis of Nesterov’s accelerated gradient method in training deep linear neural networks

Practical Implementation and Challenges of Artificial Intelligence-Driven Electronic Health Record Evaluation: Protected Health Information.

Collective intelligence for deep learning: A survey of recent developments

EEG-GAT: Graph Attention Networks for Classification of Electroencephalogram (EEG) Signals.

Intrusion prevention system using convolutional neural network for wireless sensor network

Unraveling the Hidden Environmental Impacts of AI Solutions for Environment Life Cycle Assessment of AI Solutions

EfficientUNet: Modified encoder‐decoder architecture for the lung segmentation in chest x‐ray images

PBIL for optimizing inception module in convolutional neural networks

TSGB: Target-Selective Gradient Backprop for Probing CNN Visual Saliency.

FVC: An End-to-End Framework Towards Deep Video Compression in Feature Space.

A Trimodel SAR Semisupervised Recognition Method Based on Attention-Augmented Convolutional Networks

Generation of Spacecraft Operations Procedures Using Deep Reinforcement Learning

A novel multimodal fusion network based on a joint-coding model for lane line segmentation

Airborne SAR Autofocus Based on Blurry Imagery Classification

A A Pilot Review of State-of-the-Art Deep Learning Applications to Nuclear Engineering & Technology

Molecular design in drug discovery: a comprehensive review of deep generative models.

Depth Completion and Super-Resolution with Arbitrary Scale Factors for Indoor Scenes.

CondenseNet with exclusive lasso regularization.

Comparison of augmentation and pre-processing for deep learning and chemometric classification of infrared spectra

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Deep Learning Community Research Articles

Related Topics

Articles published on Deep Learning Community

NEARSIDE: Structured kNowledge Extraction frAmework from SpecIes DEscriptions

A convergence analysis of Nesterov’s accelerated gradient method in training deep linear neural networks

Practical Implementation and Challenges of Artificial Intelligence-Driven Electronic Health Record Evaluation: Protected Health Information.

Collective intelligence for deep learning: A survey of recent developments

EEG-GAT: Graph Attention Networks for Classification of Electroencephalogram (EEG) Signals.

Intrusion prevention system using convolutional neural network for wireless sensor network

Unraveling the Hidden Environmental Impacts of AI Solutions for Environment Life Cycle Assessment of AI Solutions

EfficientUNet: Modified encoder‐decoder architecture for the lung segmentation in chest x‐ray images

PBIL for optimizing inception module in convolutional neural networks

TSGB: Target-Selective Gradient Backprop for Probing CNN Visual Saliency.

FVC: An End-to-End Framework Towards Deep Video Compression in Feature Space.

A Trimodel SAR Semisupervised Recognition Method Based on Attention-Augmented Convolutional Networks

Generation of Spacecraft Operations Procedures Using Deep Reinforcement Learning

A novel multimodal fusion network based on a joint-coding model for lane line segmentation

Airborne SAR Autofocus Based on Blurry Imagery Classification

A A Pilot Review of State-of-the-Art Deep Learning Applications to Nuclear Engineering &amp; Technology

Molecular design in drug discovery: a comprehensive review of deep generative models.

Depth Completion and Super-Resolution with Arbitrary Scale Factors for Indoor Scenes.

CondenseNet with exclusive lasso regularization.

Comparison of augmentation and pre-processing for deep learning and chemometric classification of infrared spectra

A A Pilot Review of State-of-the-Art Deep Learning Applications to Nuclear Engineering & Technology