Text Graphs Research Articles

MotivationThe majority of biomedical knowledge is stored in structured databases or as unstructured text in scientific publications. This vast amount of information has led to numerous machine learning-based biological applications using either text through natural language processing (NLP) or structured data through knowledge graph embedding models. However, representations based on a single modality are inherently limited.ResultsTo generate better representations of biological knowledge, we propose STonKGs, a Sophisticated Transformer trained on biomedical text and Knowledge Graphs (KGs). This multimodal Transformer uses combined input sequences of structured information from KGs and unstructured text data from biomedical literature to learn joint representations in a shared embedding space. First, we pre-trained STonKGs on a knowledge base assembled by the Integrated Network and Dynamical Reasoning Assembler consisting of millions of text-triple pairs extracted from biomedical literature by multiple NLP systems. Then, we benchmarked STonKGs against three baseline models trained on either one of the modalities (i.e. text or KG) across eight different classification tasks, each corresponding to a different biological application. Our results demonstrate that STonKGs outperforms both baselines, especially on the more challenging tasks with respect to the number of classes, improving upon the F1-score of the best baseline by up to 0.084 (i.e. from 0.881 to 0.965). Finally, our pre-trained model as well as the model architecture can be adapted to various other transfer learning applications.Availability and implementationWe make the source code and the Python package of STonKGs available at GitHub (https://github.com/stonkgs/stonkgs) and PyPI (https://pypi.org/project/stonkgs/). The pre-trained STonKGs models and the task-specific classification models are respectively available at https://huggingface.co/stonkgs/stonkgs-150k and https://zenodo.org/communities/stonkgs.Supplementary information Supplementary data are available at Bioinformatics online.

In 2017 the fundamental scientific-reference multidisciplinary Ecological Atlas of Russia was published (Ecological …, 2017; Kasimov et al., 2018). The Atlas reflects the ecological situation at the beginning of the 21st century. The Geography Department of Lomonosov Moscow State University with the participation of more than 30 leading departmental and scientific organizations contributed to the Atlas. The Atlas represents a wide range of ecological-geographical spatio-temporal characteristics of the territory of Russia and its regions. The six structural sections of the Atlas contain more than 30 maps showing vegetation in different aspects: Introduction; Natural conditions for the formation of an ecological situation; The impact of economic activity on the environment; Natural and technological hazards; Modern ecological situation; Environmental monitoring and nature conservation. The scale of the base maps of Russia is 1 : 20 000 000, others — 1 : 30 000 000 and smaller. Maps are accompanied by text descriptions, graphs and slides. More than 20 % of the Atlas volume is given to satellite imagery — an effective, in some cases unique, means of visualizing environmental information. The description of the maps is given in the following sequence: inventory maps — estimation maps. The Introduction “Russia on the Ecological Map of the World” analyzes the ecological role of Russia on a planetary scale and assesses the contribution to the observed degradation of the planet’s environment. The text reveals the role of vegetation in the biosphere and its environmental functions. In the section “Natural Conditions for the Formation of an Ecological Situation” there is a photomap “Vegetation Cover” created using MODIS images. The 18 divisions of vegetation are grouped in the legend into five large typological complexes — Forests, Grass and shrub vegetation, Tundra, Wetland complexes, Other vegetation. Mires are represented by three maps in 1 : 30 000 000 scale: “Mires and wetlands” (Fig. 1), “Types of mires”, “Afforestation of mires”. The key topic ‒ “Ecological functions of the vegetation cover” — has been made as a separate map (Volkova, Fedorova, 1995). Large proportion of the section is devoted to the productivity of the vegetation cover (3 maps), the most important indicator controlling the stability of geosystems (Fig. 2). In the section “Impact of economic activity on the environment”, vegetation is displayed through the main object of economic activity — forests and factors that determine the current state of forests: deforestation, derivative forests, forest burnability, and frequency of forest fires. The cumulative effect of their impact is presented on the map “Forest disturbance” (Fig. 3). The consequences of adverse effects on biota are presented on the integrated map “Degradation of the plant and animal world” at a scale of 1 : 20 000 000. The maps of poisonous plants and plants-allergens in 1 : 30 000 000 scale (Dikareva et al., 2017) were made for the first time; they are placed in the section “Natural and technological hazards” (Fig. 4). The map “Ecological state of natural fodder lands” (1 : 20 000 000 s.) is included in the group of maps characterizing the ecological state of individual natural components (surface and underground waters, soils, lands, etc.). The final section of the Atlas “Environmental monitoring and nature conservation” contains the maps “Nature Protection”, “Specially Protected Natural Territories”, “Especially Valuable Wetlands” and maps of the Red Book species of plants. The section concludes with the topic “Environmental Benefits of the Russian Federation and Their Capitalization. Russia is in the market of ecosystem services”. It complements the Introduction chapter, focusing on the huge role of the territory of Russia as a natural regulator of the global environment and the need to capitalize its environmental benefits.

Text Graphs Research Articles

Articles published on Text Graphs

An effective multi-modal adaptive contextual feature information fusion method for Chinese long text classification

Graph-based Text Classification by Contrastive Learning with Text-level Graph Augmentation

A psychological evaluation method incorporating noisy label correction mechanism

Sparse graph matching network for temporal language localization in videos

An Effective Knowledgeable Label-Aware Approach for Sentential Relation Extraction

SUSIE: Pharmaceutical CMC ontology-based information extraction for drug development using machine learning

Traditional Chinese medicine for age-related macular degeneration: A clinical evidence map between 2000 and 2022

Document-level relation extraction based on sememe knowledge-enhanced abstract meaning representation and reasoning

Enhancing Text Classification by Graph Neural Networks With Multi-Granular Topic-Aware Graph

The Study on the Text Classification Based on Graph Convolutional Network and BiLSTM

Contrastive Graph Convolutional Networks with adaptive augmentation for text classification

Conducting a representative national randomized control trial of tailored clinical decision support for nurses remotely: Methods and implications

Word‐Problem Performance Differences by Schema: A Comparison of Students with and without Mathematics Difficulty

STonKGs: a sophisticated transformer trained on biomedical text and knowledge graphs.

Graph Fusion Network for Text Classification

MecCog: a knowledge representation framework for genetic disease mechanism.

An Investigation of Physics Teachers’ Multiple Representation Ability on Newton’s Law Concept

Traditional Chinese Medicine for Essential Hypertension: A Clinical Evidence Map

Растительность: отображение в новом «Экологическом атласе России»

HRGRN: A Graph Search-Empowered Integrative Database of Arabidopsis Signaling Transduction, Metabolism and Gene Regulation Networks.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Text Graphs Research Articles

Articles published on Text Graphs

An effective multi-modal adaptive contextual feature information fusion method for Chinese long text classification

Graph-based Text Classification by Contrastive Learning with Text-level Graph Augmentation

A psychological evaluation method incorporating noisy label correction mechanism

Sparse graph matching network for temporal language localization in videos

An Effective Knowledgeable Label-Aware Approach for Sentential Relation Extraction

SUSIE: Pharmaceutical CMC ontology-based information extraction for drug development using machine learning

Traditional Chinese medicine for age-related macular degeneration: A clinical evidence map between 2000 and 2022

Document-level relation extraction based on sememe knowledge-enhanced abstract meaning representation and reasoning

Enhancing Text Classification by Graph Neural Networks With Multi-Granular Topic-Aware Graph

The Study on the Text Classification Based on Graph Convolutional Network and BiLSTM

Contrastive Graph Convolutional Networks with adaptive augmentation for text classification

Conducting a representative national randomized control trial of tailored clinical decision support for nurses remotely: Methods and implications

Word‐Problem Performance Differences by Schema: A Comparison of Students with and without Mathematics Difficulty

STonKGs: a sophisticated transformer trained on biomedical text and knowledge graphs.

Graph Fusion Network for Text Classification

MecCog: a knowledge representation framework for genetic disease mechanism.

An Investigation of Physics Teachers’ Multiple Representation Ability on Newton’s Law Concept

Traditional Chinese Medicine for Essential Hypertension: A Clinical Evidence Map

Растительность: отображение в новом «Экологическом атласе России»

HRGRN: A Graph Search-Empowered Integrative Database of Arabidopsis Signaling Transduction, Metabolism and Gene Regulation Networks.