ERLKG: Entity Representation Learning and Knowledge Graph based association analysis of COVID-19 through mining of unstructured biomedical corpora

Sayantan Basu,Sinchani Chakraborty,Atif Hassan,Ashish Anand,Sana Siddique

doi:10.18653/v1/2020.sdp-1.15

Abstract

We introduce a generic, human-out-of-the-loop pipeline, ERLKG, to perform rapid association analysis of any biomedical entity with other existing entities from a corpora of the same domain. Our pipeline consists of a Knowledge Graph (KG) created from the Open Source CORD-19 dataset by fully automating the procedure of information extraction using SciBERT. The best latent entity representations are then found by benchnmarking different KG embedding techniques on the task of link prediction using a Graph Convolution Network Auto Encoder (GCN-AE). We demonstrate the utility of ERLKG with respect to COVID-19 through multiple qualitative evaluations. Due to the lack of a gold standard, we propose a relatively large intrinsic evaluation dataset for COVID-19 and use it for validating the top two performing KG embedding techniques. We find TransD to be the best performing KG embedding technique with Pearson and Spearman correlation scores of 0.4348 and 0.4570 respectively. We demonstrate that a considerable number of ERLKG’s top protein, chemical and disease predictions are currently in consideration for COVID-19 related research.

Highlights

COVID-19 is a global epidemic with a considerable fatality rate and a high transmission rate, affecting millions of people world-wide since its outbreak.1The search for treatments and possible cures for the novel Coronavirus (Wang et al, 2020b) has led to an exponential increase in scientific publications, but the challenge lies in effectively processing, integrating and leveraging related sources of information.https://www.who.int/docs/defaultsource/coronaviruse/situation-reports/20200811-covid19-sitrep-204.pdf?sfvrsn=1f4383dd 2Rapid and effective utilization of literature during times of pandemic such as COVID-19 is of utmost importance in combating the disease
We introduce a fully automated generic pipeline consisting of an Information Extraction (IE) system followed by Knowledge Graph construction
Such entities are well explored in existing literature and an analysis of their relatedness to COVID-19 is provided by leveraging the CORD-19 Open Research

Summary

Introduction

Rapid and effective utilization of literature during times of pandemic such as COVID-19 is of utmost importance in combating the disease. We introduce a fully automated generic pipeline consisting of an Information Extraction (IE) system followed by Knowledge Graph construction. The IE module uses SciBERT (Beltagy et al, 2019) for performing Named Entity Recognition (NER) and Relationship Extraction (RE). The entire entity extraction procedure is fully automated and no human expertise is used. We focus on the task of association analysis of essential biomedical entities, namely, proteins, diseases and, chemicals. Such entities are well explored in existing literature and an analysis of their relatedness to COVID-19 is provided by leveraging the CORD-19 Open Research

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ERLKG: Entity Representation Learning and Knowledge Graph based association analysis of COVID-19 through mining of unstructured biomedical corpora

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2020
Citations: 30	License type: cc-by

Similar Papers

A Co-Embedding Model with Variational Auto-Encoder for Knowledge Graphs
Luodi Xie ... Huimin Huang
Applied Sciences | VOL. 12
Luodi Xie, et. al.Luodi Xie ... Huimin Huang
12 Jan 2022
Applied Sciences | VOL. 12

Utilizing Textual Information in Knowledge Graph Embedding: A Survey of Methods and Applications
Fengyuan Lu ... Peijin Cong
IEEE Access | VOL. 8
Fengyuan Lu, et. al.Fengyuan Lu ... Peijin Cong
01 Jan 2020
IEEE Access | VOL. 8

A Knowledge Graph Embedding Approach for Metaphor Processing
Wei Song ... Ting Liu
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29
Wei Song, et. al.Wei Song ... Ting Liu
15 Dec 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29

Learning knowledge graph embedding with a bi-directional relation encoding network and a convolutional autoencoder decoding network
Kairong Hu ... Hai Liu
Neural Computing and Applications | VOL. 33
Kairong Hu, et. al.Kairong Hu ... Hai Liu
07 Jan 2021
Neural Computing and Applications | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ERLKG: Entity Representation Learning and Knowledge Graph based association analysis of COVID-19 through mining of unstructured biomedical corpora

Abstract

Highlights

Summary

Talk to us

Similar Papers