Deep learning of mutation-gene-drug relations from the literature

Kyubum Lee, Sunwon Lee, Seongsoon Kim, Wonho Shin, Sungjoon Park, Aik Choon Tan, Sunkyu Kim, Yonghwa Choi, Jaewoo Kang, Byounggun Kim

doi:10.1186/s12859-018-2029-1

Open Access

https://doi.org/10.1186/s12859-018-2029-1

Abstract

Background

Molecular biomarkers that can predict drug efficacy in cancer patients are crucial components for the advancement of precision medicine. However, identifying these molecular biomarkers remains a laborious and challenging task. Next-generation sequencing of patients and preclinical models has increasingly led to the identification of novel gene-mutation-drug relations, and these results have been reported and published in the scientific literature.

Results

Here, we present two new computational methods that utilize all the PubMed articles as domain-specific background knowledge to assist in the extraction and curation of gene-mutation-drug relations from the literature. The first method uses the Biomedical Entity Search Tool (BEST) scoring results as some of the features to train the machine learning classifiers. The second method uses the BEST scoring results together with word vectors in a deep convolutional neural network model; the word vectors are constructed from and trained on large document collections such as PubMed abstracts and Google News articles. Using the features obtained from both the BEST search engine scores and word vectors, we extract mutation-gene and mutation-drug relations from the literature using machine learning classifiers such as random forest and deep convolutional neural networks.

Our methods achieved better results compared with the state-of-the-art methods. We used our proposed features in a simple machine learning model, and obtained F1-scores of 0.96 and 0.82 for mutation-gene and mutation-drug relation classification, respectively. We also developed a deep learning classification model using convolutional neural networks, BEST scores, and the word embeddings that are pre-trained on PubMed or Google News data. Using deep learning, the classification accuracy improved, and F1-scores of 0.96 and 0.86 were obtained for the mutation-gene and mutation-drug relations, respectively.

Conclusion

We believe that the computational methods described in this research could be used as an important tool in identifying molecular biomarkers that predict drug responses in cancer patients. We also built a database of the mutation-gene-drug relations that were extracted from all the PubMed abstracts. We believe that our database can prove to be a valuable resource for precision medicine researchers.
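The F1-scores quoted in the abstract are, by the standard definition, the harmonic mean of precision and recall. The following is a minimal sketch of that calculation, not the authors' evaluation code. The function name `f1_score` and the example counts are assumptions chosen for illustration and are not taken from the paper's data.

```python
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Return the F1-score, the harmonic mean of precision and recall."""
    predicted_positive = true_positives + false_positives
    actual_positive = true_positives + false_negatives
    # With no predicted or no actual positives, precision or recall is undefined;
    # this sketch reports 0.0 in that case.
    if predicted_positive == 0 or actual_positive == 0:
        return 0.0
    precision = true_positives / predicted_positive
    recall = true_positives / actual_positive
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only; these are NOT the paper's results.
example = f1_score(true_positives=86, false_positives=14, false_negatives=14)
print(round(example, 2))
```

Because F1 combines precision and recall, a classifier cannot score well by simply labeling every candidate pair as a relation: that would give high recall but low precision.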

Highlights

Molecular biomarkers that can predict drug efficacy in cancer patients are crucial components for the advancement of precision medicine
Identifying molecular biomarkers such as genes with specific mutations to predict the efficacy of a drug in cancer patients is important for the advancement of precision medicine
Since the baseline model is based on finding mutation related entities in a document-level dataset, we designed two different models: a machine learning model using features constructed at the document-level, and a deep convolutional neural network model using features constructed at the sentence-level

Summary

Introduction

Molecular biomarkers that can predict drug efficacy in cancer patients are crucial components for the advancement of precision medicine. Precision medicine aims to deliver personalized treatment to individual patients based on their genomic profiles. Identifying molecular biomarkers such as genes with specific mutations to predict the efficacy of a drug in cancer patients is important for the advancement of precision medicine. Large-scale research projects such as Genomics of Drug Sensitivity in Cancer (GDSC) [3], Cancer Cell Line Encyclopedia (CCLE) [4] and Cancer Therapeutics Response Portal (CTRP) [5] provide gene-mutation-drug relations for the advancement of personalized medicine. Databases such as ClinVar [6], My Cancer Genome [7], and MD Anderson Personalized Cancer Therapy Knowledgebase [8] contain gene-mutation-drug relations extracted from manually curated literature on clinical studies. Computational methods that automatically extract gene-mutation-drug relations from the literature are urgently needed to assist in the curation process.


Journal: BMC Bioinformatics	Publication Date: Jan 25, 2018
Citations: 44	License type: open-access
