TrendyGenes, a computational pipeline for the detection of literature trends in academia and drug discovery

Guillermo Serrano Nájera, David Narganes Carlón, Daniel J Crowther

doi:10.1038/s41598-021-94897-9

Abstract

Target identification and prioritisation are prominent first steps in modern drug discovery. Traditionally, individual scientists have used their expertise to manually interpret scientific literature and prioritise opportunities. However, increasing publication rates and the wider routine coverage of human genes by omic-scale research make it difficult to maintain meaningful overviews from which to identify promising new trends. Here we propose an automated yet flexible pipeline that identifies trends in the scientific corpus which align with the specific interests of a researcher and facilitate an initial prioritisation of opportunities. Using a procedure based on co-citation networks and machine learning, genes and diseases are first parsed from PubMed articles using a novel named entity recognition system together with publication date and supporting information. Then recurrent neural networks are trained to predict the publication dynamics of all human genes. For a user-defined therapeutic focus, genes generating more publications or citations are identified as high-interest targets. We also use topic detection routines to help explain why a gene is trendy and implement a system to propose the most prominent review articles for a potential target. This TrendyGenes pipeline detects emerging targets and pathways and provides a new way to explore the literature for individual researchers, pharmaceutical companies and funding agencies.
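As a rough illustration of the idea that genes generating more publications are flagged as high-interest, the sketch below ranks genes by the least-squares slope of their yearly publication counts. This is a deliberately simple stand-in and not the authors' method: the abstract describes recurrent neural networks for predicting publication dynamics, which are not reproduced here. The function names (`publication_trend`, `rank_genes`), the gene labels, and all counts are hypothetical and made up for the example.

```python
from statistics import mean


def publication_trend(counts_by_year: dict[int, int]) -> float:
    """Least-squares slope of yearly publication counts (papers per year, per year).

    A simple proxy for "publication dynamics"; assumed for illustration only.
    """
    years = sorted(counts_by_year)
    if len(years) < 2:
        return 0.0  # a single data point carries no trend information
    x_bar = mean(years)
    y_bar = mean(counts_by_year[y] for y in years)
    num = sum((y - x_bar) * (counts_by_year[y] - y_bar) for y in years)
    den = sum((y - x_bar) ** 2 for y in years)
    return num / den


def rank_genes(gene_counts: dict[str, dict[int, int]]) -> list[str]:
    """Order genes from the steepest rising trend to the steepest falling one."""
    return sorted(
        gene_counts,
        key=lambda gene: publication_trend(gene_counts[gene]),
        reverse=True,
    )


# Hypothetical yearly publication counts for three placeholder genes.
toy_counts = {
    "GENE_A": {2017: 10, 2018: 12, 2019: 11, 2020: 13},
    "GENE_B": {2017: 2, 2018: 5, 2019: 9, 2020: 14},
    "GENE_C": {2017: 20, 2018: 18, 2019: 15, 2020: 12},
}

ranking = rank_genes(toy_counts)
print(ranking)
```

In this toy data the gene with the fastest-growing count ranks first even though its absolute publication volume is lower than that of the declining gene, which is the distinction between a trend and raw popularity.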


Journal: Scientific Reports	Publication Date: Aug 3, 2021
Citations: 4	License type: open-access


