PubRunner: A light-weight framework for updating text mining results.

Kishore R Anekalla, Jake Lever, Nicolas Fiorini, J.P Courneya, Michael Muchow, Ben Busby

doi:10.12688/f1000research.11389.1

Abstract

Biomedical text mining promises to help biologists quickly navigate the combined knowledge in their domain. This would improve understanding of the complex interactions within biological systems and speed up hypothesis generation. New biomedical research articles are published daily, and text mining tools are only as good as the corpus from which they work. Many text mining tools are underused because their results are static and do not reflect the constantly expanding knowledge in the field. For biomedical text mining to become an indispensable tool for researchers, this problem must be addressed. To this end, we present PubRunner, a framework for regularly running text mining tools on the latest publications. PubRunner is lightweight, simple to use, and can be integrated with an existing text mining tool. The workflow involves downloading the latest abstracts from PubMed, executing a user-defined tool, pushing the resulting data to a public FTP server, and publicizing the location of these results on the public PubRunner website. PubRunner is a proof of concept that we hope will encourage text mining developers to build tools that truly aid biologists in exploring the latest publications.
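The four-step workflow in the abstract (download, run the tool, upload, publicize) can be pictured as a small chain of pluggable steps. The sketch below is only an illustration under stated assumptions, not PubRunner's actual code or configuration: the names `Steps`, `update_results`, `fake_download`, and `count_abstracts_tool` are hypothetical, the download step is a stub that writes two made-up lines instead of contacting PubMed, and the FTP address is a placeholder.

```python
import tempfile
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, List


@dataclass
class Steps:
    """The four stages named in the abstract, each supplied by the user (hypothetical)."""
    download: Callable[[Path], List[Path]]        # fetch the latest abstracts
    run_tool: Callable[[List[Path], Path], Path]  # execute the text mining tool
    upload: Callable[[Path], str]                 # push results, return their location
    announce: Callable[[str], None]               # publicize that location


def update_results(steps: Steps, workdir: Path) -> str:
    """Run one update cycle and return where the results were published."""
    corpus_dir = workdir / "corpus"
    output_dir = workdir / "output"
    corpus_dir.mkdir(parents=True, exist_ok=True)
    output_dir.mkdir(parents=True, exist_ok=True)

    abstracts = steps.download(corpus_dir)
    results = steps.run_tool(abstracts, output_dir)
    location = steps.upload(results)
    steps.announce(location)
    return location


def fake_download(corpus_dir: Path) -> List[Path]:
    """Stub standing in for a PubMed download: writes one abstract per line."""
    path = corpus_dir / "abstracts.txt"
    path.write_text("First example abstract.\nSecond example abstract.\n")
    return [path]


def count_abstracts_tool(inputs: List[Path], output_dir: Path) -> Path:
    """Stub text mining tool: reports how many abstracts it was given."""
    total = sum(len(p.read_text().splitlines()) for p in inputs)
    out = output_dir / "results.txt"
    out.write_text(f"abstracts_processed\t{total}\n")
    return out


if __name__ == "__main__":
    announced: List[str] = []
    steps = Steps(
        download=fake_download,
        run_tool=count_abstracts_tool,
        # Placeholder address; a real upload step would transfer the file.
        upload=lambda results: "ftp://example.org/" + results.name,
        announce=announced.append,
    )
    with tempfile.TemporaryDirectory() as tmp:
        print(update_results(steps, Path(tmp)))
```

Keeping each stage as an interchangeable callable mirrors the claim that the framework can wrap an existing tool: only the `run_tool` step would need to change, and scheduling the cycle (for example with a cron job) is left outside the sketch.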

Highlights

The National Library of Medicine’s (NLM) PubMed database contains over 27 million citations and is growing exponentially (Lu, 2011).
In order to encourage biomedical text mining researchers to widely share their results and code, and keep analyses up-to-date, we present PubRunner.
A central website was developed to track the status of different text mining analyses that are managed by PubRunner.

Summary

13 Oct 2017

3. Julien Gobeill, University of Applied Sciences and Arts of Western Switzerland (HES-SO, HEG (Geneva School of Management)), Carouge, Switzerland; Swiss Institute of Bioinformatics, Geneva, Switzerland.
This article is included in the Container Virtualization in Bioinformatics collection.
This article is included in the Hackathons collection.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Introduction

Methods

Conclusions and next steps

Journal: F1000Research
Publication Date: May 2, 2017
Citations: 5
License type: CC BY 4.0
