Configurable web-services for biomedical document annotation

Sérgio Matos

doi:10.1186/s13321-018-0317-4

Abstract

The need to efficiently find and extract information from the continuously growing biomedical literature has led to the development of various annotation tools aimed at identifying mentions of entities and relations. Many of these tools have been integrated in user-friendly applications facilitating their use by non-expert text miners and database curators. In this paper we describe the latest version of Neji, a web-services ready text processing and annotation framework. The modular and flexible architecture facilitates adaptation to different annotation requirements, while the built-in web services allow its integration in external tools and text mining pipelines. The evaluation of the web annotation server on the technical interoperability and performance of annotation servers track of BioCreative V.5 further illustrates the flexibility and applicability of this framework.

Highlights

The large amount of information and knowledge continuously produced in the biomedical domain is reflected in the number of published journal articles.
The annotation service for participating in the technical interoperability and performance of annotation servers (TIPS) task was configured to run with 23 threads and was deployed in a Docker container with 32 GB of memory running on a server with 24 processing cores.
We followed the procedure defined for the TIPS task [8], in which the document text is obtained from the BeCalm abstract and patent servers, and measured the time from when the request was submitted to the Neji annotation service until the annotation results were returned.
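
The timing measurement described in the last highlight can be sketched as follows. This is a minimal illustration, not the code used in the paper: the helper timed_call and the stand-in fake_annotation_request are hypothetical names, and the stand-in only sleeps briefly instead of contacting a real annotation service.

```python
import time
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


def timed_call(request: Callable[[], T]) -> Tuple[T, float]:
    """Run `request` and return its result plus elapsed wall-clock seconds."""
    start = time.perf_counter()
    result = request()
    elapsed = time.perf_counter() - start
    return result, elapsed


def fake_annotation_request() -> list:
    # Hypothetical stand-in for submitting a document to an annotation
    # service and waiting for the results; it only simulates latency.
    time.sleep(0.05)
    return [{"start": 0, "end": 5, "text": "BRCA1"}]


if __name__ == "__main__":
    annotations, seconds = timed_call(fake_annotation_request)
    print(f"{len(annotations)} annotation(s) returned in {seconds:.3f} s")
```

Wrapping the whole request in a single timer captures the interval from submission until the results are returned, which is the quantity the highlight describes. In a real setting the stand-in would be replaced by the actual call to the service.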

Summary

Introduction

The large amount of information and knowledge continuously produced in the biomedical domain is reflected in the number of published journal articles. In 2017, the PubMed/MEDLINE bibliographic database contained over 26 million references to journal articles in life sciences, of which more than one million were added in that year [1]. At this rate, staying up to date with current knowledge and identifying the most relevant publications and information on a given subject is a very challenging task for researchers. To accelerate the curation process, automatic information extraction tools have been developed and integrated in the curation pipeline [4]. These tools apply information retrieval and ranking methods to expedite the identification of relevant literature, given particular curation requirements, and information extraction methods that identify textual mentions of entities (e.g. names of genes) or relations (e.g. interactions between a protein and a chemical).

Methods

Results

Conclusion

Journal: Journal of Cheminformatics
Publication Date: Dec 1, 2018
Citations: 8
License type: open-access

Similar Papers

MER: a shell script and annotation server for minimal named entity recognition and linking
Francisco M Couto ... Andre Lamurias
Journal of Cheminformatics | VOL. 10
01 Dec 2018

OGER: OntoGene’s Entity Recogniser in the BeCalm TIPS Task
27 Apr 2017

Web services-based text-mining demonstrates broad impacts for interoperability and process simplification.
Carolyn J Mattingly ... Allan Peter Davis
Database : the journal of biological databases and curation | VOL. 2014
10 Jun 2014

Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics
Leonid L Chepelev ... Alexandre Riazanov
BMC Bioinformatics | VOL. 12
26 Jul 2011
