Verification of Uncurated Protein Annotations

Francisco M Couto,Mário J Silva,Emily Dimmer,Evelyn Camon,Rolf Apweiler,Vivian Lee

doi:10.4018/978-1-60566-274-9.ch016

Abstract

Molecular Biology research projects produced vast amounts of data, part of which has been preserved in a variety of public databases. However, a large portion of the data contains a significant number of errors and therefore requires careful verification by curators, a painful and costly task, before being reliable enough to derive valid conclusions from it. On the other hand, research in biomedical information retrieval and information extraction are nowadays delivering Text Mining solutions that can support curators to improve the efficiency of their work to deliver better data resources. Over the past decades, automatic text processing systems have successfully exploited biomedical scientific literature to reduce the researchers’ efforts to keep up to date, but many of these systems still rely on domain knowledge that is integrated manually leading to unnecessary overheads and restrictions in its use. A more efficient approach would acquire the domain knowledge automatically from publicly available biological sources, such as BioOntologies, rather than using manually inserted domain knowledge. An example of this approach is GOAnnotator, a tool that assists the verification of uncurated protein annotations. It provided correct evidence text at 93% precision to the curators and thus achieved promising results. GOAnnotator was implemented as a web tool that is freely available at http://xldb.di.fc.ul.pt/rebil/tools/goa/.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Verification of Uncurated Protein Annotations

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
...
-
, et. al. ...
09 Aug 2015
09 Aug 2015

Salton Award Lecture - Information retrieval and computer science
W Bruce Croft
-
W Bruce CroftW Bruce Croft
28 Jul 2003
28 Jul 2003

Text-based intelligent systems: Current research and practice in information extraction and retrieval: Paul S. Jacobs (Ed.). Lawrence Erlbaum Associates, Hillsdale, NJ (1992). viii + 281 pp., $27.50. ISBN 0-8058-1189-3.
Jessica L Milstead
Information Processing and Management | VOL. 29
Jessica L MilsteadJessica L Milstead
01 May 1993
Information Processing and Management | VOL. 29

Challenges in information retrieval and language modeling
James Allan ...
ACM SIGIR Forum | VOL. 37
James Allan, et. al.James Allan ...
01 Apr 2003
ACM SIGIR Forum | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Verification of Uncurated Protein Annotations

Abstract

Talk to us

Similar Papers