Abstract

Scientific literature is a central repository of scientific knowledge - every important scientific discovery has been published in it. As such, it has become a main target of data mining, and in particular, text mining. However, the unstructured, or covertly structured, nature of natural language texts poses a major barrier to accessing the contents of literature. The technology of literature annotation thus has played a central role for text mining. While it still requires enormous further effort, the productivity of literature annotation is recently significantly improved, and there are quite a few groups producing annotations in large scale. While many groups have released those annotation data sets openly to the public, however, the way of sharing the widely valuable resources still remains at a primitive level, e.g., relying on individual exchange of archived files. Meanwhile, the advancement of internet web technology has enabled much convenient ways of collaborating for producing and sharing data. For example, the technology of web 2.0 has enabled crowdsourcing content generation, and web 3.0 has enabled machine-understandable web of data (WoD), a.k.a, linked data (LD), as a mean for linking and sharing data. The first BLAH hackathon/symposium event was organized to initiate a concentrated collaboration for linking and sharing biomedical literature annotation utilizing recent web technology. Specifically, the event aimed at (1) collecting various annotations to PubMed and PMC articles, (2) linking them through normalized texts of literature, and (3) making the resources publicly available through standard Web protocol. After spending four days together during the hackathon to discuss and develop initial sets of shared resources of biomedical literature annotation, ideas for further effort were presented and discussed during a post-hackathon one-day symposium. In the proceedings, extended abstracts of five projects presented in the symposium are included. For the entire program and the output of the hackathon, readers are referred to the homepage of the event: http://2015.linkedannotation.org. We believe the output of the hackathon and the ideas discussed during the symposium has a good potential to open a new era of text mining, enabling rich analysis of heterogeneous annotations, e.g., syntactic and semantic annotations, genomic and clinical annotations, and so on, which is not possible using single annotation sets individually.

Highlights

  • Introduction to the Biomedical Linked AnnotationHackathon (BLAH) 2015 SymposiumJin-Dong Kim1*, Kevin Bretonnel Cohen2, Nigel Collier3, Zhiyong Lu4, Pontus Stenetorp5From Biomedical Linked Annotation Hackathon 2015 Kashiwa, Japan. 23-27 February 2015Scientific literature is a central repository of scientific knowledge - every important scientific discovery has been published in it

  • It has become a main target of data mining, and in particular, text mining

  • The technology of literature annotation has played a central role for text mining

Read more

Summary

Introduction

Introduction to the Biomedical Linked AnnotationHackathon (BLAH) 2015 SymposiumJin-Dong Kim1*, Kevin Bretonnel Cohen2, Nigel Collier3, Zhiyong Lu4, Pontus Stenetorp5From Biomedical Linked Annotation Hackathon 2015 Kashiwa, Japan. 23-27 February 2015Scientific literature is a central repository of scientific knowledge - every important scientific discovery has been published in it. Introduction to the Biomedical Linked Annotation Hackathon (BLAH) 2015 Symposium From Biomedical Linked Annotation Hackathon 2015 Kashiwa, Japan.

Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.