Integration and publication of heterogeneous text-mined relationships on the Semantic Web

Adrien Coulet,Michel Dumontier,Mark A Musen,Nigam H Shah,Yael Garten,Russ B Altman

doi:10.1186/2041-1480-2-s2-s10

Abstract

BackgroundAdvances in Natural Language Processing (NLP) techniques enable the extraction of fine-grained relationships mentioned in biomedical text. The variability and the complexity of natural language in expressing similar relationships causes the extracted relationships to be highly heterogeneous, which makes the construction of knowledge bases difficult and poses a challenge in using these for data mining or question answering.ResultsWe report on the semi-automatic construction of the PHARE relationship ontology (the PHArmacogenomic RElationships Ontology) consisting of 200 curated relations from over 40,000 heterogeneous relationships extracted via text-mining. These heterogeneous relations are then mapped to the PHARE ontology using synonyms, entity descriptions and hierarchies of entities and roles. Once mapped, relationships can be normalized and compared using the structure of the ontology to identify relationships that have similar semantics but different syntax. We compare and contrast the manual procedure with a fully automated approach using WordNet to quantify the degree of integration enabled by iterative curation and refinement of the PHARE ontology. The result of such integration is a repository of normalized biomedical relationships, named PHARE-KB, which can be queried using Semantic Web technologies such as SPARQL and can be visualized in the form of a biological network.ConclusionsThe PHARE ontology serves as a common semantic framework to integrate more than 40,000 relationships pertinent to pharmacogenomics. The PHARE ontology forms the foundation of a knowledge base named PHARE-KB. Once populated with relationships, PHARE-KB (i) can be visualized in the form of a biological network to guide human tasks such as database curation and (ii) can be queried programmatically to guide bioinformatics applications such as the prediction of molecular interactions. PHARE is available at http://purl.bioontology.org/ontology/PHARE.

Highlights

Advances in Natural Language Processing (NLP) techniques enable the extraction of fine-grained relationships mentioned in biomedical text
We report on the construction of a relationship ontology and describe its use for integrating and publishing text-mined relationships on the Semantic Web
The PHARE-Knowledge Base (PHARE-KB) The ontology-driven integration process described in the method section takes as input a set of relationships extracted from MEDLINE abstracts and outputs a set of normalized relationships of the form Role(subject, object) represented using entity types and roles defined in PHARE

Summary

Introduction

Advances in Natural Language Processing (NLP) techniques enable the extraction of fine-grained relationships mentioned in biomedical text. Our work is motivated by the need for automated approaches capturing and formalizing knowledge extracted from the literature via manual or computational approaches. Consider for example, that five curators at the Pharmacogenomics Knowledge Base (PharmGKB) manually browse the pharmacogenomics (PGx) literature to curate relationships relevant for storage in the PharmGKB [2]. The result of this curation process is a high quality database queried by clinicians and bioinformaticians. Automatic approaches using Natural Language Processing (NLP) are increasingly utilized [4]

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Biomedical Semantics	Publication Date: May 17, 2011
Citations: 49	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Integration and publication of heterogeneous text-mined relationships on the Semantic Web

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Biomedical Semantics

Lead the way for us

Similar Papers

JOSN: JAVA oriented question-answering system combining semantic web and natural language processing techniques
Shally Garg ... Suresh Kumar
-
Shally Garg, et. al.Shally Garg ... Suresh Kumar
01 Aug 2016
01 Aug 2016

Natural Language Processing for Ontology Development in IoT-Enabled Smart Healthcare
Aytug Turkmen ... Ozgu Can
-
Aytug Turkmen, et. al.Aytug Turkmen ... Ozgu Can
12 Jan 2024
12 Jan 2024

Language Learning Research at the Intersection of Experimental, Computational, and Corpus‐Based Approaches
Patrick Rebuschat ... Detmar Meurers
Language Learning | VOL. 67
Patrick Rebuschat, et. al.Patrick Rebuschat ... Detmar Meurers
01 Jun 2017
Language Learning | VOL. 67

Ontology-Based Interpretation of Natural Language Philipp Cimiano, Christina Unger, and John McCrae (University of Arminia Bielefeld, Germany) Morgan & Claypool, Synthesis Lectures on Human Language Technologies, March 2014, 178 pages, (doi:10.2200/S00561ED1V01Y201401HLT024) , $45.00
Chris Biemann
Computational Linguistics | VOL. 41
Chris BiemannChris Biemann
01 Jun 2015
Computational Linguistics | VOL. 41

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Integration and publication of heterogeneous text-mined relationships on the Semantic Web

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Biomedical Semantics