Extending and encoding existing biological terminologies and datasets for use in the reasoned semantic web.

Soroush Samadian,Bruce McManus,Mark D Wilkinson

doi:10.1186/2041-1480-3-6

Abstract

BackgroundClinical phenotypes and disease-risk stratification are most often determined through the direct observations of clinicians in conjunction with published standards and guidelines, where the clinical expert is the final arbiter of the patient’s classification. While this "human" approach is highly desirable in the context of personalized and optimal patient care, it is problematic in a healthcare research setting because the basis for the patient's classification is not transparent, and likely not reproducible from one clinical expert to another. This sits in opposition to the rigor required to execute, for example, Genome-wide association analyses and other high-throughput studies where a large number of variables are being compared to a complex disease phenotype. Most clinical classification systems and are not structured for automated classification, and similarly, clinical data is generally not represented in a form that lends itself to automated integration and interpretation. Here we apply Semantic Web technologies to the problem of automated, transparent interpretation of clinical data for use in high-throughput research environments, and explore migration-paths for existing data and legacy semantic standards.ResultsUsing a dataset from a cardiovascular cohort collected two decades ago, we present a migration path - both for the terminologies/classification systems and the data - that enables rich automated clinical classification using well-established standards. This is achieved by establishing a simple and flexible core data model, which is combined with a layered ontological framework utilizing both logical reasoning and analytical algorithms to iteratively "lift" clinical data through increasingly complex layers of interpretation and classification. We compare our automated analysis to that of the clinical expert, and discrepancies are used to refine the ontological models, finally arriving at ontologies that mirror the expert opinion of the individual clinical researcher. Other discrepancies, however, could not be as easily modeled, and we evaluate what information we are lacking that would allow these discrepancies to be resolved in an automated manner.ConclusionsWe demonstrate that the combination of semantically-explicit data, logically rigorous models of clinical guidelines, and publicly-accessible Semantic Web Services, can be used to execute automated, rigorous and reproducible clinical classifications with an accuracy approaching that of an expert. Discrepancies between the manual and automatic approaches reveal, as expected, that clinicians do not always rigorously follow established guidelines for classification; however, we demonstrate that "personalized" ontologies may represent a re-usable and transparent approach to modeling individual clinical expertise, leading to more reproducible science.

Highlights

Clinical phenotypes and disease-risk stratification are most often determined through the direct observations of clinicians in conjunction with published standards and guidelines, where the clinical expert is the final arbiter of the patient’s classification
The clinical observations used in this analysis were as follows: Age, Gender, Height, Weight, Body Mass Index (BMI), Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP) Glucose, Cholesterol, Low Density Lipoprotein (LDL), High Density Lipoprotein (HDL), Triglyceride (TG)
Similar queries were issued for DBP, Chol, HDL, TG, and BMI attributes

Summary

Introduction

Clinical phenotypes and disease-risk stratification are most often determined through the direct observations of clinicians in conjunction with published standards and guidelines, where the clinical expert is the final arbiter of the patient’s classification. Terminologies and Nosologies have long been used by clinicians and clinical researchers as a means of more consistently annotating their observations It is not surprising, that the emergence of the Semantic Web found fertile ground in the clinical and life science communities, and formal Semantic Web standards have been rapidly adopted by these communities to migrate existing annotation systems into these modern frameworks and syntaxes. That the emergence of the Semantic Web found fertile ground in the clinical and life science communities, and formal Semantic Web standards have been rapidly adopted by these communities to migrate existing annotation systems into these modern frameworks and syntaxes While this largely syntactic migration is a useful exercise, in that it becomes possible to do simple reasoning over manual annotations, this simple migration does not enable the full power of modern semantic technologies to be applied to these important biomedical datasets. We base our exploration in a real-world use case, using clinical data collected and annotated 20 years ago in the context of a study of patient outcomes after various cardiovascular interventions

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Biomedical Semantics	Publication Date: Jul 20, 2012
Citations: 11	License type: cc-by

R Discovery Prime

R Discovery Prime

Extending and encoding existing biological terminologies and datasets for use in the reasoned semantic web.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Biomedical Semantics

Lead the way for us

Similar Papers

Actinic keratosis: correlation between clinical and histological classification systems.
L Schmitz ... E Bierhoff
Journal of the European Academy of Dermatology and Venereology | VOL. 30
L Schmitz, et. al.L Schmitz ... E Bierhoff
08 Mar 2016
Journal of the European Academy of Dermatology and Venereology | VOL. 30

Proposed international clinical diabetic retinopathy and diabetic macular edema disease severity scales
C.P Wilkinson ... Juan T Verdaguer
Ophthalmology | VOL. 110
C.P Wilkinson, et. al.C.P Wilkinson ... Juan T Verdaguer
20 Aug 2003
Ophthalmology | VOL. 110

Clinical Classification of Age-related Macular Degeneration
Frederick L Ferris ... Srinivas R Sadda
Ophthalmology | VOL. 120
Frederick L Ferris, et. al.Frederick L Ferris ... Srinivas R Sadda
15 Jan 2013
Ophthalmology | VOL. 120

Point/Counterpoint. Future radiotherapy practice will be based on evidence from retrospective interrogation of linked clinical data sources rather than prospective randomized controlled clinical trials.
Andre L A J Dekker ... Colin G Orton
Medical physics | VOL. 41
Andre L A J Dekker, et. al.Andre L A J Dekker ... Colin G Orton
04 Feb 2014
Medical physics | VOL. 41

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extending and encoding existing biological terminologies and datasets for use in the reasoned semantic web.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Biomedical Semantics