Evaluating semantic similarity between Chinese biomedical terms through multiple ontologies with score normalization: An initial study

Wenxin Ning,Ming Yu,Dehua Kong

doi:10.1016/j.jbi.2016.10.017

Wenxin Ning, Ming Yu + Show 1 more

Open Access

https://doi.org/10.1016/j.jbi.2016.10.017

Copy DOI

Export

Save

Cite

Journal: Journal of Biomedical Informatics	Publication Date: Nov 1, 2016
Citations: 9	License type: publisher-specific-oa

Affiliation: Tsinghua University

Abstract
Full-Text
Similar Papers

Abstract

Listen

BackgroundSemantic similarity estimation significantly promotes the understanding of natural language resources and supports medical decision making. Previous studies have investigated semantic similarity and relatedness estimation between biomedical terms through resources in English, such as SNOMED-CT or UMLS. However, very limited studies focused on the Chinese language, and technology on natural language processing and text mining of medical documents in China is urgently needed. Due to the lack of a complete and publicly available biomedical ontology in China, we only have access to several modest-sized ontologies with no overlaps. Although all these ontologies do not constitute a complete coverage of biomedicine, their coverage of their respective domains is acceptable. In this paper, semantic similarity estimations between Chinese biomedical terms using these multiple non-overlapping ontologies were explored as an initial study. MethodsTypical path-based and information content (IC)-based similarity measures were applied on these ontologies. From the analysis of the computed similarity scores, heterogeneity in the statistical distributions of scores derived from multiple ontologies was discovered. This heterogeneity hampers the comparability of scores and the overall accuracy of similarity estimation. This problem was addressed through a novel language-independent method by combining semantic similarity estimation and score normalization. A reference standard was also created in this study. ResultsCompared with the existing task-independent normalization methods, the newly developed method exhibited superior performance on most IC-based similarity measures. The accuracy of semantic similarity estimation was enhanced through score normalization. This enhancement resulted from the mitigation of heterogeneity in the similarity scores derived from multiple ontologies. ConclusionWe demonstrated the potential necessity of score normalization when estimating semantic similarity using ontology-based measures. The results of this study can also be extended to other language systems to implement semantic similarity estimation in biomedicine.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Evaluating semantic similarity between Chinese biomedical terms through multiple ontologies with score normalization: An initial study

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics

Lead the way for us

Similar Papers

Towards the estimation of feature-based semantic similarity using multiple ontologies
Albert Solé-Ribalta ... Francesc Serratosa
Knowledge-Based Systems | VOL. 55
Albert Solé-Ribalta, et. al.Albert Solé-Ribalta ... Francesc Serratosa
19 Oct 2013
Knowledge-Based Systems | VOL. 55

Semantic similarity estimation from multiple ontologies
Montserrat Batet ... Aida Valls
Applied Intelligence | VOL. 38
Montserrat Batet, et. al.Montserrat Batet ... Aida Valls
26 May 2012
Applied Intelligence | VOL. 38

BIOSSES: a semantic sentence similarity estimation system for the biomedical domain.
Gizem Soğancıoğlu ... Arzucan Özgür
Bioinformatics | VOL. 33
Gizem Soğancıoğlu, et. al.Gizem Soğancıoğlu ... Arzucan Özgür
12 Jul 2017
Bioinformatics | VOL. 33

Clustering clinical models from local electronic health records based on semantic similarity
Kirstine Rosenbeck Gøeg ... Stig Kjær Andersen
Journal of Biomedical Informatics | VOL. 54
Kirstine Rosenbeck Gøeg, et. al.Kirstine Rosenbeck Gøeg ... Stig Kjær Andersen
31 Dec 2015
Journal of Biomedical Informatics | VOL. 54

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Evaluating semantic similarity between Chinese biomedical terms through multiple ontologies with score normalization: An initial study

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics