Ontology-based Measures Research Articles

BackgroundSemantic similarity estimation significantly promotes the understanding of natural language resources and supports medical decision making. Previous studies have investigated semantic similarity and relatedness estimation between biomedical terms through resources in English, such as SNOMED-CT or UMLS. However, very limited studies focused on the Chinese language, and technology on natural language processing and text mining of medical documents in China is urgently needed. Due to the lack of a complete and publicly available biomedical ontology in China, we only have access to several modest-sized ontologies with no overlaps. Although all these ontologies do not constitute a complete coverage of biomedicine, their coverage of their respective domains is acceptable. In this paper, semantic similarity estimations between Chinese biomedical terms using these multiple non-overlapping ontologies were explored as an initial study. MethodsTypical path-based and information content (IC)-based similarity measures were applied on these ontologies. From the analysis of the computed similarity scores, heterogeneity in the statistical distributions of scores derived from multiple ontologies was discovered. This heterogeneity hampers the comparability of scores and the overall accuracy of similarity estimation. This problem was addressed through a novel language-independent method by combining semantic similarity estimation and score normalization. A reference standard was also created in this study. ResultsCompared with the existing task-independent normalization methods, the newly developed method exhibited superior performance on most IC-based similarity measures. The accuracy of semantic similarity estimation was enhanced through score normalization. This enhancement resulted from the mitigation of heterogeneity in the similarity scores derived from multiple ontologies. ConclusionWe demonstrated the potential necessity of score normalization when estimating semantic similarity using ontology-based measures. The results of this study can also be extended to other language systems to implement semantic similarity estimation in biomedicine.

Read full abstract

Our objective is to develop a framework for creating reference standards for functional testing of computerized measures of semantic relatedness. Currently, research on computerized approaches to semantic relatedness between biomedical concepts relies on reference standards created for specific purposes using a variety of methods for their analysis. In most cases, these reference standards are not publicly available and the published information provided in manuscripts that evaluate computerized semantic relatedness measurement approaches is not sufficient to reproduce the results. Our proposed framework is based on the experiences of medical informatics and computational linguistics communities and addresses practical and theoretical issues with creating reference standards for semantic relatedness. We demonstrate the use of the framework on a pilot set of 101 medical term pairs rated for semantic relatedness by 13 medical coding experts. While the reliability of this particular reference standard is in the “moderate” range; we show that using clustering and factor analyses offers a data-driven approach to finding systematic differences among raters and identifying groups of potential outliers. We test two ontology-based measures of relatedness and provide both the reference standard containing individual ratings and the R program used to analyze the ratings as open-source. Currently, these resources are intended to be used to reproduce and compare results of studies involving computerized measures of semantic relatedness. Our framework may be extended to the development of reference standards in other research areas in medical informatics including automatic classification, information retrieval from medical records and vocabulary/ontology development.

Read full abstract

Ontology-based Measures Research Articles

Related Topics

Articles published on Ontology-based Measures

A comparative study of methods for a priori prediction of MCQ difficulty

A reproducible survey on word embeddings and ontology-based methods for word similarity: Linear combinations outperform the state of the art

Evaluating semantic similarity between Chinese biomedical terms through multiple ontologies with score normalization: An initial study

An information theoretic approach to improve semantic similarity assessments across multiple ontologies

A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain

A new gene ontology-based measure for the functional similarity of gene products

Ontology-based semantic similarity: A new feature-based approach

Towards a framework for developing semantic relatedness reference standards

An ontology-based measure to compute semantic similarity in biomedicine

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Ontology-based Measures Research Articles

Related Topics

Articles published on Ontology-based Measures

A comparative study of methods for a priori prediction of MCQ difficulty

A reproducible survey on word embeddings and ontology-based methods for word similarity: Linear combinations outperform the state of the art

Evaluating semantic similarity between Chinese biomedical terms through multiple ontologies with score normalization: An initial study

An information theoretic approach to improve semantic similarity assessments across multiple ontologies

A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain

A new gene ontology-based measure for the functional similarity of gene products

Ontology-based semantic similarity: A new feature-based approach

Towards a framework for developing semantic relatedness reference standards

An ontology-based measure to compute semantic similarity in biomedicine