Abstract
BackgroundOntologies house various kinds of domain knowledge in formal structures, primarily in the form of concepts and the associative relationships between them. Ontologies have become integral components of many health information processing environments. Hence, quality assurance of the conceptual content of any ontology is critical. Relationships are foundational to the definition of concepts. Missing relationship errors (i.e., unintended omissions of important definitional relationships) can have a deleterious effect on the quality of an ontology. An abstraction network is a structure that overlays an ontology and provides an alternate, summarization view of its contents. One kind of abstraction network is called an area taxonomy, and a variation of it is called a subtaxonomy. A methodology based on these taxonomies for more readily finding missing relationship errors is explored.MethodsThe area taxonomy and the subtaxonomy are deployed to help reveal concepts that have a high likelihood of exhibiting missing relationship errors. A specific top-level grouping unit found within the area taxonomy and subtaxonomy, when deemed to be anomalous, is used as an indicator that missing relationship errors are likely to be found among certain concepts. Two hypotheses pertaining to the effectiveness of our Quality Assurance approach are studied.ResultsOur Quality Assurance methodology was applied to the Biological Process hierarchy of the National Cancer Institute thesaurus (NCIt) and SNOMED CT’s Eye/vision finding subhierarchy within its Clinical finding hierarchy. Many missing relationship errors were discovered and confirmed in our analysis. For both test-bed hierarchies, our Quality Assurance methodology yielded a statistically significantly higher number of concepts with missing relationship errors in comparison to a control sample of concepts. Two hypotheses are confirmed by these findings.ConclusionsQuality assurance is a critical part of an ontology’s lifecycle, and automated or semi-automated tools for supporting this process are invaluable. We introduced a Quality Assurance methodology targeted at missing relationship errors. Its successful application to the NCIt’s Biological Process hierarchy and SNOMED CT’s Eye/vision finding subhierarchy indicates that it can be a useful addition to the arsenal of tools available to ontology maintenance personnel.
Highlights
Ontologies house various kinds of domain knowledge in formal structures, primarily in the form of concepts and the associative relationships between them
While it is true that some consider an error of omission as being less severe than an error of commission, missing relationship errors can have a deleterious effect on the quality of the ontology, when they appear in large numbers
We have developed a number of abstraction networks—compact summarization structures for ontologies—and have shown them to be useful in support of ontology quality assurance (QA) [5]
Summary
Ontologies house various kinds of domain knowledge in formal structures, primarily in the form of concepts and the associative relationships between them. Quality assurance of the conceptual content of any ontology is critical. Relationships are foundational to the definition of concepts. Missing relationship errors (i.e., unintended omissions of important definitional relationships) can have a deleterious effect on the quality of an ontology. We are focusing on quality assurance (QA) pertaining to a specific kind of error of omission, namely, missing relationship errors, i.e., omissions of critical relationships from concept definitions. While it is true that some consider an error of omission as being less severe than an error of commission, missing relationship errors can have a deleterious effect on the quality of the ontology, when they appear in large numbers. As relationships affect the functioning of classifiers employed in ontology management, omitted relationships can lead to the incorrect placement of concepts (i.e., incorrect parentage) in the ontology hierarchy [4]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.