Abstract

Breast cancer pathology reports are used in the diagnosis of the disease and determination of the stage of cancer in a patient. These reports are written or electronically generated by the Pathologist in English. The contents of a Pathology report generated by the Pathologist are usually in unstructured natural language form. The contents of a report are used to determine the Pathological classification and Cancer stage of a patient. Information extraction and making pathological decisions from natural language text is a complex process due to the heterogeneity of the report structure and its contents. The reports can be homogenized using the global annotation standard Systematized Nomenclature of Medicine – Clinical Terms, SNOMED-CT. It enables consistent representations of medical terms and can be used for clinical decision support systems (CDSS) and cancer reporting. SNOMED is a vast repository and its enormity and complexity necessitates extraction of a subset for a particular domain before using it for annotation. The annotation is performed either in online mode at the time of generation of the report or in offline mode on a batch of archived reports. A CDSS prototype is developed for breast cancer domain, which provides support to the Pathologist to determine the Pathological Classification and Cancer Staging on both natural language text and SNOMED-annotated text. With regard to Pathological decisions, a hypothesis is formulated that Annotation using SNOMED does not improve the system’s performance in determining the cancer stage of a patient. For annotating the text, the system initially extracts a SNOMED subset for the domain. Performance Analysis of the decision support processes was done by determining Precision, Recall, Specificity, Accuracy, F-measure and Error. The analysis indicates that the annotation feature improved the accuracy of automated Pathological decisions presented by the CDSS to the Clinician for finalizing his decisions. In the future, the CDSS feature can be applied to other cancer domains and thus provide a means to improve decision-making related to those domains.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call