Ontology-based enriched concept graphs for medical document classification

Niloofer Shanavas, Hui Wang, Zhiwei Lin, Glenn Hawe

doi:10.1016/j.ins.2020.03.006

Abstract

The rapidly increasing volume of medical text data, including biomedical literature and clinical records, presents difficulties to biomedical researchers and clinical practitioners. Automatic text classification is an important means for managing medical text data. The main challenge in medical text classification is the complex terminology used in these documents. Therefore, it is critical to handle synonymy, polysemy, and multi-word concepts so that classification is based on the meaning of these documents. The solution to this problem of complex terminology helps in building systems with better access to relevant data, resulting in more effective utilisation of the existing information. In this paper, we present a simple and effective approach to address this challenge. A concept graph is automatically constructed and enriched for each medical text document with the help of a domain-specific similarity matrix that is built using Unified Medical Language System (UMLS) concepts in the training documents. Medical text documents are compared based on their enriched concept graphs using a graph kernel. Classification is then done based on the comparison result. The benefit of this approach is that it allows the incorporation of domain knowledge into the classification framework. The experiments on biomedical abstracts and clinical reports classification show the effectiveness of the proposed approach. Based on evaluation metrics of precision, recall and F1-scores, our method achieves a significantly higher classification performance than other widely used similarity measures for similarity-based text classification.
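The pipeline described in the abstract (build a concept graph per document, enrich it with a concept similarity matrix, compare graphs with a graph kernel, classify from the comparison) can be illustrated with a minimal Python sketch. The abstract does not give the construction details, so everything below is an assumption made for illustration: the sliding-window co-occurrence graph, the enrichment rule, the normalised edge kernel, the nearest-neighbour decision, the function names, the concept labels, and the similarity value 0.8 are all hypothetical and are not the authors' implementation. Real input would be UMLS concept identifiers extracted from the text.

```python
from collections import Counter
import math


def build_concept_graph(concepts, window=2):
    """Assumed construction: link each concept to the ones that follow it
    within a sliding window; edge weight = co-occurrence count."""
    graph = Counter()
    for i, concept in enumerate(concepts):
        for other in concepts[i + 1 : i + window]:
            if other != concept:
                graph[frozenset((concept, other))] += 1.0
    return graph


def enrich(graph, similarity, threshold=0.5):
    """Assumed enrichment rule: for every edge (a, b), add an edge
    (related, b) for each concept related to a, weighted by similarity."""
    enriched = Counter(graph)
    for edge, weight in graph.items():
        u, v = tuple(edge)
        for a, b in ((u, v), (v, u)):
            for related, sim in similarity.get(a, {}).items():
                if sim >= threshold and related != b:
                    enriched[frozenset((related, b))] += weight * sim
    return enriched


def edge_kernel(g1, g2):
    """Simple edge kernel: sum of weight products over shared edges."""
    return sum(w * g2[e] for e, w in g1.items() if e in g2)


def normalized_kernel(g1, g2):
    """Cosine-style normalisation so that k(G, G) = 1 for non-empty G."""
    denom = math.sqrt(edge_kernel(g1, g1) * edge_kernel(g2, g2))
    return edge_kernel(g1, g2) / denom if denom else 0.0


def classify(query, training, similarity):
    """Assumed decision rule: label of the most similar training graph."""
    q = enrich(build_concept_graph(query), similarity)
    scored = [
        (normalized_kernel(q, enrich(build_concept_graph(doc), similarity)), label)
        for doc, label in training
    ]
    return max(scored)[1]


# Hypothetical toy data: labels stand in for UMLS concepts; 0.8 is made up.
similarity = {
    "myocardial_infarction": {"acute_coronary_syndrome": 0.8},
    "acute_coronary_syndrome": {"myocardial_infarction": 0.8},
}
training = [
    (["myocardial_infarction", "aspirin"], "cardiology"),
    (["eczema", "hydrocortisone"], "dermatology"),
]
query = ["acute_coronary_syndrome", "aspirin"]
predicted = classify(query, training, similarity)
```

In this toy setup the query shares no edge with either training document before enrichment, so the raw kernel is zero for both. After enrichment, the related concept pair produces shared edges with the first training document, which is the kind of benefit the abstract attributes to incorporating domain knowledge.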

Journal: Information Sciences	Publication Date: Mar 14, 2020
Citations: 24	License type: other-oa

Similar Papers

Applicability of Machine Learning Methods to Multi-label Medical Text Classification
Iuliia Lenivtceva ... Mariya Kashina
01 Jan 2020

Research on Medical Text Classification based on BioBERT-GRU-Attention
Weidong Chen ... Wenhai Li
20 Aug 2022

Medical Text Classification Using Hybrid Deep Learning Models with Multihead Attention
Sunil Kumar Prabhakar ... Dong-Ok Won
Computational Intelligence and Neuroscience | VOL. 2021
01 Jan 2020

Improving Medical Short Text Classification with Semantic Expansion Using Word-Cluster Embedding
Ying Shen ... Kai Lei
24 Jul 2018
