Effects of Classification Techniques on Medical Reports Classification

Elfadil Abdalla Mohamed,Fathi H Saad,Omer I E Mohamed

doi:10.24297/ijct.v13i2.2906

Abstract

Text classification is the process of assigning pre-defined category labels to documents based on what a classifications has learned from training examples. This paper investigates the partially supervised classification approach in the medical field. The approaches that have been evaluated include Rocchio, NaÃ¯ve Bayesian (NB), Spy, Support vector machine (SVM), and Expectation Maximization (EM). A combination of these methods has been conducted.Â The experimental result showed that the combination which uses EM in step 2 is always produces better results than those uses SVM using small set of training samples. We also found that reducing the features based on tf-tdf values is decreasing the classification performance dramatically. Moreover, reducing the features based on their frequencies improve the classification performance significantly while also increasing efficiency, but it may require some experimentationÂ

Highlights

Classification is a form of data analysis that extracts models describing important data classes [10]
Comparing the classification performance obtained by ROC, Naïve Bayesian (NB) and Spy using Support vector machine (SVM) method for step two we found ROC achieved the best results in term of accuracy and F-measure regardless the number of training samples used followed by Spy
In term on the considerable amount of strong features could be noticed by the classification accuracy obtained by ROC-EM (95.9%), for example, is very competitive to those obtained by ROC-EM and S-EM (95.89% and 95.89% respectively), and by the excellent classification performance in term of F-measure obtained by ROC-EM and S-EM (93.39 % and 91.29 % respectively) are much better that those obtained by the same techniques in Table 8 which are 85.90 and 85.88 respectively

Summary

Introduction

Classification is a form of data analysis that extracts models describing important data classes [10]. The extracted models are called classifiers which are used to predict categorical class labels. The medical field has recently received great attention regarding the analysis of medical data which is available in an electronic form. The nature of the medical data is either unstructured or semi-structured which make it difficult to be analyzed using traditional data mining techniques. The medical staffs need automatic classification methods to analyze and categorize this huge amount of data. The Gastroenterology unit of a local hospital in UK had just such a problem as they collected electronic reports on thousands of colonoscopy procedures, but could not give answer to simple questions, such as the percentage of successful colonoscopies undertaken [34]. The aim of colonoscopy is to check for medical problems such as bleeding, colon cancer, polyps, colitis, etc. [6]

Objectives

Methods

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Effects of Classification Techniques on Medical Reports Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY

Lead the way for us

Journal: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY	Publication Date: Apr 16, 2014
License type: CC BY 4.0

Similar Papers

SVM-Based decision fusion model for detecting concepts in films
P Muneesawang ... L Guan
-
P Muneesawang, et. al.P Muneesawang ... L Guan
01 Jan 2007
01 Jan 2007

Approximate modeling for high order non-linear functions using small sample sets
Tung-I Tsai ... Der-Chiang Li
Expert Systems with Applications | VOL. 34
Tung-I Tsai, et. al.Tung-I Tsai ... Der-Chiang Li
12 Oct 2006
Expert Systems with Applications | VOL. 34

A biologically inspired approach to learning spatio-temporal patterns
Banafsheh Rekabdar ... Richard Kelley
-
Banafsheh Rekabdar, et. al.Banafsheh Rekabdar ... Richard Kelley
01 Aug 2015
01 Aug 2015

Effects of Training Set Size on Supervised Machine-Learning Land-Cover Classification of Large-Area High-Resolution Remotely Sensed Data
Christopher A Ramezan ... Bradley S Price
Remote Sensing | VOL. 13
Christopher A Ramezan, et. al.Christopher A Ramezan ... Bradley S Price
21 Jan 2021
Remote Sensing | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Effects of Classification Techniques on Medical Reports Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: INTERNATIONAL JOURNAL OF COMPUTERS &amp; TECHNOLOGY

More From: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY