Abstract

Electronic Medical Record (EMR) is an important element of information technology in healthcare sector. EMR is an electronic record containing health-related information on patients that can be created and managed by authorized physician and staff in a healthcare service organization. EMR is a framework for determining diagnosis and treatment. EMR has free text and unstructured format which makes it more difficult to extract the hidden information as a decision support system. This study performs classification from Indonesian EMR for clinical decision support system (CDSS) in classifying patient diagnosis using Term Frequency-Inverse Document Frequency (TF-IDF) for feature extraction and Support Vector Machine (SVM) for classifier method. SVM is a powerful algorithm in high-dimensional data such as in textual data processing. The focus diagnoses classified in this paper are tuberculosis, cancer, diabetes mellitus, hypertension, and chronic kidney which have high prevalence rates in Indonesia. The model is built by considering the kernel function and the use of stopword removal or without stopword removal. The result showed that TF - IDF and SVM method could be used effectively to predict diagnosis with stop word removal. Classification performance increased with stopword removal on all SVM kernels with accuracy in linear kernel 89.91 %, polynomial kernel 90.58%, RBF kernel 90.75%, and sigmoid kernel 91.03%.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call