Classification Model on Big Data in Medical Diagnosis Based on Semi-Supervised Learning

Lei Wang, Wei Yan, Qing Qian, Wenbo Cheng, Jishuai Wang, Qiang Zhang

doi:10.1093/comjnl/bxaa006

Abstract

Big data in medical diagnosis can provide substantial value for clinical diagnosis, decision support and many other applications, but obtaining large amounts of labeled medical data takes considerable time and manpower. This paper proposes a classification model based on semi-supervised learning, using both labeled and unlabeled data, to process big data in medical diagnosis, which includes structured, semi-structured and unstructured data. For medical laboratory data, the paper proposes a self-training algorithm based on a repeated labeling strategy to address the problem that mislabeled samples weaken classifier performance. For medical record data, the paper first extracts features that are highly correlated with the classification results, based on a domain expert knowledge base; it then selects the unlabeled medical record data with the highest confidence to expand the training set and optimizes the classifiers of the tri-training algorithm, which uses a supervised learning algorithm to train three base classifiers. The experimental results show that the proposed semi-supervised classification model for medical diagnosis data performs well.
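The abstract does not spell out the repeated labeling strategy, so the following is only a minimal sketch of one plausible reading: a self-training loop in which every unlabeled sample is re-labeled from scratch in each round, so that an earlier mistaken pseudo-label can be revised or dropped. The nearest-centroid base classifier, the distance-based confidence score, the `threshold` value and the toy data are all assumptions made for illustration; none of them come from the paper.

```python
from math import dist

def fit_centroids(X, y):
    """Return {label: centroid} for feature vectors X with labels y."""
    sums, counts = {}, {}
    for x, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {c: [v / counts[c] for v in acc] for c, acc in sums.items()}

def predict_with_confidence(centroids, x):
    """Return (label, confidence); higher confidence means x is relatively
    closer to the winning centroid. Assumes at least two classes."""
    d = {c: dist(x, mu) for c, mu in centroids.items()}
    best = min(d, key=d.get)
    total = sum(d.values())
    conf = 1.0 if total == 0 else 1.0 - d[best] / total
    return best, conf

def self_train(X_lab, y_lab, X_unl, threshold=0.75, max_rounds=10):
    """Self-training in which pseudo-labels are recomputed every round
    (an assumed interpretation of 'repeated labeling')."""
    pseudo = {}  # index into X_unl -> current pseudo-label
    centroids = fit_centroids(X_lab, y_lab)
    for _ in range(max_rounds):
        # Re-label ALL unlabeled samples with the current model, keeping
        # only predictions at or above the confidence threshold.
        new_pseudo = {}
        for i, x in enumerate(X_unl):
            label, conf = predict_with_confidence(centroids, x)
            if conf >= threshold:
                new_pseudo[i] = label
        if new_pseudo == pseudo:  # pseudo-labels stable: stop
            break
        pseudo = new_pseudo
        # Retrain on the labeled data plus the current pseudo-labeled data.
        X_train = list(X_lab) + [X_unl[i] for i in pseudo]
        y_train = list(y_lab) + [pseudo[i] for i in pseudo]
        centroids = fit_centroids(X_train, y_train)
    return centroids, pseudo

# Toy data (hypothetical): two labeled samples, five unlabeled ones.
X_lab = [[0.0, 0.0], [4.0, 4.0]]
y_lab = ["neg", "pos"]
X_unl = [[0.5, 0.2], [3.8, 4.1], [1.0, 0.5], [2.0, 2.0], [3.5, 3.0]]

centroids, pseudo = self_train(X_lab, y_lab, X_unl)
print(pseudo)
```

In this toy run the ambiguous sample `[2.0, 2.0]` never reaches the confidence threshold and so is never added to the training set, while the other four samples keep the same pseudo-label across rounds and the loop stops once the labels are stable.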

Journal: The Computer Journal
Publication Date: Mar 17, 2020
Citations: 8

Similar Papers

Improving the Trustworthiness of Interactive Visualization Tools for Healthcare Data through a Medical Fuzzy Expert System
Abdullah M Albarrak
Diagnostics | VOL. 13
13 May 2023

Evaluation of Parameter Update Effects in Deep Semi-Supervised Learning Algorithms
Elie Neghawi ... Yan Liu
01 Jul 2020

Semisupervised Learning for Seismic Monitoring Applications
Lisa Linville ... Jennifer Galasso
Seismological Research Letters | VOL. 92
21 Oct 2020

Information Technology Support for Athlete Health Monitoring and Medical Diagnosis
Jing Su
Scalable Computing: Practice and Experience | VOL. 25
01 Oct 2024
