LDSVM: Leukemia Cancer Classification Using Machine Learning

Abdul Karim,Azhari Azhari,Mobeen Shahroz,Samir Brahim Belhaouri,Khabib Mustofa

doi:10.32604/cmc.2022.021218

Abstract

Leukemia is blood cancer, including bone marrow and lymphatic tissues, typically involving white blood cells. Leukemia produces an abnormal amount of white blood cells compared to normal blood. Deoxyribonucleic acid (DNA) microarrays provide reliable medical diagnostic services to help more patients find the proposed treatment for infections. DNA microarrays are also known as biochips that consist of microscopic DNA spots attached to a solid glass surface. Currently, it is difficult to classify cancers using microarray data. Nearly many data mining techniques have failed because of the small sample size, which has become more critical for organizations. However, they are not highly effective in improving results and are frequently employed by doctors for cancer diagnosis. This study proposes a novel method using machine learning algorithms based on microarrays of leukemia GSE9476 cells. The main aim was to predict the initial leukemia disease. Machine learning algorithms such as decision tree (DT), naive bayes (NB), random forest (RF), gradient boosting machine (GBM), linear regression (LinR), support vector machine (SVM), and novel approach based on the combination of Logistic Regression (LR), DT and SVM named as ensemble LDSVM model. The k-fold cross-validation and grid search optimization methods were used with the LDSVM model to classify leukemia in patients and comparatively analyze their impacts. The proposed approach evaluated better accuracy, precision, recall, and f1 scores than the other algorithms. Furthermore, the results were relatively assessed, which showed LDSVM performance. This study aims to successfully predict leukemia in patients and enhance prediction accuracy in minimum time. Moreover, a Synthetic minority oversampling technique (SMOTE) and Principal compenent analysis (PCA) approaches were implemented. This makes the records generalized and evaluates the outcomes well. PCA reduces the feature count without losing any information and deals with class imbalanced datasets, as well as faster model execution along with less computation cost. In this study, a novel process was used to reduce the column results to develop a faster and more rapid experiment execution.

Highlights

Leukemia is the most common type of blood cancer in all age groups, in children
Leukemia cancer classification is proposed based on deoxyribonucleic acid (DNA) microarray data using the proposed approach, which consists of a novel structure of classification methods
This study proposed machine-learning best algorithms for leukemia cancer classification

Summary

Introduction

Leukemia is the most common type of blood cancer in all age groups, in children. This abnormal concept is caused by increased blood cell proliferation and immature growth, which can harm red blood cells, brain tissue, and the immune system. A cell is instructed by the genetic code when to reproduce and when to die. Changes in gene expression may lead to defective instructions. Some genes modify proteins to fix damaged cells, which may be related to cancer. If parents have these mutations, they may be inherited by their offspring. Because of the numerous genes, classifying disease-related genes is a complicated task in machine learning

Objectives

Methods

Results

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computers, Materials & Continua	Publication Date: Jan 1, 2022
Citations: 4	License type: cc-by

R Discovery Prime

LDSVM: Leukemia Cancer Classification Using Machine Learning

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Computers, Materials & Continua

Lead the way for us

Similar Papers

A new FPGA-based edge detection system for the gridding of DNA microarray images
Luca Sterpone ... Massimo Violante
-
Luca Sterpone, et. al.Luca Sterpone ... Massimo Violante
01 May 2007
01 May 2007

A Novel Dual-Core Architecture for the Analysis of DNA Microarray Images
L Sterpone
IEEE Transactions on Instrumentation and Measurement | VOL. 58
L SterponeL Sterpone
01 Aug 2009
IEEE Transactions on Instrumentation and Measurement | VOL. 58

Reconfigurable Devices for the Analysis of DNA Microarray
-
-
--
09 Oct 2008
09 Oct 2008

Principles and Applications of Deoxyribonucleic Acid Microarray: A Review
Haben Fesseha ... Hiwot Tilahun
Pathology and Laboratory Medicine – Open Journal | VOL. 3
Haben Fesseha, et. al.Haben Fesseha ... Hiwot Tilahun
30 Mar 2021
Pathology and Laboratory Medicine – Open Journal | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

LDSVM: Leukemia Cancer Classification Using Machine Learning

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Computers, Materials &amp; Continua

More From: Computers, Materials & Continua