Breast, Lung and Liver Cancer Classification from Structured and Unstructured Data

Beatriz A Gonzalez-Beltrán,Erick E Montelongo-Gonzalez,José A Reyes-Ortiz

doi:10.13053/cys-26-1-4167

Abstract

Currently, cancer is a worldwide public health problem. Machine and deep learning techniques hold great promise in healthcare by analyzing Electronic Health Records (EHR) that contain a large collection of structured and unstructured data. However, most research has been done with structured data, and valuable data is also found in doctor’s plain-text notes. Thus, this paper proposes an approach to classify breast, liver, and lung cancer based on structured and unstructured data obtained from the MIMIC-II clinical database by using machine and deep learning techniques. In particular, the Paragraph Vector algorithm is used as a deep learning approach to text representation. The goal of this work is to help physicians in early diagnosis of cancer. The proposed approach was tested on a balanced dataset of breast, liver, and lung cancer patient records. Pre-processing is done with structured and unstructured data, and the result is used as input variables to three machine learning models: Support Vector Machines, Multi Layer Perceptron, and Adaboost-SAMME. Then, the scoring metrics for these models are calculated in different training data configurations to choose the best performing model for classification. Results show that the best performing model was obtained with MLP, achieving 89% precision using unstructured data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Breast, Lung and Liver Cancer Classification from Structured and Unstructured Data

Abstract

Talk to us

Similar Papers

More From: Computación y Sistemas

Lead the way for us

Similar Papers

Early Breast Cancer Prediction using Machine Learning and Deep Learning Techniques
Swati B Patil ... Parikshit N Mahalle
International Journal on Recent and Innovation Trends in Computing and Communication | VOL. 11
Swati B Patil, et. al.Swati B Patil ... Parikshit N Mahalle
07 Oct 2023
International Journal on Recent and Innovation Trends in Computing and Communication | VOL. 11

Comprehensive Study for Breast Cancer Using Deep Learning and Traditional Machine Learning
-
ZANCO JOURNAL OF PURE AND APPLIED SCIENCES | VOL. 34
--
12 Apr 2022
ZANCO JOURNAL OF PURE AND APPLIED SCIENCES | VOL. 34

Application of deep and machine learning techniques for multi-label classification performance on psychotic disorder diseases
Israel Elujide ... Jeremiah O Olamijuwon
Informatics in Medicine Unlocked | VOL. 23
Israel Elujide, et. al.Israel Elujide ... Jeremiah O Olamijuwon
01 Jan 2020
Informatics in Medicine Unlocked | VOL. 23

Machine Learning Models for Cancer Type Classification with Unstructured Data
Erick E Montelongo González ... José A Reyes Ortiz
Computación y Sistemas | VOL. 24
Erick E Montelongo González, et. al.Erick E Montelongo González ... José A Reyes Ortiz
30 Jun 2020
Computación y Sistemas | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Breast, Lung and Liver Cancer Classification from Structured and Unstructured Data

Abstract

Talk to us

Similar Papers

More From: Computación y Sistemas