The Challenge of Choosing the Best Classification Method in Radiomic Analyses: Recommendations and Applications to Lung Cancer CT Images.

Federica Corso,Cristiano Rampinelli,Marta Cremonesi,Federica Bellerba,Giulia Tini,Daniela Origgi,Giuliana Lo Presti,Francesca Botta,Luca Mazzarella,Chiara Paganelli,Simone Pietro De Angelis,Massimo Bellomi,Stefania Rizzo,Pier Giuseppe Pelicci,Sara Gandini,Sara Raimondi,Lisa Rinaldi,Noemi Garau

doi:10.3390/cancers13123088

Abstract

Simple Summary: Radiomics aims to extract high-dimensional features from clinical images and associate them with clinical outcomes. These associations may be further investigated with machine learning models; however, guidelines on the most suitable method to support clinical decisions are still missing. To improve the reliability and the accuracy of radiomic features in the prediction of a binary variable in a lung cancer setting, we compared several machine learning classifiers and feature selection methods on simulated data. These account for important characteristics that may vary in real clinical datasets: sample size, outcome balancing and association strength between radiomic features and outcome variables. We were able to suggest the most suitable classifiers for each studied case and to evaluate the impact of method choices. Our work highlights the importance of these choices in radiomic analyses and provides guidelines on how to select the best models for the data at hand.

Radiomics uses high-dimensional sets of imaging features to predict biological characteristics of tumors and clinical outcomes. The choice of the algorithm used to analyze radiomic features and perform predictions has a high impact on the results, so identifying adequate machine learning methods for radiomic applications is crucial. In this study we aim to identify suitable approaches of analysis for radiomic-based binary predictions, according to sample size, outcome balancing and the features–outcome association strength. Simulated data were obtained by reproducing the correlation structure among 168 radiomic features extracted from Computed Tomography images of 270 Non-Small-Cell Lung Cancer (NSCLC) patients and their association with lymph node status. The performance of six classifiers combined with six feature selection (FS) methods was assessed on the simulated data using AUC (Area Under the Receiver Operating Characteristic Curve), sensitivity, and specificity. For all the FS methods and regardless of the association strength, the tree-based classifiers Random Forest and Extreme Gradient Boosting performed well (AUC ≥ 0.73), showing the best trade-off between sensitivity and specificity. On small samples, performance was generally lower and more variable than on medium and large samples. FS methods generally did not improve performance. Thus, in radiomic studies, we suggest evaluating the choice of FS method and classifier in light of the specific sample size, outcome balancing, and association strength.
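The three performance measures named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code: the function names `auc` and `sensitivity_specificity`, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration. AUC is computed here in its rank-based (Mann-Whitney) form, as the fraction of positive/negative pairs in which the positive case receives the higher score.

```python
from typing import Sequence, Tuple


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the share of (positive, negative) pairs ranked correctly.

    Tied scores count as half a correct pair.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def sensitivity_specificity(
    y_true: Sequence[int], scores: Sequence[float], threshold: float = 0.5
) -> Tuple[float, float]:
    """Sensitivity and specificity after thresholding the scores.

    The 0.5 default threshold is an illustrative assumption.
    """
    tp = fn = tn = fp = 0
    for y, s in zip(y_true, scores):
        predicted = 1 if s >= threshold else 0
        if y == 1:
            tp += predicted
            fn += 1 - predicted
        else:
            fp += predicted
            tn += 1 - predicted
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both outcome classes must be present")
    return tp / (tp + fn), tn / (tn + fp)


# Toy, made-up example with an unbalanced outcome (2 positives, 4 negatives).
labels = [0, 0, 0, 0, 1, 1]
predicted_scores = [0.10, 0.40, 0.35, 0.80, 0.70, 0.90]
print(auc(labels, predicted_scores))
print(sensitivity_specificity(labels, predicted_scores))
```

Reporting sensitivity and specificity alongside AUC matters most when the outcome is unbalanced, because a single threshold can yield high specificity with poor sensitivity (or the reverse) even when the AUC looks acceptable.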

Highlights

Radiomics focuses on extracting and mining high-dimensional sets of quantitative features from medical images, which are expected to provide a detailed and comprehensive characterization of the tumor phenotype [1], being calculated on the entire volume of the lesion
Predictive and prognostic models characterized by high accuracy, reliability, and efficiency are vital factors for radiomics to play an active role in supporting clinical decisions in oncology [12,13,14,15,16,17]
To identify the main issues that should be tackled when simulating radiomic features, we first carried out some descriptive analyses of our real data on Non-Small-Cell Lung Cancer (NSCLC) patients

Summary

Introduction

Radiomics focuses on extracting and mining high-dimensional sets of quantitative features from medical images, which are expected to provide a detailed and comprehensive characterization of the tumor phenotype [1], being calculated on the entire volume of the lesion. Predictive and prognostic models characterized by high accuracy, reliability, and efficiency are vital factors for radiomics to play an active role in supporting clinical decisions in oncology [12,13,14,15,16,17]. Fewer papers have focused on the impact of different methods, such as feature selection and classification, on predictive modelling. The identification of the optimal ML methods for radiomic applications represents a crucial step toward stable and clinically relevant radiomic biomarkers.



Journal: Cancers	Publication Date: Jun 21, 2021
Citations: 9	License type: CC BY 4.0


