To explore the feasibility of differentiating three predominant metastatic tumor types using lung computed tomography (CT) radiomics features based on supervised machine learning. This retrospective analysis included 252 lung metastases (LM) (from 78 patients), which were divided into the training (n=176) and test (n=76) cohort randomly. The metastases originated from colorectal cancer (n=97), breast cancer (n=87), and renal carcinoma (n=68). An additional 77 LM (from 35 patients) were used for external validation. All radiomics features were extracted from lung CT using an open-source software called 3D slicer. The least absolute shrinkage and selection operator (LASSO) method selected the optimal radiomics features to build the model. Random forest and support vector machine (SVM) were selected to build three-class and two-class models. The performance of the classification model was evaluated with the area under the receiver operating characteristic curve (AUC) by two strategies: one-versus-rest and one-versus-one. Eight hundred and fifty-one quantitative radiomics features were extracted from lung CT. By LASSO, 23 optimal features were extracted in three-class, and 25, 29, and 35 features in two-class for differentiating every two of three LM (colorectal cancer vs. renal carcinoma, colorectal cancer vs. breast cancer, and breast cancer vs. renal carcinoma, respectively). The AUCs of the three-class model were 0.83 for colorectal cancer, 0.79 for breast cancer, and 0.91 for renal carcinoma in the test cohort. In the external validation cohort, the AUCs were 0.77, 0.83, and 0.81, respectively. Swarmplot shows the distribution of radiomics features among three different LM types. In the two-class model, high accuracy and AUC were obtained by SVM. The AUC of discriminating colorectal cancer LM from renal carcinoma LM was 0.84, and breast cancer LM from colorectal cancer LM and renal carcinoma LM were 0.80 and 0.94, respectively. The AUCs were 0.77, 0.78, and 0.84 in the external validation cohort. Quantitative radiomics features based on Lung CT exhibited good discriminative performance in LM of primary colorectal cancer, breast cancer, and renal carcinoma.
Read full abstract