Abstract

BackgroundLung cancer remains the leading cause of cancer deaths across the world. Early detection of lung cancer by low-dose computed tomography (LDCT) can reduce the mortality rate. However, making a definitive preoperative diagnosis of malignant pulmonary nodules (PNs) found by LDCT is a clinical challenge. This study aimed to develop a prediction model based on DNA methylation biomarkers and radiological characteristics for identifying malignant pulmonary nodules from benign PNs.MethodsWe assessed three DNA methylation biomarkers (PTGER4, RASSF1A, and SHOX2) and clinically-relevant variables in a training cohort of 110 individuals with PNs. Four machine-learning-based prediction models were established and compared, including the K-nearest neighbors (KNN), random forest (RF), support vector machine (SVM), and logistic regression (LR) algorithms. Variables of the best-performing algorithm (LR) were selected through stepwise use of Akaike’s information criterion (AIC). The constructed prediction model was compared with the methylation biomarkers and the Mayo Clinic model using the non-parametric approach of DeLong et al. with the area under a receiver operator characteristic curve (AUC) analysis.ResultsA prediction model was finally constructed based on three DNA methylation biomarkers and one radiological characteristic for identifying malignant from benign PNs. The developed prediction model achieved an AUC value of 0.951 in malignant PNs diagnosis, significantly higher than the three DNA methylation biomarkers (0.912, 95% CI:0.843–0.958, p = 0.013) or Mayo Clinic model (0.823, 95% CI:0.739–0.890, p = 0.001). Validation of the prediction model in the testing cohort of 100 subjects with PNs confirmed the diagnostic value.ConclusionWe have shown that integrating DNA methylation biomarkers and radiological characteristics could more accurately identify lung cancer in subjects with CT-found PNs. The prediction model developed in our study may provide clinical utility in combination with LDCT to improve the over-all diagnosis of lung cancer.

Highlights

  • Lung cancer remains the leading cause of cancer deaths across the world

  • We aimed to investigate if combining DNA methylation biomarkers with clinical/radiological characteristics could more efficiently distinguish malignant from benign lung nodules detected by low-dose computed tomography (LDCT)

  • We found that the models of support vector machine (SVM) and logistic regression (LR) can accurately classify malignant from benign pulmonary nodules (PNs) with area under the curve (AUC) of 0.92 and 0.93

Read more

Summary

Introduction

Lung cancer remains the leading cause of cancer deaths across the world. Detection of lung cancer by low-dose computed tomography (LDCT) can reduce the mortality rate. This study aimed to develop a prediction model based on DNA methylation biomarkers and radiological characteristics for identifying malignant pulmonary nodules from benign PNs. Lung cancer is the second most common cancer globally and the leading cause of cancer mortality worldwide [1]. The 5-year survival rate of lung cancer is only 15–19% at all stages. Low-dose computed tomography (LDCT) is widely accepted as a reliable screening tool for lung cancer early detection. The National Lung Screening Trial (NLST) reported that LDCT decreases the mortality rate by 20% in high-risk people [4, 5]. The National Lung Screening Trial showed that in heavy smokers, the positive rate of indeterminate PNs detected by LDCT was 24.2%; 96.4% of these PNs were confirmed to be false positives over the three rounds of screening [5]

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call