Lung cancer remains a leading cause of cancer-related deaths globally. Early and accurate detection is crucial for improving patient outcomes. Traditional methods, relying on manual interpretation of medical images, are time-consuming and prone to errors. Deep learning, particularly convolutional neural networks (CNNs), offers an automated alternative capable of learning intricate patterns from medical images. However, previous deep learning models for lung cancer detection have faced challenges such as limited data, inadequate feature extraction, interpretability issues, and susceptibility to data variability. This paper presents a novel deep learning methodology that addresses these limitations. Our approach leverages expansive datasets, incorporates advanced feature extraction techniques, improves interpretability, and accommodates the diverse nature of lung cancer. Specifically, we develop dedicated models for both chest X-ray and CT images utilizing publicly available datasets from Kaggle. Through the integration of feature selection and model selection techniques—such as employing a genetic algorithm in conjunction with the tree-based pipeline optimization tool (TPOT)—we achieved remarkable accuracy. Our X-ray model attains an overall accuracy of 95.47%, the CT model achieves an accuracy of 98.70%, and the combined model achieves an impressive overall accuracy of 98.93%. Our methodology significantly enhances the performance and efficiency of lung cancer detection and is a valuable tool for early diagnosis and intervention.
Read full abstract