Abstract
This paper demonstrates a computer-aided diagnosis (CAD) system for lung cancer classification of CT scans with unmarked nodules, a dataset from the Kaggle Data Science Bowl, 2017. Thresholding was used as an initial segmentation approach to segment out lung tissue from the rest of the CT scan. Thresholding produced the next best lung segmentation. The initial approach was to directly feed the segmented CT scans into 3D CNNs for classification, but this proved to be inadequate. Instead, a modified U-Net trained on LUNA16 data (CT scans with labeled nodules) was used to first detect nodule candidates in the Kaggle CT scans. The U-Net nodule detection produced many false positives, so regions of CTs with segmented lungs where the most likely nodule candidates were located as determined by the U-Net output were fed into 3D Convolutional Neural Networks (CNNs) to ultimately classify the CT scan as positive or negative for lung cancer. The 3D CNNs produced a test set Accuracy of 86.6%. The performance of our CAD system outperforms the current CAD systems in literature which have several training and testing phases that each requires a lot of labeled data, while our CAD system has only three major phases (segmentation, nodule candidate detection, and malignancy classification), allowing more efficient training and detection and more generalizability to other cancers.
Highlights
Lung cancer is one of the most common cancers, aca dataset comprising lung nodules from more than 1390 low dose CT scans.counting for over 225,000 cases, 150,000 deaths, and $12 billion in health care costs yearly in the U.S [1]
The 3D Convolutional Neural Networks (CNNs) produced air, so for the computer-aided diagnosis (CAD) systems search to be efficient, this noise a test set Accuracy of 86.6%
The Data Science Bowl (DSB) database consists of 1397 CT scans and 248580 slices
Summary
Lung cancer is one of the most common cancers, aca dataset comprising lung nodules from more than 1390 low dose CT scans. Counting for over 225,000 cases, 150,000 deaths, and $12 billion in health care costs yearly in the U.S [1]. It is an axial slice) o in DICOM form one of the deadliest cancers; overall, only 17% of people in the Kaggle data the U.S diagnosed with lung cancer survive five years after in our malignan the diagnosis, and the survival rate is lower in developing. Detection of lung cancer (detection during the earlier stages) significantly improves the chances for survival, but it is more difficult to detect early stages of lung cancer as there are fewer symptoms [1]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Advanced Computer Science and Applications
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.