To evaluate the performance of automatic deep learning (DL) algorithm for size, mass, and volume measurements in predicting prognosis of lung adenocarcinoma (LUAD) and compared with manual measurements. A total of 542 patients with clinical stage 0-I peripheral LUAD and with preoperative CT data of 1-mm slice thickness were included. Maximal solid size on axial image (MSSA) was evaluated by two chest radiologists. MSSA, volume of solid component (SV), and mass of solid component (SM) were evaluated by DL. Consolidation-to-tumor ratios (CTRs) were calculated. For ground glass nodules (GGNs), solid parts were extracted with different density level thresholds. The prognosis prediction efficacy of DL was compared with that of manual measurements. Multivariate Cox proportional hazards model was used to find independent risk factors. The prognosis prediction efficacy of T-staging (TS) measured by radiologists was inferior to that of DL. For GGNs, MSSA-based CTR measured by radiologists (RMSSA%) could not stratify RFS and OS risk, whereas measured by DL using 0HU (2D-AIMSSA0HU%) could by using different cutoffs. SM and SV measured by DL using 0 HU (AISM0HU% and AISV0HU%) could effectively stratify the survival risk regardless of different cutoffs and were superior to 2D-AIMSSA0HU%. AISM0HU% and AISV0HU% were independent risk factors. DL algorithm can replace human for more accurate T-staging of LUAD. For GGNs, 2D-AIMSSA0HU% could predict prognosis rather than RMSSA%. The prediction efficacy of AISM0HU% and AISV0HU% was more accurate than of 2D-AIMSSA0HU% and both were independent risk factors. Deep learning algorithm could replace human for size measurements and could better stratify prognosis than manual measurements in patients with lung adenocarcinoma. • Deep learning (DL) algorithm could replace human for size measurements and could better stratify prognosis than manual measurements in patients with lung adenocarcinoma (LUAD). • For GGNs, maximal solid size on axial image (MSSA)-based consolidation-to-tumor ratio (CTR) measured by DL using 0 HU could stratify survival risk than that measured by radiologists. • The prediction efficacy of mass- and volume-based CTRs measured by DL using 0 HU was more accurate than of MSSA-based CTR and both were independent risk factors.
Read full abstract