Abstract

To study and investigate the synergistic benefit of incorporating both conventional handcrafted and learning-based features in disease identification across a wide range of clinical setups. In this retrospective study, we collected 170, 150, 209, and 137 patients with four different disease types associated with identification objectives : Lymph node metastasis status of gastric cancer (GC), 5-year survival status of patients with high-grade osteosarcoma (HOS), early recurrence status of intrahepatic cholangiocarcinoma (ICC), and pathological grades of pancreatic neuroendocrine tumors (pNETs). Computed tomography (CT) and magnetic resonance imaging (MRI) were used to derive image features for GC/HOS/pNETs and ICC, respectively. In each study, 67 universal handcrafted features and study-specific features based on the sparse autoencoder (SAE) method were extracted and fed into the subsequent feature selection and learning model to predict the corresponding disease identification. Models using handcrafted alone, SAE alone, and hybrid features were optimized and their performance was compared. Prominent features were analyzed both qualitatively and quantitatively to generate study-specific and cross-study insight. In addition to direct performance gain assessment, correlation analysis was performed to assess the complementarity between handcrafted features and SAE features. On the independent hold-off test, the handcrafted, SAE, and hybrid features based prediction yielded area under the curve of 0.761versus 0.769versus 0.829 for GC, 0.629versus 0.740versus 0.709 for HOS, 0.717versus 0.718versus 0.758 for ICC, and 0.739versus 0.715versus 0.771 for pNETs studies, respectively. In three out of the four studies, prediction using the hybrid features yields the best performance, demonstrating the general benefit in using hybrid features. Prediction with SAE features alone had the best performance in the HOS study, which may be explained by the complexity of HOS prognosis and the possibility of a slight overfit due to higher correlation between handcrafted and SAE features. This study demonstrated the general benefit of combing handcrafted and learning-based features in radiomics modeling. It also clearly illustrates the task-specific and data-specific dependency on the performance gain and suggests that while the common methodology of feature combination may be applied across various studies and tasks, study-specific feature selection and model optimization are still necessary to achieve high accuracy and robustness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.