Feature extraction in ML plays a crucial role in transforming raw data into a more meaningful and interpretable representation. In this study, we thoroughly examined a range of feature extraction techniques and assessed their impact on the binary classification models for medical images, utilizing a diverse and rich set of medical imaging modalities. Using H&E-stained, chest X-ray, and retina OCT images, we applied methods to extract statistical, radiomics, and deep features. These features were then used to develop PCA-LDA models as the employed classifier. We evaluated the models based on two decisive metrics: latency and performance. Latency measured the time taken for feature extraction and prediction, while mean sensitivity (balanced accuracy) characterizes the model performance. Our comparative study revealed that statistical and radiomics features were less effective for medical image classification, as they showed high latency and lower performance scores. In contrast, pre-trained DL networks performed efficiently, with high sensitivity and low latency. For H&E-stained images, the statistical feature extraction took about an hour and achieved 90.8% sensitivity, while ResNet50 reduced processing time fourfold and increased sensitivity to 96.9%. For chest X-rays, radiomics features were time-intensive with 92.2% sensitivity, while ResNet50 improved sensitivity to 96% with faster extraction time. For retina OCT images, radiomics yielded a sensitivity of 91%, while DenseNet121 achieved 98.6% sensitivity in 15min. These findings underscore the superior performance of DL techniques over the statistical and radiomics features, highlighting their potential for real-world applications where accurate and rapid diagnostic decisions are essential.
Read full abstract