Small Datasets Research Articles

The COVID-19 pandemic has emerged as a global health crisis, impacting millions worldwide. Although chest computed tomography (CT) scan images are pivotal in diagnosing COVID-19, their manual interpretation by radiologists is time-consuming and potentially subjective. Automated computer-aided diagnostic (CAD) frameworks offer efficient and objective solutions. However, machine learning and deep learning methods often face reproducibility challenges due to underlying biases and methodological flaws. To address these issues, we propose XCT-COVID, an explainable, transferable, and reproducible CAD framework based on deep transfer learning to accurately predict COVID-19 infection from CT scan images. This is the first study to develop three distinct models within a unified framework by leveraging a previously unexplored large dataset and two widely used smaller datasets. We employed five known convolutional neural network architectures, both with and without pretrained weights, on the larger dataset. We optimized hyperparameters through extensive grid search and 5-fold cross-validation (CV), significantly enhancing the model performance. Experimental results from the larger dataset showed that the VGG16 architecture (XCT-COVID-L) with pretrained weights consistently outperformed the other architectures, achieving the best performance on both 5-fold CV and the independent test set. When evaluated on the external datasets, XCT-COVID-L performed well on data with similar distributions, demonstrating its transferability. However, its performance decreased significantly on the smaller datasets with lower-quality images. To address this, we developed two additional models, XCT-COVID-S1 and XCT-COVID-S2, specifically for the smaller datasets; both outperformed existing methods. Moreover, eXplainable Artificial Intelligence (XAI) analyses were employed to interpret the models’ functionalities.
For prediction and reproducibility purposes, the implementation of XCT-COVID is publicly accessible at https://github.com/cbbl-skku-org/XCT-COVID/.
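
The tuning procedure named in the abstract above, grid search scored by 5-fold CV, can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the XCT-COVID implementation: the helpers `k_fold_indices`, `grid_search_cv`, and `toy_score`, as well as the hyperparameter names and values, are hypothetical, and the toy scorer ignores the data so that the example stays self-contained.

```python
from itertools import product
from statistics import mean


def k_fold_indices(n_samples, k=5):
    """Split range(n_samples) into k contiguous folds of near-equal size."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def grid_search_cv(train_and_score, param_grid, n_samples, k=5):
    """Try every combination in param_grid; return (best_params, best_mean_score).

    train_and_score(params, train_idx, val_idx) is a caller-supplied
    (hypothetical) function that fits a model on train_idx and returns a
    validation score on val_idx, where higher is better.
    """
    folds = k_fold_indices(n_samples, k)
    names = sorted(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        scores = []
        for i, val_idx in enumerate(folds):
            # Every fold except the i-th one is used for training.
            train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
            scores.append(train_and_score(params, train_idx, val_idx))
        mean_score = mean(scores)
        if mean_score > best_score:
            best_params, best_score = params, mean_score
    return best_params, best_score


def toy_score(params, train_idx, val_idx):
    """Stand-in scorer (assumption): peaks at lr=0.001, batch_size=32."""
    return 1.0 - abs(params["lr"] - 0.001) - 0.01 * abs(params["batch_size"] - 32)


grid = {"lr": [0.01, 0.001, 0.0001], "batch_size": [16, 32]}
best_params, best_score = grid_search_cv(toy_score, grid, n_samples=100, k=5)
print(best_params, best_score)
```

In a real setting `train_and_score` would train a network on the training indices and report a validation metric; for class-imbalanced medical data a stratified split would usually replace the contiguous folds used here.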

Image super-resolution involves reconstructing a blurry, low-resolution image with limited information into a clear, high-resolution image containing more detailed information. The images generated by super-resolution reconstruction can enhance the performance of downstream computer vision tasks and hold wide application prospects in fields such as industrial fault detection, plant phenotype parameter extraction, and medical imaging. High-frequency components in images, such as edges and texture details, typically require more attention. However, when training samples are limited, effectively recovering clear high-frequency details becomes highly challenging. Therefore, this paper proposes a single-image super-resolution method based on generative adversarial networks, named DESRGAN. Compared to existing methods, DESRGAN achieves better reconstruction of image details even with a limited number of training samples. DESRGAN introduces several key innovations: a shallow generator structure to address overfitting in small-sample scenarios, a dual-stream feature extraction network with dilated convolutions to capture multi-scale contextual information and expand the receptive field, and an artifact loss designed to eliminate artifacts and preserve the true high-frequency details of the super-resolved images. Extensive ablation experiments and comparative studies with multiple state-of-the-art models are conducted on two small-sample datasets, "Root" and "Leaves," as well as five publicly available datasets. The results demonstrate that the proposed DESRGAN achieves superior performance in small-sample single-image super-resolution tasks, with improvements of 1.39 dB in PSNR and 0.013 in SSIM. The generated high-resolution images exhibit clear texture and edge structures, presenting favorable subjective visual effects. Moreover, the model displays strong generalization capabilities.
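
For readers unfamiliar with the PSNR figure quoted above, the standard definition is 10 · log10(MAX² / MSE), where MAX is the largest possible pixel value and MSE is the mean squared error between the reference and reconstructed images. The sketch below is an illustration only, not code from the DESRGAN paper: the function name `psnr`, the flat-list image representation, and the 8-bit default `max_value` are assumptions.

```python
import math


def psnr(reference, reconstructed, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences.

    Images are assumed to be flattened into sequences of intensities in
    the range [0, max_value].
    """
    if len(reference) != len(reconstructed) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((r - x) ** 2 for r, x in zip(reference, reconstructed)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


reference = [52, 55, 61, 66]
reconstructed = [54, 55, 60, 63]
print(round(psnr(reference, reconstructed), 2))
```

Because the scale is logarithmic, a gain of 1.39 dB on a single image corresponds to the MSE shrinking by a factor of roughly 10^0.139 ≈ 1.38; figures averaged over a dataset do not translate quite so directly.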

Articles published on Small Datasets

Estimation of minimal data sets sizes for machine learning predictions in digital mental health interventions

An explainable analysis of diabetes mellitus using statistical and artificial intelligence techniques

Quantum neural network-assisted learning for small medical datasets: a case study in emphysema detection

Small RNA sequencing data of plasma extracellular vesicles in a breast cancer screening population

3D radiative transfer modeling of almond canopy for nitrogen estimation by hyperspectral imaging

DIAFM: An Improved and Novel Approach for Incremental Frequent Itemset Mining

Implementing a Bayesian approach using Stan with Torsten: Population pharmacokinetics analysis of somatrogon.

Enhancing Software Effort Estimation with Pre-Trained Word Embeddings: A Small-Dataset Solution for Accurate Story Point Prediction

Axial-UNet++ Power Line Detection Network Based on Gated Axial Attention Mechanism

Word embeddings on ideology and issues from Swedish parliamentarians’ motions: a comparative approach

A novel knowledge distillation framework for enhancing small object detection in blurry environments with unmanned aerial vehicle-assisted images

Detection and Identification of Stuttering Types Using Siamese Network

Application of artificial intelligence-based detection of furcation involvement in mandibular first molar using cone beam tomography images- a preliminary study

Chain-structured neural architecture search for financial time series forecasting

Leveraging deep transfer learning and explainable AI for accurate COVID-19 diagnosis: Insights from a multi-national chest CT scan study

Hierarchical Bayesian modeling for Inverse Uncertainty Quantification of system thermal-hydraulics code using critical flow experimental data

Unveiling Lung Diseases in CT Scan Images With a Hybrid Bio‐Inspired Mutated Spider‐Monkey and Crow Search Algorithm

Beware of diffusion models for synthesizing medical images - A comparison with GANs in terms of memorizing brain MRI and chest x-ray images

Advancements and Challenges: A Comprehensive Review of GAN-based Models for the Mitigation of Small Dataset and Texture Sticking Issues in Fake License Plate Recognition

DESRGAN: Detail-enhanced generative adversarial networks for small sample single image super-resolution
