Corrupted Images Research Articles

Gastrointestinal endoscopic image analysis presents significant challenges, such as considerable variations in quality due to the challenging in-body imaging environment, the often-subtle nature of abnormalities with low interobserver agreement, and the need for real-time processing. These challenges pose strong requirements on the performance, generalization, robustness and complexity of deep learning-based techniques in such safety–critical applications. While Convolutional Neural Networks (CNNs) have been the go-to architecture for endoscopic image analysis, recent successes of the Transformer architecture in computer vision raise the possibility to update this conclusion. To this end, we evaluate and compare clinically relevant performance, generalization and robustness of state-of-the-art CNNs and Transformers for neoplasia detection in Barrett’s esophagus. We have trained and validated several top-performing CNNs and Transformers on a total of 10,208 images (2,079 patients), and tested on a total of 7,118 images (998 patients) across multiple test sets, including a high-quality test set, two internal and two external generalization test sets, and a robustness test set. Furthermore, to expand the scope of the study, we have conducted the performance and robustness comparisons for colonic polyp segmentation (Kvasir-SEG) and angiodysplasia detection (Giana). The results obtained for featured models across a wide range of training set sizes demonstrate that Transformers achieve comparable performance as CNNs on various applications, show comparable or slightly improved generalization capabilities and offer equally strong resilience and robustness against common image corruptions and perturbations. These findings confirm the viability of the Transformer architecture, particularly suited to the dynamic nature of endoscopic video analysis, characterized by fluctuating image quality, appearance and equipment configurations in transition from hospital to hospital. The code is made publicly available at: https://github.com/BONS-AI-VCA-AMC/Endoscopy-CNNs-vs-Transformers.

Simulated computed tomography (CT) images allow for knowledge of the underlying ground truth and for easy variation of imaging conditions, making them ideal for testing and optimization of new applications or algorithms. However, simulating all processes that affect CT images can result in simulations that are demanding in terms of processing time and computer memory. Therefore, it is of interest to determine how much the simulation can be simplified while still achieving realistic results. To develop a scanner-specific CT simulation using physics-based simulations for the position-dependent effects and shift-invariant image corruption methods for the detector effects. And to investigate the impact on image realism of introducing simplifications in the simulation process that lead to faster and less memory-demanding simulations. To make the simulator realistic and scanner-specific, the spatial resolution and noise characteristics, and the exposure-to-detector output relationship of a clinical CT system were determined. The simulator includes a finite focal spot size, raytracing of the digital phantom, gantry rotation during projection acquisition, and finite detector element size. Previously published spectral models were used to model the spectrum for the given tube voltage. The integrated energy at each element of the detector was calculated using the Beer-Lambert law. The resulting angular projections were subsequently corrupted by the detector modulation transfer function (MTF), and by addition of noise according to the noise power spectrum (NPS) and signal mean-variance relationship, which were measured for different scanner settings. The simulated sinograms were reconstructed on the clinical CT system and compared to real CT images in terms of CT numbers, noise magnitude using the standard deviation, noise frequency content using the NPS, and spatial resolution using the MTF throughout the field of view (FOV). The CT numbers were validated using a multi-energy CT phantom, the noise magnitude and frequency were validated with a water phantom, and the spatial resolution was validated with a tungsten wire. These metrics were compared at multiple scanner settings, and locations in the FOV. Once validated, the simulation was simplified by reducing the level of subsampling of the focal spot area, rotation and of detector pixel size, and the changes in MTFs were analyzed. The average relative errors for spatial resolution within and across image slices, noise magnitude, and noise frequency content within and across slices were 3.4%, 3.3%, 4.9%, 3.9%, and 6.2%, respectively. The average absolute difference in CT numbers was 10.2HU and the maximum was 22.5HU. The simulation simplification showed that all subsampling can be avoided, except for angular, while the error in frequency at 10% MTF would be maximum 16.3%. The simulation of a scanner-specific CT allows for the generation of realistic CT images by combining physics-based simulations for the position-dependent effects and image-corruption methods for the shift-invariant ones. Together with the available ground truth of the digital phantom, it results in a useful tool to perform quantitative analysis of reconstruction or post-processing algorithms. Some simulation simplifications allow for reduced time and computer power requirements with minimal loss of realism.

Corrupted Images Research Articles

Related Topics

Articles published on Corrupted Images

Will Transformers change gastrointestinal endoscopic image analysis? A comparative analysis between CNNs and Transformers, in terms of performance, robustness and generalization

Image dejittering on the perspective of spatially-varying mixed noise removal

Benchmarking robustness of deep neural networks in semantic segmentation of fluorescence microscopy images

A Review on FPGA-based Architectures for Noise Removal of Digital Images using High-Level Design

Transforming experimental radiology: Design and implementation of an innovative ePACS image storage system for AI imaging research environments

Benchmarking PathCLIP for Pathology Image Analysis.

Semi-hard constraint augmentation of triplet learning to improve image corruption classification

Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation

STATNet: One-stage coal-gangue detector based on deep learning algorithm for real industrial application

Real-World Video Super-Resolution with a Degradation-Adaptive Model.

Toward Blind Flare Removal Using Knowledge-Driven Flare-Level Estimator.

Unified Multi-Modal Image Synthesis for Missing Modality Imputation.

Benchmarking the Robustness of Instance Segmentation Models.

Survey: Image mixing and deleting for data augmentation

Review of image inpainting in practical challenge

Cross-modality Neuroimage Synthesis: A Survey

End-to-end metric learning from corrupted images using triplet dimensionality reduction loss

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors.

Light Codes for Fast Two-Way Human-Centric Visual Communication

Development, validation, and simplification of a scanner-specific CT simulator.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Corrupted Images Research Articles

Related Topics

Articles published on Corrupted Images

Will Transformers change gastrointestinal endoscopic image analysis? A comparative analysis between CNNs and Transformers, in terms of performance, robustness and generalization

Image dejittering on the perspective of spatially-varying mixed noise removal

Benchmarking robustness of deep neural networks in semantic segmentation of fluorescence microscopy images

A Review on FPGA-based Architectures for Noise Removal of Digital Images using High-Level Design

Transforming experimental radiology: Design and implementation of an innovative ePACS image storage system for AI imaging research environments

Benchmarking PathCLIP for Pathology Image Analysis.

Semi-hard constraint augmentation of triplet learning to improve image corruption classification

Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation

STATNet: One-stage coal-gangue detector based on deep learning algorithm for real industrial application

Real-World Video Super-Resolution with a Degradation-Adaptive Model.

Toward Blind Flare Removal Using Knowledge-Driven Flare-Level Estimator.

Unified Multi-Modal Image Synthesis for Missing Modality Imputation.

Benchmarking the Robustness of Instance Segmentation Models.

Survey: Image mixing and deleting for data augmentation

Review of image inpainting in practical challenge

Cross-modality Neuroimage Synthesis: A Survey

End-to-end metric learning from corrupted images using triplet dimensionality reduction loss

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors.

Light Codes for Fast Two-Way Human-Centric Visual Communication

Development, validation, and simplification of a scanner-specific CT simulator.