Large Image Datasets Research Articles

Large medical imaging data sets are becoming increasingly available. A common challenge in these data sets is to ensure that each sample meets minimum quality requirements devoid of significant artefacts. Despite a wide range of existing automatic methods having been developed to identify imperfections and artefacts in medical imaging, they mostly rely on data-hungry methods. In particular, the scarcity of artefact-containing scans available for training has been a major obstacle in the development and implementation of machine learning in clinical research. To tackle this problem, we propose a novel framework having four main components: (1) a set of artefact generators inspired by magnetic resonance physics to corrupt brain MRI scans and augment a training dataset, (2) a set of abstract and engineered features to represent images compactly, (3) a feature selection process that depends on the class of artefact to improve classification performance, and (4) a set of Support Vector Machine (SVM) classifiers trained to identify artefacts. Our novel contributions are threefold: first, we use the novel physics-based artefact generators to generate synthetic brain MRI scans with controlled artefacts as a data augmentation technique. This will avoid the labour-intensive collection and labelling process of scans with rare artefacts. Second, we propose a large pool of abstract and engineered image features developed to identify 9 different artefacts for structural MRI. Finally, we use an artefact-based feature selection block that, for each class of artefacts, finds the set of features that provide the best classification performance. We performed validation experiments on a large data set of scans with artificially-generated artefacts, and in a multiple sclerosis clinical trial where real artefacts were identified by experts, showing that the proposed pipeline outperforms traditional methods. In particular, our data augmentation increases performance by up to 12.5 percentage points on the accuracy, F1, F2, precision and recall. At the same time, the computation cost of our pipeline remains low –less than a second to process a single scan– with the potential for real-time deployment. Our artefact simulators obtained using adversarial learning enable the training of a quality control system for brain MRI that otherwise would have required a much larger number of scans in both supervised and unsupervised settings. We believe that systems for quality control will enable a wide range of high-throughput clinical applications based on the use of automatic image-processing pipelines.

Read full abstract

Infectious keratitis (IK) is among the top five leading causes of blindness globally. Early diagnosis is needed to guide appropriate therapy to avoid complications such as vision impairment and blindness. Slit lamp microscopy and culture of corneal scrapes are key to diagnosing IK. Slit lamp photography was transformed when digital cameras and smartphones were invented. The digital camera or smartphone camera sensor's resolution, the resolution of the slit lamp and the focal length of the smartphone camera system are key to a high-quality slit lamp image. Alternative diagnostic tools include imaging, such as optical coherence tomography (OCT) and in vivo confocal microscopy (IVCM). OCT's advantage is its ability to accurately determine the depth and extent of the corneal ulceration, infiltrates and haze, therefore characterizing the severity and progression of the infection. However, OCT is not a preferred choice in the diagnostic tool package for infectious keratitis. Rather, IVCM is a great aid in the diagnosis of fungal and Acanthamoeba keratitis with overall sensitivities of 66-74% and 80-100% and specificity of 78-100% and 84-100%, respectively. Recently, deep learning (DL) models have been shown to be promising aids for the diagnosis of IK via image recognition. Most of the studies that have developed DL models to diagnose the different types of IK have utilised slit lamp photographs. Some studies have used extremely efficient single convolutional neural network algorithms to train their models, and others used ensemble approaches with variable results. Limitations of DL models include the need for large image datasets to train the models, the difficulty in finding special features of the different types of IK, the imbalance of training models, the lack of image protocols and misclassification bias, which need to be overcome to apply these models into real-world settings. Newer artificial intelligence technology that generates synthetic data, such as generative adversarial networks, may assist in overcoming some of these limitations of CNN models.

Read full abstract

Large Image Datasets Research Articles

Related Topics

Articles published on Large Image Datasets

Traffic Sign Classification Using ASIC

Comparison of deep learning-based image segmentation methods for intravascular ultrasound on retrospective and large image cohort study

Self-supervised pre-training with contrastive and masked autoencoder methods for dealing with small datasets in deep learning for medical imaging

An efficient semi-supervised quality control system trained using physics-based MRI-artefact generators and adversarial training

Subsurface imaging dataset acquired at the Garner Valley Downhole Array site using a dense network of three-component nodal stations

An Enhanced Automated Identification of Brain Tumor Cells Using Image Segmentation

A computer vision and residual neural network (ResNet) combined method for automated and accurate yeast replicative aging analysis of high-throughput microfluidic single-cell images

Updates in Diagnostic Imaging for Infectious Keratitis: A Review.

Smart Plant Disease Analysis and Management System Using Ai and Iot

ANDA: an open-source tool for automated image analysis of in vitro neuronal cells

Ten recommendations for organising bioimaging data for archival

Chasing a Better Decision Margin for Discriminative Histopathological Breast Cancer Image Classification

Retina Oculomics in Neurodegenerative Disease.

MASIC: Deep Mask Stereo Image Compression

PyTorch Deep Learning for Food Image Classification with Food Dataset

Synplex: In silico modelling of the tumor microenvironment from multiplex images.

A Brainwide Risk Score for Psychiatric Disorder Evaluated in a Large Adolescent Population Reveals Increased Divergence Among Higher-Risk Groups Relative to Control Participants

Medimatrix: innovative pre-training of grayscale images for rheumatoid arthritis diagnosis revolutionises medical image classification.

Deep learning enhanced achromatic imaging with a singlet flat lens.

An introduction to artificial intelligence in machine vision for postharvest detection of disorders in horticultural products

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Large Image Datasets Research Articles

Related Topics

Articles published on Large Image Datasets

Traffic Sign Classification Using ASIC

Comparison of deep learning-based image segmentation methods for intravascular ultrasound on retrospective and large image cohort study

Self-supervised pre-training with contrastive and masked autoencoder methods for dealing with small datasets in deep learning for medical imaging

An efficient semi-supervised quality control system trained using physics-based MRI-artefact generators and adversarial training

Subsurface imaging dataset acquired at the Garner Valley Downhole Array site using a dense network of three-component nodal stations

An Enhanced Automated Identification of Brain Tumor Cells Using Image Segmentation

A computer vision and residual neural network (ResNet) combined method for automated and accurate yeast replicative aging analysis of high-throughput microfluidic single-cell images

Updates in Diagnostic Imaging for Infectious Keratitis: A Review.

Smart Plant Disease Analysis and Management System Using Ai and Iot

ANDA: an open-source tool for automated image analysis of in vitro neuronal cells

Ten recommendations for organising bioimaging data for archival

Chasing a Better Decision Margin for Discriminative Histopathological Breast Cancer Image Classification

Retina Oculomics in Neurodegenerative Disease.

MASIC: Deep Mask Stereo Image Compression

PyTorch Deep Learning for Food Image Classification with Food Dataset

Synplex: In silico modelling of the tumor microenvironment from multiplex images.

A Brainwide Risk Score for Psychiatric Disorder Evaluated in a Large Adolescent Population Reveals Increased Divergence Among Higher-Risk Groups Relative to Control Participants

Medimatrix: innovative pre-training of grayscale images for rheumatoid arthritis diagnosis revolutionises medical image classification.

Deep learning enhanced achromatic imaging with a singlet flat lens.

An introduction to artificial intelligence in machine vision for postharvest detection of disorders in horticultural products