Annotation Cost Research Articles

Machine learning has significantly impacted the analysis of biological images and is now an important part of many biological data analysis pipelines. A variety of biological and biomedical domain-related tasks is gaining benefit from image analysis and pattern recognition tools developed currently. Applications include diagnostic histopathology, environmental monitoring, synthetic biology, genomics, and proteomics. Particularly in the last decade, several deep learning and advanced computer vision methods such as convolutional neural networks (CNNs), typically trained in a supervised fashion, have started to be largely employed in biological image classification. Moreover, the advancement of automatic acquisition systems has been generating a massive amount of biological data, which requires to be analyzed by domain experts. However, the cost of manual annotation of such data has become a bottleneck, impairing the application of supervised machine learning algorithms. Biological images generally have an intrinsic high variability, whose identity is sometimes hard to assign and strongly dependent on the annotator’s expertise. In this context, a limited number of annotation-free (i.e., unsupervised) learning solutions have been proposed, typically based on hand-crafted features, specifically tailored for a certain biological domain. Nonetheless, a successful unsupervised learning approach must be accurate, and sufficiently robust to deal with different biological domains. This paper aims at providing a viable solution to these issues, proposing an unsupervised learning algorithm based on compressed deep features for image classification. We exploit features extracted from ImageNet pre-trained transformers and CNNs, further compressed with a customized β-Variational AutoEncoder (β-VAE), that we call reconstruction VAE (R-VAE). We test our algorithm on biological images coming from diverse domains characterized by high variability in shape and texture information and acquired with widely differing imaging platforms. Considered image datasets range from multi-cellular organisms (plankton, coral) to sub-cellular organelles (budding yeast vacuoles, human cells’ nuclei, etc.). Our results show that the compressed deep features extracted from different pre-trained vision models establish new unsupervised learning state-of-the-art performances for the investigated datasets.

Read full abstract

To conduct a classification study of high myopic maculopathy (HMM) using limited datasets, including tessellated fundus, diffuse chorioretinal atrophy, patchy chorioretinal atrophy, and macular atrophy, and minimize annotation costs, and to optimize the ALFA-Mix active learning algorithm and apply it to HMM classification. The optimized ALFA-Mix algorithm (ALFA-Mix+) was compared with five algorithms, including ALFA-Mix. Four models, including ResNet18, were established. Each algorithm was combined with four models for experiments on the HMM dataset. Each experiment consisted of 20 active learning rounds, with 100 images selected per round. The algorithm was evaluated by comparing the number of rounds in which ALFA-Mix+ outperformed other algorithms. Finally, this study employed six models, including EfficientFormer, to classify HMM. The best-performing model among these models was selected as the baseline model and combined with the ALFA-Mix+ algorithm to achieve satisfactory classification results with a small dataset. ALFA-Mix+ outperforms other algorithms with an average superiority of 16.6, 14.75, 16.8, and 16.7 rounds in terms of accuracy, sensitivity, specificity, and Kappa value, respectively. This study conducted experiments on classifying HMM using several advanced deep learning models with a complete training set of 4252 images. The EfficientFormer achieved the best results with an accuracy, sensitivity, specificity, and Kappa value of 0.8821, 0.8334, 0.9693, and 0.8339, respectively. Therefore, by combining ALFA-Mix+ with EfficientFormer, this study achieved results with an accuracy, sensitivity, specificity, and Kappa value of 0.8964, 0.8643, 0.9721, and 0.8537, respectively. The ALFA-Mix+ algorithm reduces the required samples without compromising accuracy. Compared to other algorithms, ALFA-Mix+ outperforms in more rounds of experiments. It effectively selects valuable samples compared to other algorithms. In HMM classification, combining ALFA-Mix+ with EfficientFormer enhances model performance, further demonstrating the effectiveness of ALFA-Mix+.

Read full abstract

Annotation Cost Research Articles

Related Topics

Articles published on Annotation Cost

Multiscale progressive text prompt network for medical image segmentation

SatelliteCloudGenerator: Controllable Cloud and Shadow Synthesis for Multi-Spectral Optical Satellite Images

Automatic intracranial abnormality detection and localization in head CT scans by learning from free-text reports

Uncertainty-aware and dynamically-mixed pseudo-labels for semi-supervised defect segmentation

HAL-IA: A Hybrid Active Learning framework using Interactive Annotation for medical image segmentation.

Dissecting self-supervised learning methods for surgical computer vision.

MTMVC: Semi-supervised 3D hand pose estimation using multi-task and multi-view consistency

Prescriptive Method for Optimizing Cost of Data Collection and Annotation in Machine Learning of Clinical Ultrasound.

Density-based one-shot active learning for image segmentation

Dynamic facial expression recognition with pseudo‐label guided multi‐modal pre‐training

Efficient unsupervised learning of biological images with compressed deep features

Learning from algorithm-generated pseudo-annotations for detecting ants in videos

Research on classification method of high myopic maculopathy based on retinal fundus images and optimized ALFA-Mix active learning algorithm.

WGAN-CL: A Wasserstein GAN with confidence loss for small-sample augmentation

CSAL: Cost sensitive active learning for multi-source drifting stream

SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition

Unsupervised Cross-Corpus Speech Emotion Recognition Using a Multi-Source Cycle-GAN

Novel Intent Detection and Active Learning Based Classification (Student Abstract)

LWSIS: LiDAR-Guided Weakly Supervised Instance Segmentation for Autonomous Driving

PARCS: A Deployment-Oriented AI System for Robust Parcel-Level Cropland Segmentation of Satellite Images

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Annotation Cost Research Articles

Related Topics

Articles published on Annotation Cost

Multiscale progressive text prompt network for medical image segmentation

SatelliteCloudGenerator: Controllable Cloud and Shadow Synthesis for Multi-Spectral Optical Satellite Images

Automatic intracranial abnormality detection and localization in head CT scans by learning from free-text reports

Uncertainty-aware and dynamically-mixed pseudo-labels for semi-supervised defect segmentation

HAL-IA: A Hybrid Active Learning framework using Interactive Annotation for medical image segmentation.

Dissecting self-supervised learning methods for surgical computer vision.

MTMVC: Semi-supervised 3D hand pose estimation using multi-task and multi-view consistency

Prescriptive Method for Optimizing Cost of Data Collection and Annotation in Machine Learning of Clinical Ultrasound.

Density-based one-shot active learning for image segmentation

Dynamic facial expression recognition with pseudo‐label guided multi‐modal pre‐training

Efficient unsupervised learning of biological images with compressed deep features

Learning from algorithm-generated pseudo-annotations for detecting ants in videos

Research on classification method of high myopic maculopathy based on retinal fundus images and optimized ALFA-Mix active learning algorithm.

WGAN-CL: A Wasserstein GAN with confidence loss for small-sample augmentation

CSAL: Cost sensitive active learning for multi-source drifting stream

SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition

Unsupervised Cross-Corpus Speech Emotion Recognition Using a Multi-Source Cycle-GAN

Novel Intent Detection and Active Learning Based Classification (Student Abstract)

LWSIS: LiDAR-Guided Weakly Supervised Instance Segmentation for Autonomous Driving

PARCS: A Deployment-Oriented AI System for Robust Parcel-Level Cropland Segmentation of Satellite Images