Semantic Augmentation Research Articles

Data augmentation is an effective technique for automatically expanding training data in deep learning. Brain-inspired methods draw on the functionality and structure of the human brain and apply these mechanisms and principles to artificial intelligence and computer science. When there is a large style difference between training data and testing data, common data augmentation methods cannot effectively enhance the generalization performance of a deep model. To solve this problem, we improve on the Domain Shifts with Uncertainty (DSU) approach and propose a new brain-inspired image data augmentation method for computer vision that consists of two key components: using Robust statistics and controlling the Coefficient of variance for DSU (RCDSU), and Feature Data Augmentation (FeatureDA). RCDSU calculates feature statistics (mean and standard deviation) with robust statistics to weaken the influence of outliers, bringing the statistics closer to their true values and improving the robustness of deep learning models. By controlling the coefficient of variance, RCDSU shifts the feature statistics while preserving semantics and increases the shift range. FeatureDA controls the coefficient of variance in a similar way to generate augmented features with unchanged semantics and to increase the coverage of the augmented features. RCDSU and FeatureDA perform style transfer and content transfer in the feature space, and improve the generalization ability of the model at the style level and the content level, respectively. On the Photo, Art Painting, Cartoon, and Sketch (PACS) multi-style classification task, RCDSU plus FeatureDA achieves competitive accuracy. After Gaussian noise is added to the PACS dataset, RCDSU plus FeatureDA shows strong robustness against outliers. FeatureDA achieves excellent results on the CIFAR-100 image classification task. RCDSU plus FeatureDA can be applied as a novel brain-inspired semantic data augmentation method with implicit robot automation, suitable for datasets with large style differences between training and testing data.
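
The idea of computing feature statistics robustly and then shifting them can be illustrated with a minimal sketch. This is not the authors' implementation: the choice of the median and the scaled median absolute deviation (MAD) as the robust statistics, the function names `robust_stats` and `perturb_channel`, and the omission of the coefficient-of-variance control are all assumptions made for illustration only.

```python
import random
import statistics

# Scaling the MAD by 1.4826 makes it comparable to a standard deviation
# for Gaussian data (a standard constant, assumed here, not taken from the paper).
MAD_TO_STD = 1.4826


def robust_stats(values):
    """Return (median, scaled MAD) as outlier-resistant stand-ins for mean and std."""
    center = statistics.median(values)
    mad = statistics.median([abs(v - center) for v in values])
    return center, MAD_TO_STD * mad


def perturb_channel(batch, rng, eps=1e-6):
    """Shift one channel's statistics, DSU-style.

    batch: list of per-sample lists holding one channel's activations.
    Each sample is normalized with its own robust statistics, then rescaled
    with statistics perturbed by Gaussian noise whose size is the spread of
    those statistics across the batch.
    """
    stats = [robust_stats(sample) for sample in batch]
    center_spread = statistics.pstdev([c for c, _ in stats])
    scale_spread = statistics.pstdev([s for _, s in stats])
    augmented = []
    for sample, (center, scale) in zip(batch, stats):
        new_center = center + rng.gauss(0.0, 1.0) * center_spread
        new_scale = scale + rng.gauss(0.0, 1.0) * scale_spread
        augmented.append(
            [new_scale * (v - center) / (scale + eps) + new_center for v in sample]
        )
    return augmented


if __name__ == "__main__":
    rng = random.Random(0)
    batch = [[1.0, 2.0, 3.0, 4.0, 100.0], [2.0, 4.0, 6.0, 8.0, 10.0]]
    print(perturb_channel(batch, rng))
```

In the first sample the outlier 100.0 barely affects the median and MAD, whereas it would dominate an ordinary mean and standard deviation; that is the motivation for robust statistics described above.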


In real-world applications, robotic systems collect vast amounts of new data from ever-changing environments over time. They need to continually interact with the external world and learn new knowledge from it to adapt to the environment. In particular, lifelong object recognition in an online and interactive manner is a crucial and fundamental capability for robotic systems. To meet this realistic demand, in this article we propose an online active continual learning (OACL) framework for robotic lifelong object recognition, in a scenario where both classes and domains change with dynamic environments. First, to reduce the labeling cost as much as possible while maximizing performance, a new online active learning (OAL) strategy is designed that takes both the uncertainty and the diversity of samples into account to protect the information volume and distribution of the data. In addition, to prevent catastrophic forgetting and reduce memory costs, a novel online continual learning (OCL) algorithm is proposed based on deep feature semantic augmentation and a new loss-based update of the deep model and replay buffer, which can mitigate the class imbalance between old and new classes and alleviate confusion between two similar classes. Moreover, the mistake bound of the proposed method is analyzed in theory. OACL allows robots to select the most representative new samples for label queries and to continually learn new objects and new variants of previously learned objects from a non-independent and identically distributed (non-i.i.d.) data stream without catastrophic forgetting. Extensive experiments conducted on real lifelong robotic vision datasets demonstrate that our algorithm, even when trained with fewer labeled samples and replay exemplars, can achieve state-of-the-art performance on OCL tasks.
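
A label-query rule that weighs both uncertainty and diversity can be sketched as follows. This is only an illustrative sketch, not the OAL strategy from the article: the use of prediction entropy for uncertainty, Euclidean distance to already-queried features for diversity, the "either criterion" rule, the thresholds, and the names `entropy`, `min_distance`, and `should_query` are all assumptions.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def min_distance(feature, memory):
    """Distance from a feature vector to its nearest already-queried neighbour."""
    if not memory:
        return float("inf")  # nothing queried yet, so the sample is maximally novel
    return min(math.dist(feature, stored) for stored in memory)


def should_query(probs, feature, memory,
                 entropy_threshold=0.5, distance_threshold=1.0):
    """Ask for a label when the model is unsure or the sample is unlike past queries."""
    uncertain = entropy(probs) >= entropy_threshold
    novel = min_distance(feature, memory) >= distance_threshold
    return uncertain or novel


if __name__ == "__main__":
    # A toy stream of (predicted probabilities, feature vector) pairs.
    stream = [
        ([0.50, 0.50], [0.0, 0.0]),
        ([0.99, 0.01], [0.0, 0.1]),
        ([0.99, 0.01], [5.0, 5.0]),
    ]
    memory = []
    for probs, feature in stream:
        if should_query(probs, feature, memory):
            memory.append(feature)  # in practice the label would be requested here
    print(len(memory), "of", len(stream), "samples queried")
```

Processing samples one at a time and keeping only the queried features mirrors the online setting described above, where the stream cannot be revisited.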



Articles published on Semantic Augmentation

An enhanced domain generalization method for object detection based on text guided feature disentanglement

Representation-Enhanced Status Replay Network for Multisource Remote-Sensing Image Classification.

PLRTE: Progressive learning for biomedical relation triplet extraction using large language models

Supervised pyramid network based on semantic consistency for object detection

Toward Robust Graph Semi-Supervised Learning Against Extreme Data Scarcity.

Marginalized Augmented Few-Shot Domain Adaptation.

PatchNet: Maximize the Exploration of Congeneric Semantics for Weakly Supervised Semantic Segmentation.

Semantic Augmentation in Chinese Adversarial Corpus for Discourse Relation Recognition Based on Internal Semantic Elements

SSGAN: A Semantic Similarity-Based GAN for Small-Sample Image Augmentation

Search Engine for Open Geospatial Consortium Web Services Improving Discoverability through Natural Language Processing-Based Processing and Ranking

Brain-inspired semantic data augmentation for multi-style images.

Generating Images of Rare Concepts Using Pre-trained Diffusion Models

Fine-grained Recognition with Learnable Semantic Data Augmentation.

Online Active Continual Learning for Robotic Lifelong Object Recognition.

Inter-Class and Inter-Domain Semantic Augmentation for Domain Generalization.

Instance-specific Semantic Augmentation for Long-tailed Image Classification.

Multi-granularity cross-modal representation learning for named entity recognition on social media

Measuring the Quality of Semantic Data Augmentation for Sarcasm Detection

Domain adaptation with contrastive and adversarial oriented transferable semantic augmentation

Robust visual question answering via semantic cross modal augmentation

