Semi-supervised Learning Tasks Research Articles

Data augmentation is widely known as a simple yet surprisingly effective technique for regularizing deep networks. Conventional data augmentation schemes, e.g., flipping, translation or rotation, are low-level, data-independent and class-agnostic operations, which limits the diversity of the augmented samples. To address this limitation, we propose a novel semantic data augmentation algorithm to complement traditional approaches. The proposed method is inspired by the intriguing property that deep networks are effective in learning linearized features, i.e., certain directions in the deep feature space correspond to meaningful semantic transformations, e.g., changing the background or view angle of an object. Based on this observation, translating training samples along many such directions in the feature space can effectively augment the dataset and increase its diversity. To implement this idea, we first introduce a sampling-based method to obtain semantically meaningful directions efficiently. Then, an upper bound of the expected cross-entropy (CE) loss on the augmented training set is derived by assuming the number of augmented samples goes to infinity, yielding a highly efficient algorithm. In fact, we show that the proposed implicit semantic data augmentation (ISDA) algorithm amounts to minimizing a novel robust CE loss, which adds minimal extra computational cost to a normal training procedure. In addition to supervised learning, ISDA can be applied to semi-supervised learning tasks under the consistency regularization framework, where ISDA amounts to minimizing the upper bound of the expected KL-divergence between the augmented features and the original features. Although simple, ISDA consistently improves the generalization performance of popular deep models (e.g., ResNets and DenseNets) on a variety of datasets, including CIFAR-10, CIFAR-100, SVHN, ImageNet, and Cityscapes.
Code for reproducing our results is available at https://github.com/blackfeather-wang/ISDA-for-Deep-Networks.
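
The abstract describes the robust CE loss only in words. As a rough illustration, the NumPy sketch below assumes the surrogate takes the commonly cited form in which every non-target logit j is raised by (lam / 2) * (w_j - w_y)^T Sigma_y (w_j - w_y), where Sigma_y is a per-class feature covariance. The function name isda_surrogate_loss, its arguments, and the way class_cov is supplied are illustrative assumptions, not the API of the linked repository.

```python
import numpy as np

def isda_surrogate_loss(features, labels, W, b, class_cov, lam=0.5):
    """Sketch of a robust CE loss with an implicit feature-space augmentation term.

    features:  (N, D) deep features of the batch
    labels:    (N,)   integer class labels
    W, b:      (C, D) and (C,) weights and biases of the final linear layer
    class_cov: (C, D, D) per-class feature covariances (assumed to be given)
    lam:       augmentation strength; lam = 0 recovers the ordinary CE loss
    """
    logits = features @ W.T + b                       # (N, C)
    # diff[i, j] = w_j - w_{y_i}; it is zero for the target class j = y_i.
    diff = W[None, :, :] - W[labels][:, None, :]      # (N, C, D)
    cov = class_cov[labels]                           # (N, D, D)
    # Quadratic form diff^T cov diff for every (sample, class) pair.
    quad = np.einsum("ncd,nde,nce->nc", diff, cov, diff)
    aug = logits + 0.5 * lam * quad
    # Numerically stable log-softmax over the adjusted logits.
    aug = aug - aug.max(axis=1, keepdims=True)
    log_probs = aug - np.log(np.exp(aug).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()
```

Because the added term is zero for the target class and non-negative for every other class whenever the covariances are positive semi-definite, the value returned is never below the plain CE loss on the same batch. No samples are drawn explicitly, which is consistent with the abstract's claim of minimal extra cost.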

Semi-Supervised Support Vector Machines (S3VMs) provide a powerful framework for Semi-Supervised Learning (SSL) tasks, which leverage widely available unlabeled data to improve performance. However, S3VMs have three issues: (i) they require concurrently training c one-against-all (OAA) classifiers (c is the number of classes) for multiclass classification, which is prohibitive for large c; (ii) they require huge computational time and large storage (because of the large kernel matrix) in large-scale training and testing; (iii) they require a balance constraint on the unlabeled data, which not only needs prior knowledge about the unlabeled data (unavailable in some applications), but also makes their nonconvex optimization problem more intractable. To address these issues, a novel method called Extreme Semi-Supervised Learning (ESSL) is proposed in this paper. First, the framework of Extreme Learning Machine (ELM) is adopted to handle both binary and multiclass classification problems in a unified model. Second, the hidden layer is encoded by an extremely small approximate empirical kernel map (AEKM) to greatly reduce the computational cost and the memory usage for training and testing. Third, the balance constraint (prior knowledge) on the unlabeled data is removed through the careful design of a weighting function (which emphasizes the importance of labeled data and of the minority pattern in the labeled data). With these three changes, ESSL can be solved effectively and efficiently based on alternating optimization (AO). More specifically, ESSL can be solved analytically and simply by a generalized pseudoinverse and a oneHotMap function (without any optimization solver and without the OAA strategy) in the AO procedure, and consequently, better performance and much faster training speed are always achieved in ESSL.
Our empirical study shows that ESSL significantly outperforms existing efficient SSL methods (e.g., meanS3VM and SS-ELM) in terms of accuracy, efficiency and memory, especially for large-scale multiclass problems. As an example, on the 20Newsgroups dataset, ESSL runs 45 and 120 times faster than meanS3VM for training and testing, respectively, with an improvement in accuracy of 3%, while the memory usage is reduced to 1/14. It is noteworthy that even with all model parameters left at their default values, ESSL already produces excellent performance without fine-tuning.
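
The alternating procedure described above (a closed-form solve for the output weights, then a one-hot relabeling of the unlabeled points) can be illustrated with a generic ELM-style self-training loop. This is only a sketch under stated assumptions and not the ESSL algorithm itself: the landmark-based RBF feature map stands in for the AEKM, a ridge-regularized solve replaces the generalized pseudoinverse for numerical stability, and the weighting function is omitted. All function and parameter names are hypothetical.

```python
import numpy as np

def rbf_map(X, landmarks, gamma):
    # Small empirical kernel map: one RBF feature per landmark point.
    d2 = ((X[:, None, :] - landmarks[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-gamma * d2)

def one_hot(idx, n_classes):
    # Integer labels -> one-hot target matrix (a single model for all classes).
    return np.eye(n_classes)[idx]

def fit_self_training(X_lab, y_lab, X_unl, n_classes, landmarks,
                      gamma=0.1, reg=1.0, n_iter=5):
    H_lab = rbf_map(X_lab, landmarks, gamma)
    H_unl = rbf_map(X_unl, landmarks, gamma)
    T_lab = one_hot(y_lab, n_classes)
    H, T = H_lab, T_lab                  # first pass uses labeled data only
    for _ in range(n_iter + 1):
        # Step 1: closed-form (ridge) solution for the output weights.
        A = H.T @ H + reg * np.eye(H.shape[1])
        beta = np.linalg.solve(A, H.T @ T)
        # Step 2: relabel the unlabeled points with one-hot pseudo-targets.
        pseudo = (H_unl @ beta).argmax(axis=1)
        H = np.vstack([H_lab, H_unl])
        T = np.vstack([T_lab, one_hot(pseudo, n_classes)])
    return beta

def predict(X, beta, landmarks, gamma=0.1):
    return (rbf_map(X, landmarks, gamma) @ beta).argmax(axis=1)
```

Each pass needs only one linear solve whose size is the number of landmarks, not the number of samples, which is the kind of saving the abstract attributes to a small kernel map. One model with one-hot targets also covers binary and multiclass problems without one-against-all training.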

Articles published on Semi-supervised Learning Tasks

Generative Adversarial Training for Supervised and Semi-supervised Learning.

An Empirical Study of Graph-Based Approaches for Semi-supervised Time Series Classification

The Representation of Large-Scale Graph Based on Semi-Supervised Learning

E-GCN: graph convolution with estimated labels

Higher-Order Graph Convolutional Networks With Multi-Scale Neighborhood Pooling for Semi-Supervised Node Classification

Probabilistic Reconstruction of Spatio-Temporal Processes Over Multi-Relational Graphs

Semi-Supervised Classification of Graph Convolutional Networks with Laplacian Rank Constraints

Regularizing Deep Networks With Semantic Data Augmentation.

Human activity recognition by manifold regularization based dynamic graph convolutional networks

FMixCutMatch for semi-supervised deep learning

Anisotropic Graph Convolutional Network for Semi-Supervised Learning

Robust kernelized graph-based learning

SCOs: Semi-Supervised Co-Selection by a Similarity Preserving Approach

A fast graph-based data classification method with applications to 3D sensory data in the form of point clouds

Deep representation clustering-based fault diagnosis method with unsupervised data applied to rotating machinery

Combining deep generative and discriminative models for Bayesian semi-supervised learning

Mutual Improvement Between Temporal Ensembling and Virtual Adversarial Training

Extreme semi-supervised learning for multiclass classification

Data induced masking representation learning for face data analysis

Semi-supervised online structure learning for composite event recognition
