Commonsense natural language inference (CNLI) tasks aim to select the most likely follow-up statement to a contextual description of ordinary, everyday events and facts. Current approaches to transferring CNLI models across tasks require large amounts of labeled data from the new task. This paper presents a way to reduce the need for additional annotated training data from the new task by leveraging symbolic knowledge bases such as ConceptNet. We formulate a teacher-student framework for mixed symbolic-neural reasoning, with the large-scale symbolic knowledge base serving as the teacher and a trained CNLI model as the student. This hybrid distillation process involves two steps. The first is a symbolic reasoning step: given a collection of unlabeled data, we use an abductive reasoning framework based on Grenander's pattern theory to create weakly labeled data. Pattern theory is an energy-based graphical probabilistic framework for reasoning among random variables with varying dependency structures. In the second step, the weakly labeled data, along with a fraction of the labeled data, is used to transfer the CNLI model to the new task; the goal is to reduce the fraction of labeled data required. We demonstrate the efficacy of our approach on three publicly available datasets (OpenBookQA, SWAG, and HellaSWAG) and evaluate three CNLI models (BERT, LSTM, and ESIM) representing different architectures. With no labeled data, we achieve, on average, 63% of the performance of a fully supervised BERT model; with only 1,000 labeled samples, this improves to 72%. Interestingly, the teacher mechanism itself has significant inference power without any training: the pattern theory framework achieves 32.7% accuracy on OpenBookQA, outperforming transformer-based models such as GPT (26.6%), GPT-2 (30.2%), and BERT (27.1%) by a significant margin. We further show that the framework generalizes to training neural CNLI models via knowledge distillation in both unsupervised and semi-supervised settings, where it outperforms all unsupervised and weakly supervised baselines as well as some early supervised approaches, while remaining competitive with fully supervised baselines. Additionally, the abductive learning framework can be adapted to other downstream tasks, such as unsupervised semantic textual similarity, unsupervised sentiment classification, and zero-shot text classification, without significant modification. Finally, user studies show that the generated interpretations enhance the framework's explainability by providing key insights into its reasoning mechanism.
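As a point of reference for the pattern-theoretic teacher, the following is a minimal sketch of the energy form in Grenander's general pattern theory, written in standard textbook notation rather than this paper's own instantiation; the configuration c, generators g_i, bond pairs (\beta', \beta''), acceptor function A, and generator prior Q are assumptions drawn from that general formulation, not symbols defined in this abstract:

% Standard pattern-theory energy of a configuration c of generators
% connected by bonds (Grenander's acceptor-function formulation);
% this is the textbook form, not the paper's specific model.
E(c) = -\sum_{(\beta',\beta'') \in c} \log A(\beta', \beta'') \;-\; \sum_{g_i \in c} \log Q(g_i),
\qquad p(c) \propto \exp\{-E(c)\}.

Read this way, the abductive teacher step can roughly be understood as grounding each candidate answer in a configuration of knowledge-base concepts and relations, scoring that configuration by its energy, and emitting the lowest-energy candidate as the weak label passed to the student.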