Graph-based Semi-supervised Learning Research Articles

Classification of high dimensional data finds wide-ranging applications. In many of these applications equipping the resulting classification with a measure of uncertainty may be as important as the classification itself. In this paper we introduce, develop algorithms for, and investigate the properties of a variety of Bayesian models for the task of binary classification; via the posterior distribution on the classification labels, these methods automatically give measures of uncertainty. The methods are all based on the graph formulation of semisupervised learning. We provide a unified framework which brings together a variety of methods that have been introduced in different communities within the mathematical sciences. We study probit classification [C. K. Williams and C. E. Rasmussen, “Gaussian Processes for Regression,” in Advances in Neural Information Processing Systems 8, MIT Press, 1996, pp. 514--520] in the graph-based setting, generalize the level-set method for Bayesian inverse problems [M. A. Iglesias, Y. Lu, and A. M. Stuart, Interfaces Free Bound., 18 (2016), pp. 181--217] to the classification setting, and generalize the Ginzburg--Landau optimization-based classifier [A. L. Bertozzi and A. Flenner, Multiscale Model. Simul., 10 (2012), pp. 1090--1118], [Y. Van Gennip and A. L. Bertozzi, Adv. Differential Equations, 17 (2012), pp. 1115--1180] to a Bayesian setting. We also show that the probit and level-set approaches are natural relaxations of the harmonic function approach introduced in [X. Zhu et al., “Semi-supervised Learning Using Gaussian Fields and Harmonic Functions,” in ICML, Vol. 3, 2003, pp. 912--919]. We introduce efficient numerical methods, suited to large datasets, for both MCMC-based sampling and gradient-based MAP estimation. Through numerical experiments we study classification accuracy and uncertainty quantification for our models; these experiments showcase a suite of datasets commonly used to evaluate graph-based semisupervised learning algorithms.

Semi-supervised learning (SSL) concerns the problem of how to improve classifiers’ performance through making use of prior knowledge from unlabeled data. Many SSL methods have been developed to integrate unlabeled data into the classifiers based on either the manifold or cluster assumption in recent years. In particular, the graph-based approaches, following the manifold assumption, have achieved a promising performance in many real-world applications. However, most of them work well on small-scale data sets only and lack probabilistic outputs. In this paper, a scalable graph-based SSL framework through sparse Bayesian model is proposed by defining a graph-based sparse prior. Based on the traditional Bayesian inference technique, a sparse Bayesian SSL algorithm (SBS <inline-formula> <tex-math notation="LaTeX">$^2$</tex-math></inline-formula> L) is obtained, which can remove the irrelevant unlabeled samples and make probabilistic prediction for out-of-sample data. Moreover, in order to scale SBS <inline-formula> <tex-math notation="LaTeX">$^2$</tex-math></inline-formula> L to large-scale data sets, an incremental SBS <inline-formula> <tex-math notation="LaTeX">$^2$</tex-math></inline-formula> L (ISBS <inline-formula><tex-math notation="LaTeX">$^2$</tex-math></inline-formula> L) is derived. The key idea of ISBS <inline-formula><tex-math notation="LaTeX">$^2$</tex-math></inline-formula> L is employing an incremental strategy and sequentially selecting parts of unlabeled samples that contribute to the learning instead of using all available unlabeled samples directly. ISBS <inline-formula><tex-math notation="LaTeX">$^2$</tex-math></inline-formula> L has lower time and space complexities than previous SSL algorithms with the use of all unlabeled samples. Extensive experiments on various data sets verify that our algorithms can achieve comparable classification effectiveness and efficiency with much better scalability. Finally, the generalization error bound is derived based on robustness analysis.

Graph-based Semi-supervised Learning Research Articles

Related Topics

Articles published on Graph-based Semi-supervised Learning

Leveraging multi-modal fusion for graph-based image annotation

A Semi-Supervised Approach to Bearing Fault Diagnosis under Variable Conditions towards Imbalanced Unlabeled Data.

Semisupervised Learning With Parameter-Free Similarity of Label and Side Information.

Bootstrapped Graph Diffusions

Interpretable Graph-Based Semi-Supervised Learning via Flows

Deeper Insights Into Graph Convolutional Networks for Semi-Supervised Learning

Safety-aware Graph-based Semi-Supervised Learning

Bootstrapped Graph Diffusions

Spatial and class structure regularized sparse representation graph for semi-supervised hyperspectral image classification

Robust Graph-Based Semisupervised Learning for Noisy Labeled Data via Maximum Correntropy Criterion.

Instance selection method for improving graph-based semi-supervised learning

그래프 기반 준지도 학습에서 빠른 낮은 계수 표현 기반 그래프 구축

Manifold Adaptive Kernelized Low-Rank Representation for Semisupervised Image Classification

Representation Space-Based Discriminative Graph Construction for Semisupervised Hyperspectral Image Classification

Uncertainty Quantification in Graph-Based Classification of High Dimensional Data

Joint Sparse Representation and Embedding Propagation Learning: A Framework for Graph-Based Semisupervised Learning.

Scalable Graph-Based Semi-Supervised Learning through Sparse Bayesian Model

A comparison of graph- and kernel-based \u2013omics data integration algorithms for classifying complex traits

GRAPH-BASED SEMI-SUPERVISED HYPERSPECTRAL IMAGE CLASSIFICATION USING SPATIAL INFORMATION

Spectral salient object detection

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Graph-based Semi-supervised Learning Research Articles

Related Topics

Articles published on Graph-based Semi-supervised Learning

Leveraging multi-modal fusion for graph-based image annotation

A Semi-Supervised Approach to Bearing Fault Diagnosis under Variable Conditions towards Imbalanced Unlabeled Data.

Semisupervised Learning With Parameter-Free Similarity of Label and Side Information.

Bootstrapped Graph Diffusions

Interpretable Graph-Based Semi-Supervised Learning via Flows

Deeper Insights Into Graph Convolutional Networks for Semi-Supervised Learning

Safety-aware Graph-based Semi-Supervised Learning

Bootstrapped Graph Diffusions

Spatial and class structure regularized sparse representation graph for semi-supervised hyperspectral image classification

Robust Graph-Based Semisupervised Learning for Noisy Labeled Data via Maximum Correntropy Criterion.

Instance selection method for improving graph-based semi-supervised learning

그래프 기반 준지도 학습에서 빠른 낮은 계수 표현 기반 그래프 구축

Manifold Adaptive Kernelized Low-Rank Representation for Semisupervised Image Classification

Representation Space-Based Discriminative Graph Construction for Semisupervised Hyperspectral Image Classification

Uncertainty Quantification in Graph-Based Classification of High Dimensional Data

Joint Sparse Representation and Embedding Propagation Learning: A Framework for Graph-Based Semisupervised Learning.

Scalable Graph-Based Semi-Supervised Learning through Sparse Bayesian Model

A comparison of graph- and kernel-based \u2013omics data integration algorithms for classifying complex traits

GRAPH-BASED SEMI-SUPERVISED HYPERSPECTRAL IMAGE CLASSIFICATION USING SPATIAL INFORMATION

Spectral salient object detection