Semi-supervised Support Vector Machines Research Articles

BackgroundThe necessity to analyze medium-throughput data in epidemiological studies with small sample size, particularly when studying biomedical data may hinder the use of classical statistical methods. Support vector machines (SVM) models can be successfully applied in this setting because they are a powerful tool to analyze data with large number of predictors and limited sample size, especially when handling binary outcomes. However, biomedical research often involves analysis of time-to-event outcomes and has to account for censoring. Methods to handle censored data in the SVM framework can be divided into two classes: those based on support vector regression (SVR) and those based on binary classification. Methods based on SVR seem to be suboptimal to handle sparse data and yield results comparable to Cox proportional hazards model and kernel Cox regression. The limited work dedicated to assess methods based on of SVM for binary classification has been based on SVM learning using privileged information and SVM with uncertain classes.ResultsThis paper proposes alternative methods and extensions within the binary classification framework, specifically, a conditional survival approach for weighting censored observations and a semi-supervised SVM with local invariances. Using simulation studies and some real datasets, we evaluate those two methods and compare them with a weighted SVM model, SVM extensions found in the literature, kernel Cox regression and Cox model.ConclusionsOur proposed methods perform generally better under a wide variety of realistic scenarios about the structure of biomedical data. Specifically, the local invariances method using the conditional survival approach is the most robust method under different scenarios and is a good approach to consider as an alternative to other time-to-event methods. When analysing real data is a method to be considered and recommended since outperforms other methods in proportional and non-proportional scenarios and sparse data, which is something usual in biomedical data and biomarkers analysis.

Read full abstract

Semi-Supervised Support Vector Machines (S3VMs) provide a powerful framework for Semi-Supervised Learning (SSL) tasks which leverage widely available unlabeled data to improve performance. However, there exist three issues in S3VMs: (i) S3VMs require concurrently training c one-against-all (OAA) classifiers (c is the number of classes) for multiclass classification, which is prohibitive for large c; (ii) S3VMs require huge computational time and large storage (because of the large kernel matrix) in large-scale training and testing; (iii) S3VMs require the balance constraint in the unlabeled data, which not only needs prior knowledge from the unlabeled data (the prior knowledge is unavailable in some applications), but also makes their nonconvex optimization problem more intractable. To address these issues, a novel method called Extreme Semi-Supervised Learning (ESSL) is proposed in this paper. First, the framework of Extreme Learning Machine (ELM) is adopted to handle both binary and multiclass classification problems in a unified model. Second, the hidden layer is encoded by an extremely small approximate empirical kernel map (AEKM) to greatly reduce the computational cost and the memory usage for training and testing. Third, the balance constraint (prior knowledge) in the unlabeled data is removed through the elaborative design of weighting function (which emphasizes the importance of labeled data and the minority pattern in the labeled data).By these three ways, ESSL can be solved effectively and efficiently based on alternating optimization (AO). More specifically, ESSL can be analytically and simply solved by generalized pseudoinverse and oneHotMap function (without any optimization solver and the OAA strategy) in the AO procedure, and consequently, better performance and much faster training speed are always achieved in ESSL. Our empirical study shows that ESSL significantly outperforms existing efficient SSL methods (e.g., meanS3VM and SS-ELM) in terms of accuracy, efficiency and memory, especially for large-scale multiclass problems. As an example, on the 20Newsgroups dataset, ESSL respectively runs 45 and 120 times faster than meanS3VM for training and testing with the improvement in accuracy of 3%, while the memory usage is reduced to 1/14. It is noteworthy that even though all the model parameters are with default values, ESSL already produces very excellent performance without fine-tuning parameters.

Read full abstract

Semi-supervised Support Vector Machines Research Articles

Articles published on Semi-supervised Support Vector Machines

Mixed-integer quadratic optimization and iterative clustering techniques for semi-supervised support vector machines

Reject inference in credit scoring based on cost-sensitive learning and joint distribution adaptation method

A classification method of fuzzy semi-supervised support vector machines for the problems of imbalance

Semi-Supervised Ensemble Learning Framework for Accelerating Power System Transient Stability Knowledge Base Generation

Comparison of general kernel, multiple kernel, infinite ensemble and semi-supervised support vector machines for landslide susceptibility prediction

Applications of Semi-Supervised Support Vector Machines in Data Separation Methods in Structural Health Monitoring

Safe transductive support vector machine

Semi-Supervised Support Vector Machine for Digital Twins Based Brain Image Fusion.

One novel class of Bézier smooth semi-supervised support vector machines for classification

Enhancing SVM for survival data using local invariances and weighting

Multi-view semi-supervised least squares twin support vector machines with manifold-preserving graph reduction

Extreme semi-supervised learning for multiclass classification

Semi-supervised modeling and compensation for the thermal error of precision feed axes

Detection of Human Fall Using Floor Vibration and Multi-Features Semi-Supervised SVM.

Seismic classification-based method for recognizing epicenter-neighboring orbits

Reject inference in credit scoring using Semi-supervised Support Vector Machines

Distributed semi-supervised support vector machines

An aggregate and iterative disaggregate algorithm with proven optimality in machine learning

A New Conic Approach to Semisupervised Support Vector Machines

Conic Relaxations for Semi-supervised Support Vector Machines

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Semi-supervised Support Vector Machines Research Articles

Articles published on Semi-supervised Support Vector Machines

Mixed-integer quadratic optimization and iterative clustering techniques for semi-supervised support vector machines

Reject inference in credit scoring based on cost-sensitive learning and joint distribution adaptation method

A classification method of fuzzy semi-supervised support vector machines for the problems of imbalance

Semi-Supervised Ensemble Learning Framework for Accelerating Power System Transient Stability Knowledge Base Generation

Comparison of general kernel, multiple kernel, infinite ensemble and semi-supervised support vector machines for landslide susceptibility prediction

Applications of Semi-Supervised Support Vector Machines in Data Separation Methods in Structural Health Monitoring

Safe transductive support vector machine

Semi-Supervised Support Vector Machine for Digital Twins Based Brain Image Fusion.

One novel class of Bézier smooth semi-supervised support vector machines for classification

Enhancing SVM for survival data using local invariances and weighting

Multi-view semi-supervised least squares twin support vector machines with manifold-preserving graph reduction

Extreme semi-supervised learning for multiclass classification

Semi-supervised modeling and compensation for the thermal error of precision feed axes

Detection of Human Fall Using Floor Vibration and Multi-Features Semi-Supervised SVM.

Seismic classification-based method for recognizing epicenter-neighboring orbits

Reject inference in credit scoring using Semi-supervised Support Vector Machines

Distributed semi-supervised support vector machines

An aggregate and iterative disaggregate algorithm with proven optimality in machine learning

A New Conic Approach to Semisupervised Support Vector Machines

Conic Relaxations for Semi-supervised Support Vector Machines