Mixed-integer quadratic optimization and iterative clustering techniques for semi-supervised support vector machines

Jan Pablo Burgard, Martin Schmidt, Maria Eduarda Pinheiro

doi:10.1007/s11750-024-00668-w

Abstract

Among the most famous algorithms for solving classification problems are support vector machines (SVMs), which find a separating hyperplane for a set of labeled data points. In some applications, however, labels are only available for a subset of points. Furthermore, this subset can be non-representative, e.g., due to self-selection in a survey. Semi-supervised SVMs tackle the setting of labeled and unlabeled data and can often improve the reliability of the results. Moreover, additional information about the size of the classes can be available from undisclosed sources. We propose a mixed-integer quadratic optimization (MIQP) model that covers the setting of labeled and unlabeled data points as well as the overall number of points in each class. Since the MIQP's solution time grows rapidly with the number of variables, we introduce an iterative clustering approach to reduce the model's size. Moreover, we present an update rule for the required big-M values, prove the correctness of the iterative clustering method, and derive tailored dimension-reduction and warm-starting techniques. Our numerical results show that our approach achieves accuracy and precision similar to those of the MIQP formulation at much lower computational cost, which allows us to solve larger problems. Compared with the original SVM formulation, our approach attains even better accuracy and precision on biased samples.
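To make the modeling idea more concrete, the following is a minimal sketch of how class-size information could enter an SVM through big-M constraints. The first problem is the standard soft-margin SVM on the labeled points. The second block is an illustrative assumption and not necessarily the formulation used in the paper: the binary variables z_j, the constant M, and the count τ (the assumed known number of unlabeled points on the positive side) are symbols introduced here only for illustration.

```latex
% Standard soft-margin SVM on labeled points (x_i, y_i), i = 1,...,n, with y_i in {-1, +1}
\begin{align*}
\min_{w,\,b,\,\xi}\quad & \tfrac{1}{2}\lVert w\rVert_2^2 + C \sum_{i=1}^{n} \xi_i \\
\text{s.t.}\quad & y_i\,(w^\top x_i + b) \ge 1 - \xi_i, \quad \xi_i \ge 0, \quad i = 1,\dots,n.
\end{align*}

% Hypothetical extension for unlabeled points x_j, j = n+1,...,N (assumed notation):
% z_j = 1 places x_j on the non-negative side of the hyperplane, z_j = 0 on the non-positive side.
\begin{align*}
& w^\top x_j + b \le M z_j, \\
& w^\top x_j + b \ge -M\,(1 - z_j), \\
& z_j \in \{0,1\}, \quad j = n+1,\dots,N, \\
& \sum_{j=n+1}^{N} z_j = \tau.
\end{align*}
```

In a sketch of this kind, M has to bound |w^T x_j + b| for every unlabeled point. Values that are too small cut off valid solutions, while values that are too large weaken the continuous relaxation, which is one reason a careful choice of big-M values matters for solution time.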

Journal: TOP
Publication Date: May 16, 2024
License type: CC BY 4.0

Similar Papers

Innovative approaches to addressing the tradeoff between interpretability and accuracy in ship fuel consumption prediction
Haoqing Wang ... Lu Zhen
Transportation Research Part C: Emerging Technologies | VOL. 157 | 12 Oct 2023

Sparse Poisson regression via mixed-integer optimization
Hiroki Saishu ... Yuichi Takano
PloS one | VOL. 16 | 22 Apr 2021

Mixed integer quadratic optimization formulations for eliminating multicollinearity based on variance inflation factor
Ryuta Tamura ... Ryuhei Miyashiro
Journal of Global Optimization | VOL. 73 | 22 Oct 2018

Semi-Supervised Learning
Tobias Scheffer
01 Jan 2009
