Abstract

The Support Vector Machine (SVM) is a popular classification paradigm in machine learning and has achieved great success in real applications. However, the standard SVM cannot select variables automatically, and its solution therefore typically utilizes all the input variables without discrimination. This makes it difficult to identify important predictor variables, which is often one of the primary goals in data analysis. In this paper, we propose two novel types of regularization in the context of the multicategory SVM (MSVM) for simultaneous classification and variable selection. The MSVM generally requires estimation of multiple discriminating functions and applies the argmax rule for prediction. For each individual variable, we propose to characterize its importance by the sup-norm of its coefficient vector across the different functions, and then to minimize the MSVM hinge loss subject to a penalty on the sum of these sup-norms. To further improve the sup-norm penalty, we propose adaptive regularization, which allows different weights to be imposed on different variables according to their relative importance. Both types of regularization automate variable selection in the process of building classifiers, and lead to sparse multi-classifiers with enhanced interpretability and improved accuracy, especially for high dimensional, low sample size data. One major advantage of the sup-norm penalty is its easy implementation via standard linear programming. Several simulated examples and one real gene data analysis demonstrate the outstanding performance of the adaptive sup-norm penalty in various data settings.
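
For concreteness, the penalized problem described above can be written schematically as follows. This is a sketch reconstructed from the abstract, using the standard MSVM hinge loss with K classes and p variables; the symbols w_{jl} (coefficient of variable l in the j-th discriminating function) and the sum-to-zero constraint are our notation, not quoted from the paper:

    \[
    \min_{w,\,b}\ \sum_{i=1}^{n} \sum_{j \ne y_i} \big[\, f_j(x_i) + 1 \,\big]_{+}
      \;+\; \lambda \sum_{l=1}^{p} \max_{1 \le j \le K} |w_{jl}|
    \quad \text{subject to} \quad \sum_{j=1}^{K} f_j = 0,
    \]

where \( f_j(x) = b_j + \sum_{l=1}^{p} w_{jl} x_l \). The adaptive variant replaces the penalty with \( \lambda \sum_{l} \tau_l \max_j |w_{jl}| \), where each weight \( \tau_l \) is chosen inversely proportional to an initial estimate of variable l's importance, so that apparently unimportant variables are penalized more heavily.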

Highlights

  • While the Support Vector Machine (SVM) outperforms many other methods in terms of classification accuracy in numerous real problems, the implicit nature of its solution makes it less attractive for providing insight into the predictive ability of individual variables.

  • Variable selection becomes more complex than in the binary case, since the multicategory SVM (MSVM) requires estimation of multiple discriminating functions, each of which has its own subset of important predictors.

  • In contrast to the L1 MSVM, which imposes a penalty on the sum of the absolute values of all coefficients, we penalize the sup-norm of the coefficients associated with each variable, as the numerical sketch below illustrates.
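
To make the contrast concrete, here is a small numerical sketch; the coefficient matrix W below is invented for illustration and is not taken from the paper:

    import numpy as np

    # Hypothetical coefficients for a 3-class linear MSVM:
    # rows index the K = 3 discriminating functions, columns the p = 3 variables.
    W = np.array([[ 0.9,  0.0,  0.01],
                  [-0.5,  0.0,  0.02],
                  [-0.4,  0.7, -0.03]])

    # L1 MSVM penalty: sum of |w_jl| over ALL coefficients.
    l1_penalty = np.abs(W).sum()                   # 2.56

    # Sup-norm MSVM penalty: each variable is charged only for its
    # largest (in absolute value) coefficient across the K functions.
    sup_penalty = np.abs(W).max(axis=0).sum()      # 0.9 + 0.7 + 0.03 = 1.63

Because each variable is charged only once under the sup-norm, driving that single maximum to zero zeroes out the variable's entire coefficient group at once, which is the grouped selection behavior described above.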

Summary

Methodology

In a two-variable illustration, the sup-norm penalty shrinks the sum of the two maxima corresponding to the two variables, which leads to more parsimonious models. In contrast to the L1 penalty, the sup-norm utilizes the group structure of the decision function vector, so the sup-norm MSVM can deliver better variable selection. For three-class problems, we show that the L1 MSVM and the newly proposed sup-norm MSVM give identical solutions after adjusting the tuning parameters, a consequence of the sum-to-zero constraints on the w(j)'s. We use leave-one-out cross-validation of the misclassification rate to select λ.
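
A minimal sketch of the leave-one-out tuning loop is given below. It uses an off-the-shelf L2-penalized linear SVM purely as a stand-in classifier and synthetic data; the paper instead solves the sup-norm MSVM itself, via linear programming:

    import numpy as np
    from sklearn.svm import LinearSVC
    from sklearn.model_selection import LeaveOneOut, cross_val_score

    # Synthetic stand-in data: n = 60 observations, p = 10 variables, 3 classes.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(60, 10))
    y = rng.integers(0, 3, size=60)

    lambdas = np.logspace(-3, 1, 9)            # candidate tuning parameters
    loo_error = []
    for lam in lambdas:
        clf = LinearSVC(C=1.0 / lam)           # larger lambda => heavier penalty
        acc = cross_val_score(clf, X, y, cv=LeaveOneOut()).mean()
        loo_error.append(1.0 - acc)            # leave-one-out misclassification rate

    best_lambda = lambdas[int(np.argmin(loo_error))]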

Computational Algorithms
Adaptive Penalty
Simulation
Five-Class Example
Method
Four-Class Linear Example
Nonlinear Example
Real Example
Discussion
Findings
Literature Cited
