Abstract

Many important machine learning models, supervised and unsupervised, are based on simple Euclidean distances or orthogonal projections in a high-dimensional feature space. When estimating such models from small training sets, we face the problem that the span of the training input vectors does not cover the full input space; when the model is applied to future data, it is effectively blind to the missing orthogonal subspace. This can inflate the variance of hidden variables estimated on the training set, so that on test data the hidden variables follow a different probability law with less variance. While the problem, and basic means to reconstruct and deflate, are well understood in unsupervised learning, the supervised case is less well understood. We here investigate the effect of variance inflation in supervised learning, including the case of Support Vector Machines (SVMs), and we propose a non-parametric scheme to restore proper generalizability. We illustrate the algorithm and its ability to restore performance on a wide range of benchmark data sets.
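To make the phenomenon concrete, here is a minimal sketch (our own illustration, not code from the paper) using scikit-learn's LinearSVC on synthetic Gaussian data. With far fewer training points than input dimensions, the learned weight vector necessarily lies in the span of the training inputs, so the variance of the SVM decision values on test data is typically much smaller than on the training data. The final moment-matching rescaling is a simple stand-in for deflation, not the paper's non-parametric scheme.

```python
import numpy as np
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
d, n_train, n_test = 500, 50, 5000  # input dimension far exceeds training set size

# Synthetic data: Gaussian inputs, labels from a random linear rule.
w_true = rng.standard_normal(d)
X_train = rng.standard_normal((n_train, d))
X_test = rng.standard_normal((n_test, d))
y_train = np.sign(X_train @ w_true)
y_test = np.sign(X_test @ w_true)

svm = LinearSVC(C=1.0, max_iter=10_000).fit(X_train, y_train)

# The "hidden variable" here is the decision value f(x) = w.x + b. The
# learned w lies in the span of the 50 training inputs, so test points
# have large components in the orthogonal subspace that f cannot see,
# and the test decision values show deflated variance.
f_train = svm.decision_function(X_train)
f_test = svm.decision_function(X_test)
print(f"decision-value variance, train: {f_train.var():.3f}, test: {f_test.var():.3f}")

# Naive fix (an assumption for illustration, not the paper's scheme):
# rescale test decision values to match the training-set moments before
# thresholding, so train and test margins are on a comparable scale.
f_test_rescaled = (f_test - f_test.mean()) / f_test.std() * f_train.std() + f_train.mean()
```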
