Abstract

Support Vector Regression (SVR) is a powerful supervised machine learning model, especially well suited to normalized or binarized data. However, its quadratic complexity in the number of training examples precludes training on large datasets, especially high-dimensional ones with frequent retraining requirements. We propose a simple two-stage greedy selection of training data for SVR that maximizes validation set accuracy with a minimum number of training examples, and we illustrate the performance of this strategy in the context of the Clash Royale Challenge 2019, concerned with efficient prediction of decks' win rates. Hundreds of thousands of labelled data examples were reduced to the few hundred on which an optimized SVR was trained to maximize the validation R2 score. The proposed model took first place in the Clash Royale 2019 challenge, outperforming over a hundred competing teams from around the world.

Highlights

  • Support Vector Machine (SVM) is a supervised machine learning (ML) model developed as far back as 1963 [1] on the basis of the Vapnik-Chervonenkis computational theory of learning [2]

  • Support Vector Regression (SVR) extends the original capability of the SVM model into the regression space, while sharing the same model fundamentals and properties that SVM exhibits for classification, for instance the margin-maximizing hyperplane characterization and tolerance of errors (see the sketch after this list)

  • The high cost of computing a large number of support vectors during SVR training is a critical drawback compared to simpler supervised ML models which, though unable to demonstrate such generalization ingenuity, are able to train in reasonable time [9], [10], [11]
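The error-tolerance property mentioned above corresponds to the epsilon-insensitive loss: residuals inside an epsilon-wide tube around the prediction incur no penalty. Below is a minimal sketch, not the authors' code, assuming scikit-learn's SVR; the RBF kernel, C, epsilon, and the synthetic data are purely illustrative.

```python
import numpy as np
from sklearn.svm import SVR

# Illustrative synthetic data: noisy sine wave.
rng = np.random.default_rng(0)
X = rng.uniform(-3.0, 3.0, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.1, size=200)

# epsilon defines the tube within which errors are tolerated with no penalty,
# mirroring the margin / error-tolerance idea carried over from SVM.
model = SVR(kernel="rbf", C=10.0, epsilon=0.1)
model.fit(X, y)

# Only a subset of points become support vectors; this small subset carries
# most of the model's predictive power.
print("support vectors used:", model.support_vectors_.shape[0], "of", len(X))
```

Note how the fitted model typically retains far fewer support vectors than training points; the paper's data-selection strategy exploits exactly this concentration of predictive power.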

Summary

INTRODUCTION

Support Vector Machine (SVM) is a supervised machine learning (ML) model developed as far back as 1963 [1] on the basis of the Vapnik-Chervonenkis computational theory of learning [2]. The high cost of computing a large number of support vectors during SVR training is a critical drawback compared to simpler supervised ML models which, though unable to demonstrate such generalization ingenuity, are able to train in reasonable time [9], [10], [11]. Based on the observation that the vast majority of SVM (SVR) predictive power comes from a fairly small number of key data-structure-capturing examples, the huge computational cost of training SVR can be reduced by carefully selecting a small set of critical training data points. To address this challenge, we propose a simple two-stage greedy search that returns an ordered list of the most predictive data points, yielding the most predictive SVR model for an incrementally growing number of training examples.
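To make the greedy idea concrete, here is a simplified, hypothetical sketch of single-stage greedy forward selection, not the authors' exact two-stage backward-forward and round-exhaustive procedure. The function name `greedy_forward_select`, the hyperparameters, and the random seeding of the initial subset are our assumptions for illustration only.

```python
import numpy as np
from sklearn.svm import SVR
from sklearn.metrics import r2_score

def greedy_forward_select(X_train, y_train, X_val, y_val, budget=50, seed_size=2):
    """Greedily grow a training subset: each round adds the single candidate
    point whose inclusion most improves validation R^2."""
    rng = np.random.default_rng(0)
    selected = [int(i) for i in rng.choice(len(X_train), size=seed_size, replace=False)]
    remaining = [i for i in range(len(X_train)) if i not in selected]
    best_score = -np.inf

    while len(selected) < budget and remaining:
        best_idx = None
        for i in remaining:
            trial = selected + [i]
            # Hyperparameters are illustrative; the paper tunes them separately.
            model = SVR(kernel="rbf", C=10.0, epsilon=0.1)
            model.fit(X_train[trial], y_train[trial])
            score = r2_score(y_val, model.predict(X_val))
            if score > best_score:
                best_score, best_idx = score, i
        if best_idx is None:  # no remaining point improves validation R^2: stop early
            break
        selected.append(best_idx)
        remaining.remove(best_idx)

    return selected, best_score
```

Each candidate evaluation retrains an SVR, but because the selected subset stays in the hundreds rather than hundreds of thousands of points, each fit is cheap and the overall search remains tractable.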

COMPETITION DESCRIPTION
Data preparation
Hyperparameters’ setting
Greedy online backward-forward data selection
Greedy round-exhaustive forward data selection
Fine-tuning for further generalization improvements
EXPERIMENTAL RESULTS
CONCLUSIONS