Scalable and fast SVM regression using modern hardware

Zeyi Wen, Rui Zhang, Li Yang, Kotagiri Ramamohanarao

doi:10.1007/s11280-017-0445-1

Abstract

Support Vector Machine (SVM) regression is an important technique in data mining. SVM training is expensive, and its cost is dominated by (i) kernel value computation and (ii) a search operation that finds extreme training data points for adjusting the regression function in every training iteration. Existing training algorithms for SVM regression do not scale to large datasets because (i) each training iteration repeats expensive kernel value computations, which is inefficient and requires holding the whole training dataset in memory, and (ii) the search operation in each training iteration considers the whole search space, which is very expensive. In this article, we significantly improve the scalability and efficiency of SVM regression by exploiting the high performance of Graphics Processing Units (GPUs) and solid state drives (SSDs). Our key ideas are as follows. (i) To reduce the cost of repeated kernel value computations and avoid holding the whole training dataset in GPU memory, we precompute all the kernel values and store them in CPU memory extended by the SSD; combined with an efficient retrieval strategy, reusing the precomputed kernel values is much faster than computing them on the fly. This also alleviates the restriction that the training dataset has to fit into GPU memory, and hence makes our algorithm scalable to large datasets, especially those with very high dimensionality. (ii) To speed up the frequently used search operation, we design an algorithm that minimizes the search space and the number of accesses to GPU global memory; this optimized search algorithm also avoids branch divergence (one cause of poor performance) among GPU threads to achieve high utilization of GPU resources. Together, our proposed techniques form a scalable solution for SVM regression, which we call SIGMA.
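The idea of computing every kernel value once and reading it back later can be illustrated with a small CPU-only sketch. The abstract does not state the kernel function, the storage layout, or the retrieval strategy, so everything here is an assumption made for illustration: a Gaussian (RBF) kernel, a row-major binary file standing in for SSD-backed storage, and the hypothetical helper names `rbf_kernel`, `precompute_kernel_rows`, and `read_kernel_row`.

```python
import math
import os
import tempfile
from array import array


def rbf_kernel(x, y, gamma):
    """Gaussian (RBF) kernel value between two feature vectors (assumed kernel)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def precompute_kernel_rows(points, gamma, path):
    """Compute every kernel value once and write the matrix to disk, row by row."""
    with open(path, "wb") as f:
        for x in points:
            row = array("d", (rbf_kernel(x, y, gamma) for y in points))
            row.tofile(f)


def read_kernel_row(path, i, n):
    """Fetch row i of the n-by-n kernel matrix from disk without recomputing it."""
    row = array("d")
    with open(path, "rb") as f:
        f.seek(i * n * row.itemsize)  # rows are stored back to back
        row.fromfile(f, n)
    return row


# Toy data: three 2-dimensional training points (hypothetical values).
points = [[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]]
gamma = 0.5
kernel_path = os.path.join(tempfile.mkdtemp(), "kernel.bin")

precompute_kernel_rows(points, gamma, kernel_path)
row1 = read_kernel_row(kernel_path, 1, len(points))
```

Once the file is written, a training loop in this sketch only needs one row of length n at a time, not the original feature vectors, which is why the cost of a lookup does not grow with the dimensionality of the data.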
Our extensive experimental results show that SIGMA is highly efficient and can handle very large datasets that the state-of-the-art GPU-based algorithm cannot. On datasets small enough for the state-of-the-art GPU-based algorithm to handle, SIGMA consistently outperforms it by an order of magnitude and achieves up to an 86-fold speedup.
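The search for extreme training points over a reduced search space can be pictured as a reduction over a list of candidate indices. The following is a generic sketch, not SIGMA's algorithm: it assumes that "extreme" means the largest and smallest entry of some per-point score array, and the names `argmax_reduce`, `find_extremes`, `values`, and `candidates` are hypothetical. The pairwise (tree) structure mirrors how a parallel reduction is commonly organized on a GPU, although this Python version runs sequentially.

```python
def argmax_reduce(values, candidates):
    """Index of the largest value among candidates, via pairwise (tree) reduction."""
    if not candidates:
        raise ValueError("candidates must not be empty")
    active = list(candidates)
    while len(active) > 1:
        half = (len(active) + 1) // 2
        survivors = []
        for k in range(half):
            left = active[k]
            # On an odd-length level, the unpaired element is compared with itself.
            right = active[k + half] if k + half < len(active) else left
            survivors.append(left if values[left] >= values[right] else right)
        active = survivors  # each level roughly halves the remaining indices
    return active[0]


def find_extremes(values, candidates):
    """Return (index of maximum, index of minimum) restricted to candidates."""
    negated = [-v for v in values]
    return argmax_reduce(values, candidates), argmax_reduce(negated, candidates)


# Hypothetical per-point scores; index 2 is excluded from the search space.
values = [0.3, -1.2, 2.5, 0.9, 2.1, -0.4]
candidates = [0, 1, 3, 4, 5]
extremes = find_extremes(values, candidates)
```

Passing an explicit candidate list means that excluded points are never read at all, which is the sense in which a smaller search space also reduces memory accesses in this sketch.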

Journal: World Wide Web
Publication Date: Apr 22, 2017
Citations: 6

Similar Papers

Ballooning Graphics Memory Space in Full GPU Virtualization Environments
Younghun Park ... Sungyong Park
Scientific Programming | VOL. 2019 | 23 Apr 2019

Fault Classification of Low-Speed Bearings Based on Support Vector Machine for Regression and Genetic Algorithms Using Acoustic Emission
Henry Ogbemudia Omoregbee ... P Stephan Heyns
Journal of Vibration Engineering & Technologies | VOL. 7 | 12 Jun 2019

Acceleration of Large Deep Learning Training with Hybrid GPU Memory Management of Swapping and Re-computing
Haruki Imai ... Yasushi Negishi
10 Dec 2020

Multivariate Calibration Models to Estimate Non-invasively Blood Glucose Levels Based on A Novel Optical Technique Named Pulse Glucometry
Yasuhiro Yamakoshi ... Ken-Ichi Yamakoshi
01 Jan 2009
