Efficient Gradient Boosted Decision Tree Training on GPUs

Zeyi Wen,Shengliang Lu,Jiashuai Shi,Bingsheng He,Ramamohanarao Kotagiri

doi:10.1109/ipdps.2018.00033

Abstract

In this paper, we present a novel parallel implementation for training Gradient Boosting Decision Trees (GBDTs) on Graphics Processing Units (GPUs). Thanks to the wide use of the open sourced XGBoost library, GBDTs have become very popular in recent years and won many awards in machine learning and data mining competitions. Although GPUs have demonstrated their success in accelerating many machine learning applications, there are a series of key challenges of developing a GPU-based GBDT algorithm, including irregular memory accesses, many small sorting operations and varying data parallel granularities in tree construction. To tackle these challenges on GPUs, we propose various novel techniques (including Run-length Encoding compression and thread/block workload dynamic allocation, and reusing intermediate training results for efficient gradient computation). Our experimental results show that our algorithm named GPU-GBDT is often 10 to 20 times faster than the sequential version of XGBoost, and achieves 1.5 to 2 times speedup over a 40 threaded XGBoost running on a relatively high-end workstation of 20 CPU cores. Moreover, GPU-GBDT outperforms its CPU counterpart by 2 to 3 times in terms of performance-price ratio.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Gradient Boosted Decision Tree Training on GPUs

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Exploiting GPUs for Efficient Gradient Boosting Decision Tree Training
Zeyi Wen ... Bingsheng He
IEEE Transactions on Parallel and Distributed Systems | VOL. 30
Zeyi Wen, et. al.Zeyi Wen ... Bingsheng He
01 Dec 2019
IEEE Transactions on Parallel and Distributed Systems | VOL. 30

Practical Federated Gradient Boosting Decision Trees
Qinbin Li ... Bingsheng He
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
Qinbin Li, et. al.Qinbin Li ... Bingsheng He
03 Apr 2020
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34

Pricing barrier and American options under the SABR model on the graphics processing unit
Yu Tian ... Zili Zhu
Concurrency and Computation: Practice and Experience | VOL. 24
Yu Tian, et. al.Yu Tian ... Zili Zhu
07 Jun 2011
Concurrency and Computation: Practice and Experience | VOL. 24

Machine Learning Using Virtualized GPUs in Cloud Environments
Uday Kurkure ... Lan Vu
-
Uday Kurkure, et. al.Uday Kurkure ... Lan Vu
01 Jan 2017
01 Jan 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Gradient Boosted Decision Tree Training on GPUs

Abstract

Talk to us

Similar Papers