Parallel Traversal of Large Ensembles of Decision Trees

Francesco Lettich,Rossano Venturini,Franco Maria Nardini,Raffaele Perego,Nicola Tonellotto,Salvatore Orlando,Claudio Lucchese

doi:10.1109/tpds.2018.2860982

Abstract

Machine-learnt models based on additive ensembles of regression trees are currently deemed the best solution to address complex classification, regression, and ranking tasks. The deployment of such models is computationally demanding: to compute the final prediction, the whole ensemble must be traversed by accumulating the contributions of all its trees. In particular, traversal cost impacts applications where the number of candidate items is large, the time budget available to apply the learnt model to them is limited, and the users’ expectations in terms of quality-of-service is high. Document ranking in web search, where sub-optimal ranking models are deployed to find a proper trade-off between efficiency and effectiveness of query answering, is probably the most typical example of this challenging issue. This paper investigates multi/many-core parallelization strategies for speeding up the traversal of large ensembles of regression trees thus obtaining machine-learnt models that are, at the same time, effective, fast, and scalable. Our best results are obtained by the GPU-based parallelization of the state-of-the-art algorithm, with speedups of up to 102.6x.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parallel Traversal of Large Ensembles of Decision Trees

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems

Lead the way for us

Journal: IEEE Transactions on Parallel and Distributed Systems	Publication Date: Sep 1, 2019
Citations: 36

Similar Papers

QuickScorer: Efficient Traversal of Large Ensembles of Decision Trees
Claudio Lucchese ... Salvatore Orlando
-
Claudio Lucchese, et. al.Claudio Lucchese ... Salvatore Orlando
01 Jan 2017
01 Jan 2017

Multicore/Manycore Parallel Traversal of Large Forests of Regression Trees
Francesco Lettich ... Rossano Venturini
-
Francesco Lettich, et. al.Francesco Lettich ... Rossano Venturini
01 Jul 2017
01 Jul 2017

Distilled Neural Networks for Efficient Learning to Rank
Franco Maria Nardini ... Salvatore Trani
IEEE Transactions on Knowledge and Data Engineering | VOL. -
Franco Maria Nardini, et. al.Franco Maria Nardini ... Salvatore Trani
01 Jan 2021
IEEE Transactions on Knowledge and Data Engineering | VOL. -

Tree-Based On-Line Reinforcement Learning
Andre Barreto
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 28
Andre BarretoAndre Barreto
21 Jun 2014
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parallel Traversal of Large Ensembles of Decision Trees

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems