Scalable CAIM discretization on multiple GPUs using concurrent kernels

Alberto Cano, Krzysztof J. Cios, Sebastián Ventura

doi:10.1007/s11227-014-1151-8

Abstract

Class-attribute interdependence maximization (CAIM) is one of the state-of-the-art algorithms for discretizing data for which classes are known. However, it can be slow on high-dimensional, large-scale data with a large number of attributes and/or instances. This paper addresses the problem with a graphics processing unit (GPU)-based implementation of the CAIM algorithm that significantly speeds up the discretization process on big, complex data sets. The GPU-based implementation scales to multiple GPU devices and uses the concurrent kernel execution capabilities of modern GPUs. The CAIM GPU-based model is evaluated and compared with the original CAIM using single-threaded and multi-threaded parallel configurations on 40 data sets with different characteristics. The results show speedups of up to 139 times using four GPUs, which makes discretization of big data efficient and manageable. For example, the discretization time of one big data set is reduced from 2 h to less than 2 min.
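
The abstract does not restate the criterion that CAIM maximizes, so the following is only a minimal CPU sketch of it for a single attribute, based on the standard definition of the CAIM value: for each interval, square the count of its most frequent class, divide by the number of values in the interval, then average over the intervals. The function name `caim_value`, its parameters, and the interval convention (each interval is closed on the right, and the first also includes its lower bound) are assumptions made for illustration; this is not the GPU implementation described in the paper.

```python
from bisect import bisect_left
from collections import Counter


def caim_value(values, labels, boundaries):
    """CAIM criterion of one attribute for a given discretization scheme.

    `boundaries` is a sorted list [d0, d1, ..., dn] defining n intervals.
    Hypothetical helper: assumes every value lies within [d0, dn].
    """
    n = len(boundaries) - 1
    # One class-count table per interval (the columns of the quanta matrix).
    counts = [Counter() for _ in range(n)]
    for v, y in zip(values, labels):
        # Index of the interval (d_{r}, d_{r+1}] containing v, clamped to range.
        r = min(max(bisect_left(boundaries, v) - 1, 0), n - 1)
        counts[r][y] += 1

    total = 0.0
    for interval in counts:
        size = sum(interval.values())
        if size:  # skip empty intervals to avoid division by zero
            total += max(interval.values()) ** 2 / size
    return total / n


values = [1, 2, 3, 4, 5, 6]
labels = ["a", "a", "a", "b", "b", "b"]
print(caim_value(values, labels, [1, 3.5, 6]))  # boundary separates the classes
print(caim_value(values, labels, [1, 6]))       # single interval mixes both
```

A boundary that cleanly separates the two classes scores higher than leaving all values in one interval, which is the behavior the criterion is meant to reward.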

Journal: The Journal of Supercomputing
Publication Date: Mar 16, 2014
Citations: 49
License type: cc-by-nc-nd

Similar Papers

Multilevel interference-aware scheduling on modern GPUs
Leiming Yu
10 May 2021

Practical and comparative application of efficient data reduction - Multivariate curve resolution
Somaiyeh Khodadadi Karimvand ... Hamid Abdollahi
Analytica Chimica Acta | VOL. 1243
11 Jan 2023

Development of a Powerful Data-Analysis Tool Using Nonparametric Smoothing Models To Identify Drillsites in Tight Shale Reservoirs With High Economic Potential
Quan Cai ... Jenn-Tai Liang
SPE Journal | VOL. 23
10 Nov 2017

Random Sample Partition: A Distributed Data Model for Big Data Analysis
Salman Salloum ... Yulin He
IEEE Transactions on Industrial Informatics | VOL. 15
20 Jan 2018
