Time Performance Analysis of Multi-CPU and Multi-GPU in Big Data Clustering Computation

Widiarto Adiyoso,Ari Wibisono,S Reyneta Carissa Anwar,Ibad Rahadian Saladdin,Sumarsih Condroayu Purbarani,Adila Krisnadhi,Anindhita Dwi Saraswati,Annissa Fildzah Rafi Putri

doi:10.1109/iwbis.2018.8471715

Abstract

Big data is a hot topic that is regularly discussed in the computer science field for the past year. Big data provides numerous benefits for the development of technologies, such as business intelligence and deep learning. Processing big data requires specialized tools and environment, ranging from a commodity-clustered workstation to high performance computing server, especially in big data clustering where unsupervised learning takes place. In this paper, we conduct time analysis of commodity-clustered workstation equipped with Spark as a baseline for multi-CPU big data clustering and TensorFlow installed in a high-performance computing workstation as a baseline for multi-GPU big data clustering. Based on the analysis, it shows that TensorFlow performs have around 5 to 12 times faster computation time than Spark.

Full Text