Equi-Depth Histogram Construction Methodology for Big Data Tools

Tolga Büyüktanir, Ahmet Ercan Topcu

doi:10.2339/politeknik.620198

Abstract

In recent decades, data sources such as social media, machines, and networks have been pushing data into the digital world continuously, and data volumes have grown exponentially. Equi-depth histograms are essential for summarizing the statistical distribution of data for query optimization. In this paper, we present approximate equi-depth histogram construction for big data using both Apache Pig scripts and a Java web interface that interacts with Apache Hadoop. We adopt an approach to equi-depth histogram construction with quality guarantees for big data and implement it with Apache Hadoop MapReduce and Apache Pig user-defined functions. We introduce a prototype that constructs the approximate equi-depth histogram from a JavaServer Faces page using Apache Hadoop jobs and the Hadoop Distributed File System, and we evaluate these methods using this demonstration. We also explain Apache Pig script techniques for creating equi-depth histograms from big data. The results indicate that our system supports writing multiple jobs in Apache Pig, so programmers can use Apache Pig to create histograms and avoid the complexity of implementing MapReduce jobs directly.
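To make the term concrete, the sketch below builds an exact equi-depth histogram on a single machine: the data are sorted and cut into buckets that each hold roughly the same number of values. This is only an illustration of the histogram's definition, not the approximate Hadoop and Pig construction the abstract describes; the function name `equi_depth_histogram` and the sample values are hypothetical.

```python
from typing import List, Tuple


def equi_depth_histogram(
    values: List[float], num_buckets: int
) -> List[Tuple[float, float, int]]:
    """Return (low, high, count) per bucket.

    Each bucket holds about len(values) / num_buckets items, so bucket
    widths vary while bucket depths (counts) stay nearly equal.
    Illustrative, in-memory version only (assumes the data fit in RAM).
    """
    if num_buckets <= 0:
        raise ValueError("num_buckets must be positive")

    data = sorted(values)
    n = len(data)
    buckets = []
    for b in range(num_buckets):
        # Integer boundaries spread any remainder across the buckets.
        start = b * n // num_buckets
        end = (b + 1) * n // num_buckets
        if start == end:
            continue  # fewer values than buckets: skip empty buckets
        chunk = data[start:end]
        buckets.append((chunk[0], chunk[-1], len(chunk)))
    return buckets


if __name__ == "__main__":
    sample = [1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144]
    for low, high, count in equi_depth_histogram(sample, 4):
        print(f"[{low}, {high}] -> {count} values")
```

Sorting the full data set is what becomes impractical at big data scale, which is why approximate constructions over distributed partitions are of interest.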

Journal: Politeknik Dergisi	Publication Date: Sep 1, 2020
License type: cc-by-sa

