An Overview of Apache Pig and Apache Hive

Saiyam Arora, Richa Vasuja, Abinesh Verma

doi:10.32628/cseit195250

Abstract

With advances in technology, data is growing at an alarming rate. The most prominent driver of this growth is social media, which generates the tremendous volume of data known as Big Data. Big Data refers to data sets that are too large and too complex to store and process with traditional database applications. Hadoop addresses this problem through its two major components: HDFS (distributed storage) and MapReduce (parallel processing). Apache Pig and Apache Hive are essential parts of the Hadoop ecosystem. Although Hadoop does well at storing and processing huge volumes of data, additional frameworks now exist to increase its efficiency; these can be seen as layers on top of Hadoop or as parts of the Apache Hadoop project. This paper therefore gives an overview of two of the most important of these layers, Apache Pig and Apache Hive, along with their architectures.

Journal: International Journal of Scientific Research in Computer Science, Engineering and Information Technology
Publication Date: Mar 5, 2019
Citations: 1
