Data Optimization using Apache Flink

Vikas S,Thimmaraju S N

doi:10.35940/ijeat.b3081.129219

Abstract

Map Reduce, Flink, and Spark, also become more popular in the processing of big data lately. Flink will be an open platform Big Data processing system for Apache-powered batch storage and streaming of data. Flink's query optimizer is constructed for historical information processing (batch) based on parallel storage systems approaches. Flink query query optimizer interprets the questions into jobs of different tasks that are regularly sent. Therefore, taking advantage of task similarities should prevent redundant computation. In this article, the multi-demand optimization model for Flink, Flink was planned and designed on Flink Software Stack's top priority. It's thought-about as an associate in Apache Flink's nursing add-on to maximize multi-demand information sharing. The Flink system takes advantage of option operators ' information sharing resources to reduce overlap and duplication of multi-query in-network information movement. Research findings show that the leveraging of shared option operations in vast information on multiple requests would offer promising time to perform queries. Therefore, in the stream phase, Without doubt the Flink approach can be used to boost application performance over time periods.

Highlights

Big data emerged in the digital data age as an innovation space to address the enormous amounts of information produced precisely
For information purposes, delimited information such as conventional data warehouses could be an unique case of similar unbounded information flow, thereby moving traditional information warehouse systems stored on the hard disk to an analytical model in memory like a Flink (Carbone et al, 2015) In this article, batch processing is called a special streaming case wherever the flow is small and record sequence and duration may not matter
It tests the efficacy of the expected batch processing of the Flink Framework and the stream processing that was implemented in Java

Summary

Introduction

In a broad range of applications, successful technology has been implemented (Hsu, 2014), For example, data processing, knowledge analysis, computer programming and scientific computing. MapReduce has existed for the previous couple of years because it is the favored computing model for parallel, batch-style and large-scale data processing (Apache Flink (2016a).

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data Optimization using Apache Flink

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Engineering and Advanced Technology

Lead the way for us

Journal: International Journal of Engineering and Advanced Technology	Publication Date: Dec 30, 2019
License type: cc-by

Similar Papers

High-Performance Big Data Computing
Dhabaleswar K Panda ... Dipti Shankar
-
Dhabaleswar K Panda, et. al.Dhabaleswar K Panda ... Dipti Shankar
02 Aug 2022
02 Aug 2022

Big data processing and analysis platform for condition monitoring of electric power system
Yuanjun Guo ... Yong Wang
-
Yuanjun Guo, et. al.Yuanjun Guo ... Yong Wang
01 Aug 2016
01 Aug 2016

Cloud computing and big data: Technologies and applications
Mostapha Zbakh ... Mohamed Essaaidi
Concurrency and Computation: Practice and Experience | VOL. 29
Mostapha Zbakh, et. al.Mostapha Zbakh ... Mohamed Essaaidi
29 Mar 2017
Concurrency and Computation: Practice and Experience | VOL. 29

Adding Velocity to BigBench
Todor Ivanov ... Ahmad Ghazal
-
Todor Ivanov, et. al.Todor Ivanov ... Ahmad Ghazal
15 Jun 2018
15 Jun 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data Optimization using Apache Flink

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Engineering and Advanced Technology