Big data multi-query optimisation with Apache Flink

Radhya Sahal, Mohamed H Khafagy, Fatma A Omara

doi:10.1504/ijwet.2018.092401

Abstract

Big data analytics frameworks such as MapReduce, Spark and Flink have recently gained popularity for processing large data sets. Flink is an open-source, Apache-hosted big data analytics framework for processing batch and streaming data. For historical (batch) data processing, Flink's query optimiser builds on techniques used in parallel database systems. The optimiser translates queries into jobs, and these jobs are often submitted repeatedly with similar tasks, so exploiting the similarity between tasks can avoid redundant computation. This paper proposes a Flink multi-query optimisation system, Flink-MQO, built on top of the Flink software stack. It acts as an add-on to Apache Flink that optimises multiple queries through data sharing. Flink-MQO exploits the data-sharing opportunities of selection operators to eliminate redundant and duplicated in-network data movement across queries. Experimental results show that exploiting shared selection operators in big data multi-query workloads yields promising query execution times. Flink-MQO could therefore potentially be used in stream processing to improve the performance of real-time applications.
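
To make the idea of a shared selection operator concrete, the following is a minimal sketch in plain Python. It is not the Flink-MQO implementation and does not use the Flink API; the data set, the predicate names, the queries and the two functions `run_separately` and `run_shared` are all hypothetical and exist only for illustration. The sketch assumes that two queries share a selection when they name the same predicate, and it counts predicate evaluations as a rough stand-in for redundant work.

```python
from collections import defaultdict

# Hypothetical input rows (illustration only, not data from the paper).
ROWS = [
    {"id": 1, "price": 50, "region": "EU"},
    {"id": 2, "price": 150, "region": "US"},
    {"id": 3, "price": 200, "region": "EU"},
    {"id": 4, "price": 90, "region": "US"},
]

# Named selection predicates; queries naming the same key share a selection.
PREDICATES = {
    "price_gt_100": lambda r: r["price"] > 100,
    "region_eu": lambda r: r["region"] == "EU",
}

# Each hypothetical query is (selection predicate key, projection function).
QUERIES = {
    "q1": ("price_gt_100", lambda r: r["id"]),
    "q2": ("price_gt_100", lambda r: r["price"]),
    "q3": ("region_eu", lambda r: r["id"]),
}


def run_separately(rows, queries):
    """Baseline: every query scans the input and applies its own selection."""
    results, evaluations = {}, 0
    for name, (pred_key, project) in queries.items():
        out = []
        for row in rows:
            evaluations += 1
            if PREDICATES[pred_key](row):
                out.append(project(row))
        results[name] = out
    return results, evaluations


def run_shared(rows, queries):
    """Shared plan: one scan, each distinct selection evaluated once per row."""
    # Group the queries that consume the same selection predicate.
    consumers = defaultdict(list)
    for name, (pred_key, project) in queries.items():
        consumers[pred_key].append((name, project))

    results = {name: [] for name in queries}
    evaluations = 0
    for row in rows:
        for pred_key, group in consumers.items():
            evaluations += 1
            if PREDICATES[pred_key](row):
                # Fan the selected row out to every query sharing this selection.
                for name, project in group:
                    results[name].append(project(row))
    return results, evaluations
```

Calling `run_separately(ROWS, QUERIES)` and `run_shared(ROWS, QUERIES)` returns the same per-query results, but the shared plan evaluates each distinct predicate once per row instead of once per query per row. In a distributed setting the analogous saving would be in data movement between operators, which this single-process sketch does not model.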


Journal: International Journal of Web Engineering and Technology
Publication Date: Jan 1, 2018
Citations: 6

Similar Papers

Network security and anomaly detection with Big-DAMA, a big data analytics framework
Pedro Casas ... Giuseppe Settanni
01 Sep 2017

Smart Cities and Big Data Analytics: A Data-Driven Decision-Making Use Case
Ahmed M Shahat Osman ... Ahmed Elragal
Smart Cities | VOL. 4
28 Feb 2021

New Framework Modeling for Big Data Analysis of the Future
Mirza Tanweer Ahmad Beig ... Varun Kashyap
20 May 2024

The state of the art and taxonomy of big data analytics: view from new big data framework
Azlinah Mohamed ... Ruhaila Maskat
Artificial Intelligence Review | VOL. 53
01 Feb 2019
