Fries

Zuozhi Wang,Chen Li,Shengquan Ni,Avinash Kumar

doi:10.14778/3565816.3565827

Abstract

A computing job in a big data system can take a long time to run, especially for pipelined executions on data streams. Developers often need to change the computing logic of the job such as fixing a loophole in an operator or changing the machine learning model in an operator with a cheaper model to handle a sudden increase of the data-ingestion rate. Recently many systems have started supporting runtime reconfigurations to allow this type of change on the fly without killing and restarting the execution. While the delay in reconfiguration is critical to performance, existing systems use epochs to do runtime reconfigurations, which can cause a long delay. In this paper we develop a new technique called Fries that leverages the emerging availability of fast control messages in many systems, since these messages can be sent without being blocked by data messages. We formally define consistency in runtime reconfigurations, and develop a Fries scheduler with consistency guarantees. The technique not only works for different classes of dataflows, but also works for parallel executions and supports fault tolerance. Our extensive experimental evaluation on clusters show the advantages of this technique compared to epoch-based schedulers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fries

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment

Lead the way for us

Journal: Proceedings of the VLDB Endowment	Publication Date: Oct 1, 2022
Citations: 2

Similar Papers

Lifelong Machine Learning and root cause analysis for large-scale cancer patient data
Gautam Pal ... Xianbin Hong
Journal of Big Data | VOL. 6
Gautam Pal, et. al.Gautam Pal ... Xianbin Hong
01 Dec 2019
Journal of Big Data | VOL. 6

On providing scalable self-healing adaptive fault-tolerance to RTR SoCs
Byron Navas ... Ingo Sander
-
Byron Navas, et. al.Byron Navas ... Ingo Sander
01 Dec 2014
01 Dec 2014

Fault tolerance in big data storage and processing systems: A review on challenges and solutions
Muntadher Saadoon ... Hamza H.M Altarturi
Ain Shams Engineering Journal | VOL. 13
Muntadher Saadoon, et. al.Muntadher Saadoon ... Hamza H.M Altarturi
01 Mar 2022
Ain Shams Engineering Journal | VOL. 13

Development and implementation of interactive 3D video environment on run-time reconfigurable FPGA platform
Sergiy Zhelnakov
-
Sergiy ZhelnakovSergiy Zhelnakov
22 May 2021
22 May 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fries

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment