Abstract

Fault management systems that use real-time analytics based on Machine Learning (ML) help provide the reliability required in edge clouds, though they suffer from frequent changes in data distribution (concept drift) caused by highly dynamic traffic in edge clouds, requiring frequent adaptations of the ML model. We propose an automated concept drift handling framework for fault prediction in edge clouds using Reinforcement Learning (RL) to select the most appropriate drift adaptation method as well as the amount of data needed for adaptation, while considering edge cloud operator’s requirements. We implemented an edge cloud testbed and introduced infrastructure and network faults to it in the presence of abrupt and incremental concept drift. According to the obtained results, our proposed framework achieves up to 40% higher accuracy compared to a system without drift handling, and up to <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$13\times $ </tex-math></inline-formula> and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$30\times $ </tex-math></inline-formula> less regret for selecting adaptation methods and amount of data, respectively, compared to other approaches.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call