Abstract

Cloud-based virtualization has become a key part of building distributed applications. One of its main benefits is the ability to manage system capacity dynamically by creating, deleting, and migrating virtual machines (VMs) on demand. This management process, however, depends on complex pipelines that involve multiple service invocations across distributed nodes. Because these pipelines lack an integrated logging system, troubleshooting and debugging them is difficult: each service generates independent, unstructured log messages, with no way to link them into a single integrated workflow. We present LogChain, a tool that gathers and processes distributed unstructured logs to diagnose failures in cloud management tasks. It provides three key functions: (i) it infers task workflows from distributed unstructured logs; (ii) it labels these workflows with the tasks that triggered them; and (iii) it diagnoses potential failures in a workflow’s execution, to support administrators in troubleshooting. We evaluate LogChain with realistic workloads and show that it outperforms the state of the art in terms of both performance and accuracy.
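To make the idea of linking independent log messages into workflows concrete, the following is a minimal sketch, not LogChain's actual algorithm, which the abstract does not describe. It assumes that related messages share identifier tokens such as request or VM IDs, and it groups messages that are transitively connected through such tokens. The log lines, service names, identifier format, and the `link_workflows` and `failed_workflows` helpers are all hypothetical.

```python
import re
from collections import defaultdict

# Hypothetical log records: (timestamp, service, message).
# Services, messages, and identifiers are invented for illustration.
LOGS = [
    ("2024-01-01T10:00:00", "api", "Accepted create request req-1a2b for instance vm-9f3c"),
    ("2024-01-01T10:00:01", "scheduler", "Selected host node-7 for vm-9f3c"),
    ("2024-01-01T10:00:02", "api", "Accepted delete request req-3c4d for instance vm-0e1d"),
    ("2024-01-01T10:00:03", "compute", "Spawning vm-9f3c failed: no free disk"),
    ("2024-01-01T10:00:04", "compute", "Destroyed vm-0e1d"),
]

# Assumed identifier format: "req-" or "vm-" followed by four hex digits.
ID_PATTERN = re.compile(r"\b(?:req|vm)-[0-9a-f]{4}\b")


def link_workflows(logs):
    """Group log records that share identifiers, directly or transitively."""
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # First pass: identifiers seen in the same message belong together.
    for _, _, message in logs:
        ids = ID_PATTERN.findall(message)
        for other in ids[1:]:
            parent[find(ids[0])] = find(other)

    # Second pass: assign each record to the group of its first identifier.
    groups = defaultdict(list)
    for record in logs:
        ids = ID_PATTERN.findall(record[2])
        if ids:
            groups[find(ids[0])].append(record)

    # Order each workflow by timestamp (ISO 8601 strings sort chronologically).
    return [sorted(records) for records in groups.values()]


def failed_workflows(workflows):
    """Naive failure check: flag workflows containing the word 'failed'."""
    return [w for w in workflows if any("failed" in msg for _, _, msg in w)]


workflows = link_workflows(LOGS)
failures = failed_workflows(workflows)
```

In this toy input, the create request and the delete request end up in separate workflows even though their messages are interleaved, and only the create workflow is flagged. Real logs rarely carry such clean identifiers, which is part of what makes the problem described above hard.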
