Online workflow management and performance analysis with stampede

Dan Gunter ,Monte Goode ,Ewa Deelman ,Martin Swany ,Taghrid Samak ,Priscilla Moraes ,Fábio Silva ,Gaurang Mehta ,Gideon Juve ,Christopher Brooks ,Karan Vahi

doi:10.5555/2147671.2147695

Abstract

Scientific workflows are an enabler of complex scientific analyses. They provide both a portable representation and a foundation upon which results can be validated and shared. Large-scale scientific workflows are executed on equally complex parallel and distributed resources, where many things can fail. Application scientists need to track the status of their workflows in real time, detect execution anomalies automatically, and perform troubleshooting -- without logging into remote nodes or searching through thousands of log files. As part of the NSF Stampede project, we have developed an infrastructure to answer these needs. The infrastructure captures application-level logs and resource information, normalizes these to standard representations, and stores these logs in a centralized general-purpose schema. Higher-level tools mine the logs in real time to determine current status, predict failures, and detect anomalous performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Online workflow management and performance analysis with stampede

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Online Fault and Anomaly Detection for Large-Scale Scientific Workflows
Taghrid Samak ... Gaurang Mehta
-
Taghrid Samak, et. al.Taghrid Samak ... Gaurang Mehta
01 Sep 2011
01 Sep 2011

Scheduling parameter sweep workflow in the grid

-

17 Feb 2017
17 Feb 2017

From the Desktop to the Grid: conversion of KNIME Workflows to gUSE
...
-
, et. al. ...
01 Jan 2013
01 Jan 2013

Complicated Geospatial Flow Processing with Scientific Workflow
...
-
, et. al. ...
05 Dec 2020
05 Dec 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Online workflow management and performance analysis with stampede

Abstract

Talk to us

Similar Papers