Distributed Computing Infrastructures Research Articles

The LHC, operating at CERN, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe. ATLAS, one of the largest collaborations ever assembled in the sciences, is at the forefront of research at the LHC. To address an unprecedented multi-petabyte data processing challenge, the ATLAS experiment relies on a heterogeneous distributed computational infrastructure. The ATLAS experiment uses the PanDA (Production and Data Analysis) Workload Management System (WMS) to manage the workflow for all data processing at over 150 data centers. Through PanDA, ATLAS physicists see a single computing facility that enables rapid scientific breakthroughs for the experiment, even though the data centers are physically scattered all over the world. While PanDA currently uses more than 250,000 cores with a peak performance of 0.3 petaFLOPS, LHC data-taking runs require more resources than the grid can possibly provide. To alleviate these challenges, LHC experiments are engaged in an ambitious program to expand the current computing model to include additional resources such as the opportunistic use of supercomputers. We will describe a project aimed at integration of PanDA WMS with supercomputers in the United States, in particular with the Titan supercomputer at the Oak Ridge Leadership Computing Facility. The current approach uses a modified PanDA pilot framework for job submission to the supercomputers' batch queues and local data management, with light-weight MPI wrappers to run single-threaded workloads in parallel on the LCFs' multi-core worker nodes. 
This implementation was tested with a variety of Monte-Carlo workloads on several supercomputing platforms for the ALICE and ATLAS experiments, and it has been in full production for ATLAS since September 2015. We will present our current accomplishments with running PanDA at supercomputers and demonstrate our ability to use PanDA as a portal independent of the computing facilities' infrastructure for High Energy and Nuclear Physics as well as other data-intensive science applications, such as bioinformatics and astro-particle physics.

The Large Hadron Collider (LHC), operating at the international CERN Laboratory in Geneva, Switzerland, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe, and were recently credited for the discovery of a Higgs boson. ATLAS, one of the largest collaborations ever assembled in the sciences, is at the forefront of research at the LHC. To address an unprecedented multi-petabyte data processing challenge, the ATLAS experiment relies on a heterogeneous distributed computational infrastructure. The ATLAS experiment uses the PanDA (Production and Data Analysis) Workload Management System (WMS) to manage the workflow for all data processing at over 140 data centers. Through PanDA, ATLAS physicists see a single computing facility that enables rapid scientific breakthroughs for the experiment, even though the data centers are physically scattered all over the world. While PanDA currently uses more than 250,000 cores with a peak performance of 0.3+ petaFLOPS, the next LHC data-taking runs will require more resources than Grid computing can possibly provide. To alleviate these challenges, LHC experiments are engaged in an ambitious program to expand the current computing model to include additional resources such as the opportunistic use of supercomputers. We will describe a project aimed at integration of PanDA WMS with supercomputers in the United States, Europe and Russia (in particular with the Titan supercomputer at the Oak Ridge Leadership Computing Facility (OLCF), the supercomputer at the National Research Center “Kurchatov Institute”, IT4 in Ostrava, and others). The current approach uses a modified PanDA pilot framework for job submission to the supercomputers' batch queues and local data management, with light-weight MPI wrappers to run single-threaded workloads in parallel on Titan’s multi-core worker nodes. 
This implementation was tested with a variety of Monte-Carlo workloads on several supercomputing platforms. We will present our current accomplishments in running PanDA WMS at supercomputers and demonstrate our ability to use PanDA as a portal independent of the computing facility’s infrastructure for High Energy and Nuclear Physics, as well as other data-intensive science applications, such as bioinformatics and astro-particle physics.
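
The abstract mentions light-weight MPI wrappers that run single-threaded workloads in parallel on multi-core worker nodes. The following is only a minimal sketch of that general pattern, not the PanDA pilot code: it assumes the `mpi4py` package is available, and the names `run_payload_for_rank` and `main`, the `rank_NNNNN` directory layout, and passing the payload command on the command line are all hypothetical choices made for illustration.

```python
import os
import subprocess
import sys


def run_payload_for_rank(rank, payload_cmd, base_dir):
    """Run one single-threaded payload in a rank-private working directory."""
    workdir = os.path.join(base_dir, f"rank_{rank:05d}")
    os.makedirs(workdir, exist_ok=True)
    # Each rank gets its own directory so that independent copies of the
    # payload do not overwrite each other's output files.
    with open(os.path.join(workdir, "payload.log"), "w") as log:
        result = subprocess.run(
            payload_cmd, cwd=workdir, stdout=log, stderr=subprocess.STDOUT
        )
    return result.returncode


def main():
    # Assumption: mpi4py is installed. It is imported here so the helper
    # above stays usable on a machine without an MPI environment.
    from mpi4py import MPI

    comm = MPI.COMM_WORLD
    rank = comm.Get_rank()
    payload_cmd = sys.argv[1:]  # hypothetical: payload given on the command line
    code = run_payload_for_rank(rank, payload_cmd, os.getcwd())
    # Collect exit codes on rank 0 so the batch job reports one overall status.
    codes = comm.gather(code, root=0)
    if rank == 0:
        sys.exit(0 if all(c == 0 for c in codes) else 1)
```

In this sketch MPI is used only to start one wrapper process per core and to collect exit codes; the payloads themselves never communicate. To use it under an MPI launcher, a call to `main()` would be added at the bottom of the script, with one rank started per core.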

Articles published on Distributed Computing Infrastructures

Extension of Distributed Computing Infrastructure and Services Portfolio for Research and Educational Activities

Evaluating Distributed Computing Infrastructures: An Empirical Study Comparing Hadoop Deployments on Cloud and Local Systems

End-to-End SDN/NFV Orchestration of Multi-Domain Transport Networks and Distributed Computing Infrastructure for Beyond-5G Services

Exploring the self-service model to visualize the results of the ATLAS Machine Learning analysis jobs in BigPanDA with Openshift OKD3

Exascale Data Processing in Heterogeneous Distributed Computing Infrastructure for Applications in High Energy Physics

ArchaeoGRID Science Gateways for Easy Access to Distributed Computing Infrastructure for Large Data Storage and Analysis in Archaeology and History

The INDIGO-Datacloud Authentication and Authorization Infrastructure

AGIS: Integration of new technologies used in ATLAS Distributed Computing

Monitoring performance of a highly distributed and complex computing infrastructure in LHCb

CASAS: A tool for composing automatically and semantically astrophysical services

Web Services as Building Blocks for Science Gateways in Astrophysics

Integration Of PanDA Workload Management System With Supercomputers for ATLAS and Data Intensive Science

A Meta-Brokering Framework for Science Gateways

Integration of Panda Workload Management System with supercomputers

Using OpenMI and a Model MAP to Integrate WaterML2 and NetCDF Data Sources into Flood Modeling of Genoa, Italy

Integration and Combined Use of Distributed Computing Resources with Everest

Accelerating Science Impact through Big Data Workflow Management and Supercomputing

ATLAS Distributed Computing in LHC Run2

A Set of Successive Job Allocation Models in Distributed Computing Infrastructures

Remote storage management in science gateways via data bridging
