Amazon EC2 Research Articles

The demand to deliver fast responses in processing time-evolving graphs is higher than ever before in a large number of big data applications. This problem promotes extensive uses of an incremental computing model, which executes the underlying graph algorithm on the newly updated graph structure by taking the results of the computation on the outdated graph structure as initial values, in distributed time-evolving graph computing systems. In this paper, we experimentally study how the initial values of the computation on a newly updated graph structure influence the convergence of the iterative graph analysis, and we develop an optimization framework on the basis of the incremental computing model to accelerate the convergence of processing time-evolving graphs thus achieving high performance for time-evolving graph analysis. In contrast to the traditional incremental computing model, which uses the results of the computation on the outdated graph structure directly, the proposed framework predicts the optimal initial values of the computation on the new graph structure and thereby reduces the number of iterations. Two different prediction approaches are designed to optimize the initial values based on a combination of the results of the computation on the previous graph data and the newly incoming graph data. We have evaluated our optimization framework using the graph algorithms PageRank and KMeans on Amazon EC2 clusters. The experiments demonstrate that the incremental computing implementation with the initial value prediction have reduced the number of iterations by 30% for the PageRank algorithm and 13.7% for the KMeans algorithm, and reduced the response time by 12.7% and 10.6% accordingly compared to the traditional incremental computing model.

Read full abstract

We propose a new technique for the mitigation of fail-stop failures and/or silent data corruptions (SDCs) within linear, sesquilinear or bijective (LSB) operations on $M$ integer data streams ( $M\geq 3$ ). In the proposed approach, the $M$ input streams are linearly superimposed to form $M$ numerically entangled integer data streams that are stored in-place of the original inputs, i.e., no additional (aka. “checksum”) streams are used. An arbitrary number of LSB operations can then be performed in $M$ processing cores using these entangled data streams. The output results can be extracted from any $M-K$ entangled output streams by additions and arithmetic shifts, thereby mitigating $K$ fail-stop failures ( $K\leq \left\lfloor \frac{M-1}{2}\right\rfloor$ ), or detecting up to $K$ SDCs per $M$ -tuple of outputs at corresponding in-stream locations. Therefore, unlike other methods, the number of operations required for the entanglement, extraction and recovery of the results is linearly related to the number of the inputs and does not depend on the complexity of the performed LSB operations. Our proposal is validated within an Amazon EC2 instance (Haswell architecture with AVX2 support) via integer matrix product operations. Our analysis and experiments for fail-stop failure mitigation and SDC detection reveal that the proposed approach incurs 0.75 to 37.23 percent reduction in processing throughput in comparison to the equivalent error-intolerant processing. This overhead is found to be up to two orders of magnitude smaller than that of the equivalent checksum-based method, with increased gains offered as the complexity of the performed LSB operations is increasing. Therefore, our proposal can be used in distributed systems, unreliable multicore clusters and safety-critical applications, where robustness against failures and SDCs is a necessity.

Read full abstract

Amazon EC2 Research Articles

Related Topics

Articles published on Amazon EC2

Multi-Objective Workflow Scheduling With Deep-Q-Network-Based Multi-Agent Reinforcement Learning

Scheming an Proficient Auto Scaling Technique for Minimizing Response Time in Load Balancing on Amazon AWS Cloud

A low-latency computing framework for time-evolving graphs

Cost-Effective Algorithm for Workflow Scheduling in Cloud Computing Under Deadline Constraint

A Game Theoretic Approach to Estimate Fair Cost of VM Placement in Cloud Data Center

Low-Complexity Distributed Radio Access Network Slicing: Algorithms and Experimental Results

ECloud - Magnetotelluric Webapp

Multiagent system for dynamic resource provisioning in cloud computing platforms

Truthful Online Auction Toward Maximized Instance Utilization in the Cloud

Dynamic Load Balancing Model Based on Server Status (DLBS) for Green Computing

Generalized Numerical Entanglement for Reliable Linear, Sesquilinear and Bijective Operations on Integer Data Streams

A MOBILE CROWDSOURCING SYSTEM FOR FOOTBALL MATCH LIVE VIDEO STREAMING

Online Auction for IaaS Clouds: Towards Elastic User Demands and Weighted Heterogeneous VMs

Analysis of Bounds on Hybrid Vector Clocks

Deploying Docker Swarm cluster on hybrid clouds using Occopus

Cluster Programming using the OpenMP Accelerator Model

Evidence-Aware Mobile Computational Offloading

Power and resource-aware virtual machine placement for IaaS cloud

Development of Real Time System for File Compression on Amazon EC2

An MCDM method for cloud service selection using a Markov chain and the best-worst method

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Amazon EC2 Research Articles

Related Topics

Articles published on Amazon EC2

Multi-Objective Workflow Scheduling With Deep-Q-Network-Based Multi-Agent Reinforcement Learning

Scheming an Proficient Auto Scaling Technique for Minimizing Response Time in Load Balancing on Amazon AWS Cloud

A low-latency computing framework for time-evolving graphs

Cost-Effective Algorithm for Workflow Scheduling in Cloud Computing Under Deadline Constraint

A Game Theoretic Approach to Estimate Fair Cost of VM Placement in Cloud Data Center

Low-Complexity Distributed Radio Access Network Slicing: Algorithms and Experimental Results

ECloud - Magnetotelluric Webapp

Multiagent system for dynamic resource provisioning in cloud computing platforms

Truthful Online Auction Toward Maximized Instance Utilization in the Cloud

Dynamic Load Balancing Model Based on Server Status (DLBS) for Green Computing

Generalized Numerical Entanglement for Reliable Linear, Sesquilinear and Bijective Operations on Integer Data Streams

A MOBILE CROWDSOURCING SYSTEM FOR FOOTBALL MATCH LIVE VIDEO STREAMING

Online Auction for IaaS Clouds: Towards Elastic User Demands and Weighted Heterogeneous VMs

Analysis of Bounds on Hybrid Vector Clocks

Deploying Docker Swarm cluster on hybrid clouds using Occopus

Cluster Programming using the OpenMP Accelerator Model

Evidence-Aware Mobile Computational Offloading

Power and resource-aware virtual machine placement for IaaS cloud

Development of Real Time System for File Compression on Amazon EC2

An MCDM method for cloud service selection using a Markov chain and the best-worst method