Amazon EC2 Spot Instances Research Articles

Analytical data processing has become the cornerstone of today's businesses success, and it is facilitated by Big Data platforms that offer virtually limitless scalability. However, minimizing the total cost of ownership (TCO) for the infrastructure can be challenging. We propose a novel method to build resilient clusters on cloud resources that are fine-tuned to the particular data processing task. The presented architecture follows the infrastructure-as-a-code paradigm so that the cluster can be dynamically configured and managed. It first identifies the optimal cluster size to perform a job in the required time. Then, by analyzing spot instance price history and using ARIMA models, it optimizes the schedule of the job execution to leverage the discounted prices of the cloud spot market. In particular, we evaluated savings opportunities when using Amazon EC2 spot instances comparing to on-demand resources. The performed experiments confirmed that the prediction module significantly improved the cost-effectiveness of the solution – up to 80% savings compared to the on-demand prices, and at the worst-case, 1% more cost than the absolute minimum. The production deployments of the architecture show that it is invaluable for minimizing the total cost of ownership of analytical data processing solutions. • Extended cloud architecture that dynamically selects and schedules resources. • Evaluated several spot price prediction models to forecast future prices. • Confirmed feasibility of short-term spot price prediction through real data from AWS. • Cost savings of up to 80% compared to on-demand and within 1% of the absolute minimum.

Read full abstract

We present Dithen, a novel computation-as-a-service (CaaS) cloud platform specifically tailored to the parallel execution of large-scale multimedia tasks. Dithen handles the upload/download of both multimedia data and executable items, the assignment of compute units to multimedia workloads, and the reactive control of the available compute units to minimize the cloud infrastructure cost under deadline-abiding execution. Dithen combines three key properties: (i) the reactive assignment of individual multimedia tasks to available computing units according to availability and predetermined time-to-completion constraints; (ii) optimal resource estimation based on Kalman-filter estimates; (iii) the use of additive increase multiplicative decrease (AIMD) algorithms (famous for being the resource management in the transport control protocol) for the control of the number of units servicing workloads. The deployment of Dithen over Amazon EC2 spot instances is shown to be capable of processing more than 80,000 video transcoding, face detection and image processing tasks (equivalent to the processing of more than 116 GB of compressed data) for less than $1 in billing cost from EC2. Moreover, the proposed AIMD-based control mechanism, in conjunction with the Kalman estimates, is shown to provide for more than 27 percent reduction in EC2 spot instance cost against methods based on reactive resource estimation. Finally, Dithen is shown to offer a 38 to 500 percent reduction of the billing cost against the current state-of-the-art in CaaS platforms on Amazon EC2 (Amazon Lambda and Amazon Autoscale). A baseline version of Dithen is currently available at dithen.com under the “AutoScale” option.

Read full abstract

Amazon EC2 Spot Instances Research Articles

Articles published on Amazon EC2 Spot Instances

Machine Learning on Volatile Instances: Convergence, Runtime, and Cost Tradeoffs

FSpot: Fast and Efficient Video Encoding Workloads Over Amazon Spot Instances

Cost Optimization for Big Data Workloads Based on Dynamic Scheduling and Cluster-Size Tuning

Constructing Reliable Computing Environments on Top of Amazon EC2 Spot Instances

Workflow Scheduling with Amazon EC2 Spot Instances: Building Reliable Compute Environments

Towards increasing reliability of Amazon EC2 spot instances with a fault-tolerant multi-agent architecture

Dithen: A Computation-as-a-Service Cloud Platform for Large-Scale Multimedia Processing

Scheduling Flexible Demand in Cloud Computing Spot Markets

Perceptive bidding strategy for Amazon EC2 spot instance market

OptiSpot: minimizing application deployment cost using spot cloud resources.

Large-scale Image Processing using Amazon EC2 Spot Instances

SipaaS: Spot instance pricing as a Service framework and its implementation in OpenStack

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Amazon EC2 Spot Instances Research Articles

Articles published on Amazon EC2 Spot Instances

Machine Learning on Volatile Instances: Convergence, Runtime, and Cost Tradeoffs

FSpot: Fast and Efficient Video Encoding Workloads Over Amazon Spot Instances

Cost Optimization for Big Data Workloads Based on Dynamic Scheduling and Cluster-Size Tuning

Constructing Reliable Computing Environments on Top of Amazon EC2 Spot Instances

Workflow Scheduling with Amazon EC2 Spot Instances: Building Reliable Compute Environments

Towards increasing reliability of Amazon EC2 spot instances with a fault-tolerant multi-agent architecture

Dithen: A Computation-as-a-Service Cloud Platform for Large-Scale Multimedia Processing

Scheduling Flexible Demand in Cloud Computing Spot Markets

Perceptive bidding strategy for Amazon EC2 spot instance market

OptiSpot: minimizing application deployment cost using spot cloud resources.

Large-scale Image Processing using Amazon EC2 Spot Instances

SipaaS: Spot instance pricing as a Service framework and its implementation in OpenStack