Abstract

This is a survey article that explores six Cloud-based deployment archetypes for Cloud applications and the tradeoffs between them to achieve high availability, low end-user latency, and acceptable costs. These are (1) Zonal, (2) Regional, (3) Multi-regional, (4) Global, (5) Hybrid, and (6) Multi-cloud deployment archetypes. The goal is to classify cloud applications into a set of deployment archetypes and deployment models that tradeoff their needs around availability, latency, and geographical constraints with a focus on serving applications. This enables application owners to better examine the tradeoffs of each deployment model and what is needed for achieving the availability and latency goals for their application.

Highlights

  • The approaches described in our article are standard ones existing in the cloud computing industry [41], and we have focused on which ones to use for the various archetype models described

  • Fog and mobile deployments minimize user-perceived latency, they come with limited capacity, storage, and available energy as discussed in Reference [126]. These limitations create new deployment archetypes where computations are offloaded from mobile to edge cloud, where a cloud application stack goes all the way from cloud to edge to the mobile devices

  • We examined several deployment archetypes for Cloud applications

Read more

Summary

INTRODUCTION

In looking at how applications have changed over the past 20 years, we have evolved from a world where planned maintenance downtime was standard and business applications were typically available only 99% of the year [117] to today where applications are expected to be up and running 24/7. Even applications built around deployment options that were only available 20+ years ago need to be supported in this 24/7 available and low-latency world. Calder existing boundaries or go for partial or complete rewrites to achieve higher availability, better end-user latency, and increased operational efficiency and agility. We explore six Cloud-based deployment archetypes for Cloud applications and the tradeoffs between them to achieve high availability and low end-user latency. These are (1) Zonal, (2) Regional, (3) Multi-regional, (4) Global, (5) Hybrid, and (6) Multi-cloud deployment archetypes. We look at multiple categories of application deployments from Enterprise to Cloud-native applications, their impact on availability and latency, and how they can leverage these six deployment archetypes

Principles of Availability
Types of Applications
Data Durability, Availability, and Backup
Six Deployment Archetypes for Cloud Applications
Single Zone
Primary Zone with Failover Zone
REGIONAL
Single Region
Primary Region with Failover Region
MULTI-REGIONAL
Fully Isolated Stacks with Data Sharding
DNS Load Balancing
DNS Load Balancing with Isolated Stacks
DNS LB with Custom Multi-regional Load Balancing
GLOBAL
Global Anycast
Global Anycast LB with Isolated Regional Stacks
Global Services Stack In this deployment, services are global
HYBRID
MULTI-CLOUD
COMPARING AND SELECTING DEPLOYMENT MODELS
Failover-to-Standby Versus Load Balancing
Regional Versus Global Application Stacks
Instantaneous Recovery
Cost of Availability
Using Deployment Models
Best Practices
Deployment Efficiency
ADDITIONAL RELATED WORK
10 SUMMARY
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call