disaster recovery with the aws cloud

51
Disaster Recovery with the AWS Cloud Paul Duffy

Upload: amazon-web-services

Post on 15-Jan-2015

2.206 views

Category:

Technology


1 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Disaster Recovery with the AWS Cloud

Disaster Recovery with the AWS Cloud

Paul Duffy

Page 2: Disaster Recovery with the AWS Cloud

Some Context

Page 3: Disaster Recovery with the AWS Cloud

What you told us…“DR ‘top of the list’ for budgeted cloud initiatives in 2012”

Source: AWS Customer Survey, 2011`

Page 4: Disaster Recovery with the AWS Cloud

Why Disaster Recovery on AWS..?

Low cost

Instant Elasticity

Open & Flexible

Secure

Page 5: Disaster Recovery with the AWS Cloud

AWS encourages some new HA/DR thinking…

Amazon S3Bucket

AZ-1

Region

Elastic LoadBalancer

Amazon SNSNotifications

Auto Scaling Group

Web Server

AZ-2

Amazon EC2Instances

WebServer

Page 6: Disaster Recovery with the AWS Cloud

DR on AWS does not mean “rearchitect”

Existing, on-premises Infrastructure

This talk is about using AWS for DR for your datacenters

Page 7: Disaster Recovery with the AWS Cloud

Spot the difference!

Production:RoutersFirewallsIP NetworkApplication LicensesOperating SystemsHypervisorServersStorage NetworkPrimary StorageBackup SWBackup TapesTape SilosArchive SWArchive Storage

DR Site (Datacenter):Routers

Firewalls

IP Network

Application Licenses

Operating Systems

Hypervisor

Servers

Storage Network

Snapshot Storage

Backup SW

Backup Tapes

Tape Silos

Archive SW

Archive Storage

DR Site (AWS):Routers

Firewalls

IP Network

Application Licenses

Operating Systems

Hypervisor

Servers

Storage Network

Snapshot Storage

Backup SW

Backup Tapes

Tape Silos

Archive SW

Archive Storage

Page 8: Disaster Recovery with the AWS Cloud

A different cost model

2nd Site Cost

AWS Cost

Demand

Cost savings w/ AWS

Ability to scale – no arbitrary time limit to

failback

Time

Infr

astr

uct

ure

Co

st

Test Test Failover Failback

Page 9: Disaster Recovery with the AWS Cloud

How to do DR with AWS..?

Page 10: Disaster Recovery with the AWS Cloud

AWS - Flexible, Global Infrastructure

AWS Regions

AWS Edge Locations

Page 11: Disaster Recovery with the AWS Cloud

AWS Security Standards

Certifications

SOC 1 Type 2 (formerly SAS-70)

ISO 27001

PCI DSS for EC2, S3, EBS, VPC, RDS, ELB, IAM

FISMA Moderate Compliant Controls

HIPAA & ITAR Compliant Architecture

Physical Security

Datacenters in nondescript facilities

Physical access strictly controlled

Must pass two-factor authentication at least twice for floor access

Physical access logged and audited

HW, SW, Network

Systematic change management

Phased updates deployment

Safe storage decommission

Automated monitoring and self-audit

Advanced network protection

Page 12: Disaster Recovery with the AWS Cloud

Which AWS Services for DR..?

Amazon Simple Storage Service (Amazon S3)

Amazon Import/Export

Amazon Elastic Compute Cloud (Amazon EC2)

AWS Storage Gateway

Amazon Route 53

Page 13: Disaster Recovery with the AWS Cloud

Scenarios

Page 14: Disaster Recovery with the AWS Cloud

Disaster Recovery Terms

RTO: Recovery Time Objective• Acceptable time period within which normal operation (or

degraded operation) needs to be restored after event

RPO: Recovery Point Objective• Acceptable data loss measured in time

Page 15: Disaster Recovery with the AWS Cloud

Backup and Restore

On-premises Infrastructure

Traditional server

Amazon Route 53

AWS Import/Export

S3 Bucket with Objects

Data copied to S3

Page 16: Disaster Recovery with the AWS Cloud

Backup and Restore – Storage Gateway

Page 17: Disaster Recovery with the AWS Cloud

Backup and Restore

Availability ZoneAWS Region

Data Volume

Amazon EC2Instance

AMI

Amazon S3 Bucket

Data copied from objects in S3

Instance Quickly provisioned from

AMI

Pre-bundled with OS and

applications

Page 18: Disaster Recovery with the AWS Cloud

Backup and Restore

Advantages• Simple to get started• Extremely cost effective (mostly backup storage)

Preparation Phase• Take backups of current systems• Store backups in S3• Describe procedure to restore from backup on AWS

• Know which AMI to use, build your own as needed• Know how to restore system from backups• Know how to switch to new system• Know how to configure the deployment

Page 19: Disaster Recovery with the AWS Cloud

Backup and Restore – Storage Gateway

Advantages• Simple to get started• Extremely cost effective (mostly backup storage)

Preparation Phase• Download AWS Storage Gateway software appliance• Install and configure Storage Gateway• Use Storage Gateway• Describe procedure to restore from backup on AWS

• Know which AMI to use, build your own as needed• Know how to switch to new system• Know how to configure the deployment

Page 20: Disaster Recovery with the AWS Cloud

Backup and Restore

In Case of Disaster• Retrieve backups from S3• Bring up required infrastructure

• EC2 instances with prepared AMIs, Load Balancing, etc.

• Restore system from backup• Switch over to the new system

• Adjust DNS records to point to AWS

Objectives• RTO: as long as it takes to bring up infrastructure and restore system from

backups• RPO: time since last backup

Page 21: Disaster Recovery with the AWS Cloud

Pilot Light

User or system

WebServer

ApplicationServer

DatabaseServer

Data Volume

Web Server

ApplicationServer

DatabaseServer

Data Volume

Data Mirroring/ Replication

Not Running

Smaller Instance

Amazon Route 53

Page 22: Disaster Recovery with the AWS Cloud

Pilot Light

User or system

WebServer

DatabaseServer

Data Volume

Web Server

ApplicationServer

DatabaseServer

Data Volume

Not Running

Smaller Instance

Amazon Route 53

WebServer

ApplicationServer

DatabaseServer Data Mirroring/

Replication

Page 23: Disaster Recovery with the AWS Cloud

ApplicationServer

Web Server

Pilot Light

User or system

WebServer

DatabaseServer

Data Volume

DatabaseServer

Data Volume

Start in minutes

Resize as desired

Amazon Route 53

WebServer

ApplicationServer

DatabaseServer Data Mirroring/

Replication

Page 24: Disaster Recovery with the AWS Cloud

Pilot Light

Advantages• Very cost effective (fewer 24/7 resources)

Preparation Phase• Enable replication of all critical data to AWS• Prepare all required resources for automatic start

• AMIs, Network Settings, Load Balancing, etc.

• Reserved Instances

Page 25: Disaster Recovery with the AWS Cloud

Pilot Light

In Case of Disaster• Automatically bring up resources around the replicated core data

set• Scale the system as needed to handle current production traffic• Switch over to the new system

• Adjust DNS records to point to AWS

Objectives• RTO: as long as it takes to detect need for DR and automatically

scale up replacement system• RPO: depends on replication type

Page 26: Disaster Recovery with the AWS Cloud

WebServer

Fully-Working Low Capacity Standby

User or system

Data Volume

Data Volume

Data Mirroring/ Replication

Low CapacityAmazon Route 53

WebServer

AppServer

DBServer

DatabaseServer

ApplicationServer

Page 27: Disaster Recovery with the AWS Cloud

Fully-Working Low Capacity Standby

User or system

Data Volume

Data Volume

Low CapacityAmazon Route 53

WebServer

AppServer

DBServerData Mirroring/

Replication

WebServer

DatabaseServer

ApplicationServer

Page 28: Disaster Recovery with the AWS Cloud

Fully-Working Low Capacity Standby

User or system

Data Volume

AppServer

DBServer

Data Volume

Grow CapacityAmazon Route 53

WebServer

Web Server

ApplicationServer

DatabaseServer

WebServer

DatabaseServer

ApplicationServer

Data Mirroring/ Replication

Page 29: Disaster Recovery with the AWS Cloud

Fully-Working Low Capacity Standby

User or system

Data Volume

AppServer

DBServer

Data Volume

Grow CapacityAmazon Route 53

WebServer

Web Server

ApplicationServer

DatabaseServer

WebServer

DatabaseServer

ApplicationServer

Data Mirroring/ Replication

Page 30: Disaster Recovery with the AWS Cloud

Fully-Working Low-Capacity Standby

Advantages• Can take some production traffic at any time• Cost savings (IT footprint smaller than full DR)

Preparation• Similar to Pilot Light• All necessary components running 24/7, but not scaled for

production traffic• Best practice – continuous testing

• “Trickle” a statistical subset of production traffic to DR site

Page 31: Disaster Recovery with the AWS Cloud

Fully-Working Low-Capacity Standby

In Case of Disaster• Immediately fail over most critical production load

• Adjust DNS records to point to AWS

• (Auto) Scale the system further to handle all production load

Objectives• RTO: for critical load: as long as it takes to fail over; for all other

load, as long as it takes to scale further• RPO: depends on replication type

Page 32: Disaster Recovery with the AWS Cloud

Multi-Site Hot Standby

User or system

Data Volume

AppServer

DBServer

Data Volume

Data Mirroring/ Replication

Full CapacityAmazon Route 53

WebServer

ApplicationServer

DatabaseServer

Web Server

ApplicationServer

DatabaseServer

Web Server

ApplicationServer

DatabaseServer

Page 33: Disaster Recovery with the AWS Cloud

Multi-Site Hot Standby

Advantages• At any moment can take all production load

Preparation• Similar to Low-Capacity Standby• Fully scaling in/out with production load

In Case of Disaster• Immediately fail over all production load

• Adjust DNS records to point to AWS

Objectives• RTO: as long as it takes fail over• RPO: depends on replication type

Page 34: Disaster Recovery with the AWS Cloud

Please test..!

Page 35: Disaster Recovery with the AWS Cloud

Keith MeadeSenior Account Director, Partner

What’s Up Interactive@keithmeade

Page 36: Disaster Recovery with the AWS Cloud

About What’s Up Interactive

True Digital Agency – Marketing, Creative, Technology

Atlanta, GA

25 employees

Clients: FOX, AT&T, DS Waters, Georgia Lottery, Georgia Aquarium

Growth Businesses (Building a Web Presence)

Page 37: Disaster Recovery with the AWS Cloud
Page 38: Disaster Recovery with the AWS Cloud
Page 39: Disaster Recovery with the AWS Cloud
Page 40: Disaster Recovery with the AWS Cloud
Page 41: Disaster Recovery with the AWS Cloud
Page 42: Disaster Recovery with the AWS Cloud

Our Experience

April 2011 – Implemented new backup and offsite storage solution utilizing S3

August 2011 – Backup of select sites & systems in EC2

Page 43: Disaster Recovery with the AWS Cloud

Rolling Out this Year

Pilot light and Low Capacity solutions for all our clients

Page 44: Disaster Recovery with the AWS Cloud

Savings of $2,500/month via Pilot Light

Page 45: Disaster Recovery with the AWS Cloud
Page 46: Disaster Recovery with the AWS Cloud

What We Learned

Try It• Get a good understanding of capabilities and potential• Get a handle on costs

Test It, and Test Again• Test systems and data flow to ensure replication is working

correctly

Realize Its Potential• Quickly scalable to add resources to production

Page 47: Disaster Recovery with the AWS Cloud

Next steps…

Page 48: Disaster Recovery with the AWS Cloud

Resources

http://aws.amazon.com/disaster-recovery/• Whitepapers• Customer Videos• Case Studies

Page 49: Disaster Recovery with the AWS Cloud

AWS Storage Solution Provider Partners

Page 50: Disaster Recovery with the AWS Cloud

Call to action

Learn more about our DR resources

Evaluate using AWS for a DR project

Start testing – first steps are simple

Give us feedback

Page 51: Disaster Recovery with the AWS Cloud

Thank you!

Come and talk to us at the booth!