journey through the cloud: disaster recovery

70
Journey through the Cloud: Disaster Recovery Ryan Shuttleworth – Technical Evangelist @ryanAWS

Upload: amazon-web-services

Post on 15-Jan-2015

772 views

Category:

Technology


2 download

DESCRIPTION

This complementary presentation discusses the use of AWS as a disaster recovery (DR) environment and how the architectural approaches to DR in the AWS cloud make a perfect learning ground for more complex production application deployments. In this webinar we introduce differing architectural patterns for using AWS as a disaster recovery platform and how AWS can make a cost effective part of your overall business continuity planning. By discussing a customer use-case end to end, we'll highlight key lessons learned to help you on your journey into the AWS cloud.

TRANSCRIPT

Page 1: Journey Through the Cloud: Disaster Recovery

Journey through the Cloud:

Disaster Recovery

Ryan Shuttleworth – Technical Evangelist @ryanAWS

Page 2: Journey Through the Cloud: Disaster Recovery

Common use cases & stepping stones into the AWS cloud Learning from customer journeys

Best practices to bootstrap your projects

Journey through the cloud

Page 3: Journey Through the Cloud: Disaster Recovery

Explore AWS for a ‘non-production’ use case Phase systems into ‘live’ DR use with relative ease

Choose your success objectives for a cloud project ‘out of band’

Disaster recovery

Page 4: Journey Through the Cloud: Disaster Recovery

Why AWS for disaster recovery? AWS services that can be employed Common DR architectures Customer example Where to go next

Agenda

Page 5: Journey Through the Cloud: Disaster Recovery

Why AWS for Disaster Recovery?

Page 6: Journey Through the Cloud: Disaster Recovery

AWS is used in a variety of ways…

AWS & Disaster Recovery

Backup and disaster recovery system for its remote sales offices

Fast, secure and cost effective backup and DR for Oracle Apps

Disaster recovery solution to backup and store critical medical image data

DR and testing environment reducing IT overhead and increasing availability

Page 7: Journey Through the Cloud: Disaster Recovery

You might be able to:

Business & technical drivers

Reduce costs

Slash DR budgets by up to 50%

Reduce on-premise

Eliminate 30%+ of on-premise physical equipment

Consolidate sites

Eliminate the need to run a secondary site

Remove aging technologies

Eliminate tape for backup and

archive

Page 8: Journey Through the Cloud: Disaster Recovery

DR is part of a wider set of policies and controls…

DR & business continuity

High availability Backup Disaster recovery

Keep your applications

running 24x7

Make sure your data is safe Get your applications and

data back after a major

disaster

Page 9: Journey Through the Cloud: Disaster Recovery

DR is part of a wider set of policies and controls…

DR & business continuity

It’s not an all or nothing thing Choose what needs to failover and what does not

Some things more important than others Some things will still be working

High availability Backup Disaster recovery

Keep your applications

running 24x7

Make sure your data is safe Get your applications and

data back after a major

disaster

Page 10: Journey Through the Cloud: Disaster Recovery

Each set of IT assets will have different requirements…

DR & business continuity

Recovery Time

Objective (RTO)

How quickly you need this asset to be

recovered?

e.g. 1min? 15min? 1hr? 4hrs? 1day?

Recovery Point

Objective (RPO)

How ‘fresh’ the recovery must be for the

asset?

e.g. zero data loss, 15mins out of date?

Page 11: Journey Through the Cloud: Disaster Recovery

Assets will sit on a spectrum of technical complexity…

DR & business continuity

Rebuild when required from offsite backup

Run hot-hot configuration with

auto-failover

Page 12: Journey Through the Cloud: Disaster Recovery

The fundamental economic model…

Utility, on-demand datacenter

Primary Site Routers

Firewalls

Network

Application Licenses

Operating Systems

Hypervisor

Servers

SAN

Primary Storage

Backup

Archive

Secondary Site Routers

Firewalls

Network

Application Licenses

Operating Systems

Hypervisor

Servers

SAN

Primary Storage

Backup

Archive

Page 13: Journey Through the Cloud: Disaster Recovery

The fundamental economic model…

Utility, on-demand datacenter

Primary Site Routers

Firewalls

Network

Application Licenses

Operating Systems

Hypervisor

Servers

SAN

Primary Storage

Backup

Archive

AWS Routers

Firewalls

Network

Application Licenses

Operating Systems

Hypervisor

Servers

SAN

Snapshot Storage

Backup

Archive

Page 14: Journey Through the Cloud: Disaster Recovery

The fundamental economic model…

Utility, on-demand datacenter

Primary Site Routers

Firewalls

Network

Application Licenses

Operating Systems

Hypervisor

Servers

SAN

Primary Storage

Backup

Archive

AWS Routers

Firewalls

Network

Application Licenses

Operating Systems

Hypervisor

Servers

SAN

Snapshot Storage

Backup

Archive

Secondary site costs

Page 15: Journey Through the Cloud: Disaster Recovery

Availability Zone

AWS is global Region

Page 16: Journey Through the Cloud: Disaster Recovery

Certifications

SOC 2

ISO 27001

PCI DSS for EC2, S3, EBS, VPC, RDS, ELB, IAM

FISMA Moderate Compliant Controls

HIPAA & ITAR Compliant Architecture

Physical Security

Datacenters in nondescript facilities

Physical access strictly controlled

Must pass two-factor authentication at least twice

for floor access

Physical access logged and audited

HW, SW, Network

Systematic change management

Phased updates deployment

Safe storage decommission

Automated monitoring and self-audit

Advanced network protection

Built to enterprise security standards

http://aws.amazon.com/security

Page 17: Journey Through the Cloud: Disaster Recovery

AWS services that can be employed

Page 18: Journey Through the Cloud: Disaster Recovery

Amazon

Simple

Storage

Service (S3)

AWS Import/Export

AWS Storage

Gateway Service

AWS Direct

Connect

Amazon Virtual

Private Cloud

(VPC)

Amazon

Route 53

Amazon Elastic

Compute Cloud

(EC2)

Amazon Relational

Database Service (RDS)

Amazon

Elastic Block

Storage (EBS)

Object storage & transfer services

Networking services Foundation services

Page 19: Journey Through the Cloud: Disaster Recovery

S3 and Elastic Block Store

AWS storage is ideal for DR

Simple Storage Service Highly scalable object storage

1 byte to 5TB in size

99.999999999% durability

Elastic Block Store High performance block storage device

1GB to 1TB in size

Mount as drives to instances with

snapshot/cloning functionalities

Page 20: Journey Through the Cloud: Disaster Recovery

Direct Connect Dedicated connection between your IT

infrastructure and the AWS datacenters

Extend your network infrastructure and

VLANs into AWS

VPN Connection A Hardware VPN connection connects

amazon environment to your datacenter

Internet Protocol security (IPsec) VPN

connection

Commonly used hardware supported

Virtual Private Cloud Private, isolated section of the AWS Cloud

Launch resources in a virtual network that you

define

complete control over your virtual networking

environment

Internet

Internet

Networking options

Page 21: Journey Through the Cloud: Disaster Recovery

Common DR architectures

Page 22: Journey Through the Cloud: Disaster Recovery

4 main patterns

Common DR architectures

Backup & Restore Pilot light

Warm standby in AWS

Multi-site solution in AWS & on-

premise

Page 23: Journey Through the Cloud: Disaster Recovery

We’ll focus on 2 of them…

Common DR architectures

Backup & Restore Pilot light

Warm standby in AWS

Multi-site solution in AWS & on-

premise

Page 24: Journey Through the Cloud: Disaster Recovery

Let’s start with Backup & Restore

Common DR architectures

Backup & Restore Pilot light

Warm standby in AWS

Multi-site solution in AWS & on-

premise

Page 25: Journey Through the Cloud: Disaster Recovery

Advantages to starting a journey with this pattern

Backup & Restore pattern

Simple to get started

Easy starting point for exploring the

AWS cloud

Low technical barrier to entry

Focus on incorporating cloud into your

DR strategy, not on complex technical

issues related to hot-hot systems

Cost effective

Very high levels of data durability at

low price

Cost of storing snapshots in S3

Archiving possibilities beyond tape

using Glacier

Page 26: Journey Through the Cloud: Disaster Recovery

The preparation process…

Backup & Restore pattern

Take backups of

current systems

Store backups

in S3

Move to long term

archive in Glacier

Page 27: Journey Through the Cloud: Disaster Recovery

The process…

Backup & Restore pattern

Take backups of

current systems

Store backups

in S3

Detail how you will restoring from backup or

recover from archive

Move to long term

archive in Glacier

Page 28: Journey Through the Cloud: Disaster Recovery

Glacier Long term durable archive

Page 29: Journey Through the Cloud: Disaster Recovery

Long term Glacier archive

Durable Designed for 99.999999999%

durability of archives

Cost effective Write-once, read-never. Cost effective for long term storage. Pay for accessing data

Page 30: Journey Through the Cloud: Disaster Recovery

Logs accessible from S3

time

Exp

iry

Page 31: Journey Through the Cloud: Disaster Recovery

Logs ✗ accessible from S3

Objects expire and are deleted

time

Exp

iry

Page 32: Journey Through the Cloud: Disaster Recovery

Logs

Txns

✗ accessible from S3

Objects expire and are deleted

time

accessible from S3

Object transition to

Glacier invoked

Exp

iry

Tran

siti

on

Page 33: Journey Through the Cloud: Disaster Recovery

Logs

Txns

✗ accessible from S3

Objects expire and are deleted

time

accessible from S3

Object transition to

Glacier invoked

Restoration of object requested

for x hrs

Exp

iry

Tran

siti

on

Page 34: Journey Through the Cloud: Disaster Recovery

Logs

Txns

✗ accessible from S3

Objects expire and are deleted

time

accessible from S3

Object transition to

Glacier invoked

Restoration of object requested

for x hrs

3-5hrs

Object held in S3 RRS for x hrs

Exp

iry

Tran

siti

on

Page 35: Journey Through the Cloud: Disaster Recovery

3-5 hour retrieval time We assume you won’t access often

Page 36: Journey Through the Cloud: Disaster Recovery

Push backups to AWS

Page 37: Journey Through the Cloud: Disaster Recovery

Store AMIs for servers

Page 38: Journey Through the Cloud: Disaster Recovery

Recover servers during DR

Page 39: Journey Through the Cloud: Disaster Recovery

Corporate Data

Center

© 2012 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified or distributed in whole or in part without the express consent of Amazon.com, Inc.

Elastic Data

Center

AWS Storage

Gateway

AWS Storage

Gateway installed

on-premise to

synchronize local

volumes

Page 40: Journey Through the Cloud: Disaster Recovery

Corporate Data

Center

© 2012 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified or distributed in whole or in part without the express consent of Amazon.com, Inc.

Elastic Data

Center

AWS Storage

Gateway

Local volumes

created under

Storage

Gateway

Page 41: Journey Through the Cloud: Disaster Recovery

Corporate Data

Center

© 2012 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified or distributed in whole or in part without the express consent of Amazon.com, Inc.

Elastic Data

Center

AWS Storage

Gateway

Usable with on-

premise

servers via

iSCSI interface

Page 42: Journey Through the Cloud: Disaster Recovery

Corporate Data

Center

© 2012 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified or distributed in whole or in part without the express consent of Amazon.com, Inc.

Elastic Data

Center

AWS Storage

Gateway

Primary on-

premise volumes

snapshotted,

compressed and

stored in Amazon

S3

Page 43: Journey Through the Cloud: Disaster Recovery

Corporate Data

Center

© 2012 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified or distributed in whole or in part without the express consent of Amazon.com, Inc.

Elastic Data

Center

AWS Storage

Gateway

Page 44: Journey Through the Cloud: Disaster Recovery

Corporate Data

Center

© 2012 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified or distributed in whole or in part without the express consent of Amazon.com, Inc.

Elastic Data

Center

AWS Storage

Gateway

Snapshot

pulled from S3

to restore local

volume

Page 45: Journey Through the Cloud: Disaster Recovery

Corporate Data

Center

© 2012 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified or distributed in whole or in part without the express consent of Amazon.com, Inc.

Elastic Data

Center

AWS Storage

Gateway

Snapshot

pulled from S3

to create cloud

instance

backed by

Volume

Page 46: Journey Through the Cloud: Disaster Recovery

Gateway stored

volumes

Data stored locally

Asynchronous backup

EBS snapshots

iSCSI local interface

Up to 1TB volumes

Gateway cached

volumes

Data stored in S3

Recently read data cached

Low latency

iSCSI local interface

Up to 32TB volumes

Page 47: Journey Through the Cloud: Disaster Recovery

AWS Storage appliances and backup management

Page 48: Journey Through the Cloud: Disaster Recovery

RDS and Oracle RMAN

Page 49: Journey Through the Cloud: Disaster Recovery

Let’s look at the Pilot Light pattern…

Common DR architectures

Backup & Restore Pilot light

Warm standby in AWS

Multi-site solution in AWS & on-

premise

Page 50: Journey Through the Cloud: Disaster Recovery

Moving along the DR spectrum…

Pilot light architecture

Build resources around

replicated dataset

Keep ‘pilot light’ on by replicating core

databases

Build AWS resources around dataset and

leave in stopped state

Page 51: Journey Through the Cloud: Disaster Recovery

Moving along the DR spectrum…

Pilot light architecture

Build resources around

replicated dataset

Keep ‘pilot light’ on by replicating core

databases

Build AWS resources around dataset and

leave in stopped state

Scale resources in AWS in

response to a DR event

Start up pool of resources in AWS when

events dictate

Match current production capacity through

auto-scaling polcies

Page 52: Journey Through the Cloud: Disaster Recovery

Moving along the DR spectrum…

Pilot light architecture

Build resources around

replicated dataset

Keep ‘pilot light’ on by replicating core

databases

Build AWS resources around dataset and

leave in stopped state

Scale resources in AWS in

response to a DR event

Start up pool of resources in AWS when

events dictate

Match current production capacity through

auto-scaling policies

Switch-over to system in AWS

Page 53: Journey Through the Cloud: Disaster Recovery

Pilot light

Stopped instances

Page 54: Journey Through the Cloud: Disaster Recovery

Pilot light

Running instances

Page 55: Journey Through the Cloud: Disaster Recovery

Customer example

Page 56: Journey Through the Cloud: Disaster Recovery

EU region DR site for range of business applications

All running in a Virtual Private Cloud (VPC)

DR provision for applications dependent on Oracle and SQL Server databases

Includes DR for Active Directory and Windows file shares

Page 57: Journey Through the Cloud: Disaster Recovery

VPC Subnet B

Region

Availability Zone

Client-to-site VPN Site-to-site VPN

S3 Bucketswith Objects

Bastion Host

Internet

On-premiseData Centre A

RemoteDesktops

AWS Direct Connect

On-premiseData Centre B

VPC Subnet D VPC Subnet F

Databases

VPC Subnet E

Applications

VPC Subnet A

SmartSentinel

VPC Subnet G

FileServers

VPC Subnet C

ActiveDirectory

Proxy Server

Page 58: Journey Through the Cloud: Disaster Recovery

VPC Subnet B

Region

Availability Zone

Client-to-site VPN Site-to-site VPN

S3 Bucketswith Objects

Bastion Host

Internet

On-premiseData Centre A

RemoteDesktops

AWS Direct Connect

On-premiseData Centre B

VPC Subnet D VPC Subnet F

Databases

VPC Subnet E

Applications

VPC Subnet A

SmartSentinel

VPC Subnet G

FileServers

VPC Subnet C

ActiveDirectory

Proxy Server

Dual route connectivity

Page 59: Journey Through the Cloud: Disaster Recovery

VPC Subnet B

Region

Availability Zone

Client-to-site VPN Site-to-site VPN

S3 Bucketswith Objects

Bastion Host

Internet

On-premiseData Centre A

RemoteDesktops

AWS Direct Connect

On-premiseData Centre B

VPC Subnet D VPC Subnet F

Databases

VPC Subnet E

Applications

VPC Subnet A

SmartSentinel

VPC Subnet G

FileServers

VPC Subnet C

ActiveDirectory

Proxy Server

Active Directory Replication

Page 60: Journey Through the Cloud: Disaster Recovery

VPC Subnet B

Region

Availability Zone

Client-to-site VPN Site-to-site VPN

S3 Bucketswith Objects

Bastion Host

Internet

On-premiseData Centre A

RemoteDesktops

AWS Direct Connect

On-premiseData Centre B

VPC Subnet D VPC Subnet F

Databases

VPC Subnet E

Applications

VPC Subnet A

SmartSentinel

VPC Subnet G

FileServers

VPC Subnet C

ActiveDirectory

Proxy Server

Bastion Host

Page 61: Journey Through the Cloud: Disaster Recovery

VPC Subnet B

Region

Availability Zone

Client-to-site VPN Site-to-site VPN

S3 Bucketswith Objects

Bastion Host

Internet

On-premiseData Centre A

RemoteDesktops

AWS Direct Connect

On-premiseData Centre B

VPC Subnet D VPC Subnet F

Databases

VPC Subnet E

Applications

VPC Subnet A

SmartSentinel

VPC Subnet G

FileServers

VPC Subnet C

ActiveDirectory

Proxy Server

Database replication

Page 62: Journey Through the Cloud: Disaster Recovery

VPC Subnet B

Region

Availability Zone

Client-to-site VPN Site-to-site VPN

S3 Bucketswith Objects

Bastion Host

Internet

On-premiseData Centre A

RemoteDesktops

AWS Direct Connect

On-premiseData Centre B

VPC Subnet D VPC Subnet F

Databases

VPC Subnet E

Applications

VPC Subnet A

SmartSentinel

VPC Subnet G

FileServers

VPC Subnet C

ActiveDirectory

Proxy Server

Application images

Page 63: Journey Through the Cloud: Disaster Recovery

VPC Subnet B

Region

Availability Zone

Client-to-site VPN Site-to-site VPN

S3 Bucketswith Objects

Bastion Host

Internet

On-premiseData Centre A

RemoteDesktops

AWS Direct Connect

On-premiseData Centre B

VPC Subnet D VPC Subnet F

Databases

VPC Subnet E

Applications

VPC Subnet A

SmartSentinel

VPC Subnet G

FileServers

VPC Subnet C

ActiveDirectory

Proxy Server

Desktop environments

Page 64: Journey Through the Cloud: Disaster Recovery

VPC Subnet B

Region

Availability Zone

Client-to-site VPN Site-to-site VPN

S3 Bucketswith Objects

Bastion Host

Internet

On-premiseData Centre A

RemoteDesktops

AWS Direct Connect

On-premiseData Centre B

VPC Subnet D VPC Subnet F

Databases

VPC Subnet E

Applications

VPC Subnet A

SmartSentinel

VPC Subnet G

FileServers

VPC Subnet C

ActiveDirectory

Proxy Server

Durable data backups

Page 65: Journey Through the Cloud: Disaster Recovery

Where to go next

Page 66: Journey Through the Cloud: Disaster Recovery

Technology and services organisations

Rich partner ecosystem

Page 67: Journey Through the Cloud: Disaster Recovery

http://aws.amazon.com/backup-storage

http://aws.typepad.com

http://aws.amazon.com/whitepapers

Page 68: Journey Through the Cloud: Disaster Recovery

Summary

Page 69: Journey Through the Cloud: Disaster Recovery

The cloud makes backup and recovery easy

You can get started for pennies per month

The cloud will scale to accommodate all of your data

You retain visibility and control of your information

Page 70: Journey Through the Cloud: Disaster Recovery

aws.amazon.com