aws sydney summit 2013 - technical lessons on how to do dr in the cloud

54
Dean Samuels Technical lessons on how to do Backup and Disaster Recovery in the Cloud Solution Architect, Amazon Web Services

Upload: amazon-web-services

Post on 15-Jan-2015

734 views

Category:

Technology


3 download

DESCRIPTION

Session 3, Presentation 2 from the AWS Sydney Summit

TRANSCRIPT

Page 1: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Dean Samuels

Technical lessons on how to doBackup and Disaster Recovery in the Cloud

Solution Architect, Amazon Web Services

Page 2: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Agenda

• Prologue The need for Backup and DR

• Customer Story Ausenco Limited

• Lessons Backup and DR with AWS

• Conclusion Q&A

Page 3: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

The need for Backup & DR

Page 4: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

• Archiving is the process of moving data that is no longer actively used to a separate data storage device for long-term retention. Data archives are indexed and have search capabilities so that files and parts of files can be easily located and retrieved.

• A backup or the process of backing up is making copies of data which may be used to restore the original after a data loss event. The primary purpose is to recover data after its loss, be it by data deletion or corruption. The secondary purpose of backups is to recover data from an earlier time.

• Disaster recovery (DR) is the process, policies and procedures related to preparing for recovery or continuation of technology infrastructure critical to an organization after a natural or human-induced disaster.

Some Definitions

Page 5: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Where is this…. Brisbane, QLD

Source: http://www.abc.net.au/news/specials/qld-floods/

Page 6: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Where is this…. Brisbane, QLD - January 13th 2011

Ausenco

Source: http://www.abc.net.au/news/specials/qld-floods/

Page 7: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Milton, QLD (Lang Park)

Source: http://www.abc.net.au/news/specials/qld-floods/

Page 8: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Milton, QLD (Lang Park) – January 13th 2011

Source: http://www.abc.net.au/news/specials/qld-floods/

Page 9: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Coronation Drive, Milton, QLD

Source: http://www.abc.net.au/news/specials/qld-floods/

Page 10: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Coronation Drive, Milton, QLD – January 13th 2011

Source: http://www.abc.net.au/news/specials/qld-floods/

Page 11: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Backup, Disaster Recovery and Data Tiering with AWS

Mike Latsky

Director, IT Operations

Ausenco

Page 12: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

About Ausenco

We provide:Engineering, Procurement, Construction Management andOperational Servicesto the energy, resource, and process industries

Presentation title | DD Month Year | 12

Page 13: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

IT Perspective• Engineers generate large 2D and 3D

CAD files

• CAD performance demands close network proximity

• Drawings are Ausenco’s deliverable and MUST be protected

Page 14: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

• ASX200 company

• Head office in Brisbane

• 29 Offices in 19 countries

• 3500 Employees

Ausenco Offices

Page 15: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Our IT Environment

• Completely virtualised

• Windows

• ERP - Oracle SaaS

• Email, Lync, SharePoint – MS Office 365 SaaS

• Onsite storage

• 2 x primary data centres

• 1 x secondary data centre

Page 16: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Our Challenge

• Backup – Unreliable

• Disaster Recovery– Sites were exposed

• Local Disk Storage– At capacity

Possible Solutions

An Enterprise Backup Solution?

Secondary Data Centres?

More Disk Storage?

Page 17: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Our initial approach

• Large Integrators

• Leading backup providers

• Leading archiving providers

• Hardware providers

• Data centre hosting providers

• Leading cloud providers

Page 18: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Paradigm Shift

• Why use traditional technologies?

• How can we better leverage the AWS cloud?

• How do we do more with less?

• Address the complete data lifecycle

• Flexible, Scalable, Cost Effective

Disaster Recovery Archiving Backup

Page 19: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Solution

Page 20: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

The Result

• Low effort• Incremental rollout• Cloud migration

• Efficient, effective usage of bandwidth• Application/database servers• Regionalised deployment

Page 21: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Our Roadmap

• Wider deployment

• Decommission costly secondary data centre

• Migration of production systems to AWS

Page 22: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Thank-you

Page 23: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Backup and DR with AWS

Backup & DR Lessons

Page 24: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Business and Technical Drivers….

Reduce costs

Slash DR budgets by up to 50%

Consolidate sites

Eliminate the need to run a secondary site

Reduce on-premise

Eliminate 30%+ of on-premise physical equipment

Remove aging technologies

Eliminate tape for backup and archive

Page 25: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

The fundamental economic model…

Utility, on-demand datacenter

Primary SiteRouters

Firewalls

Network

Application Licenses

Operating Systems

Hypervisor

Servers

SAN

Primary Storage

Backup

Archive

AWSRouters

Firewalls

Network

Application Licenses

Operating Systems

Hypervisor

Servers

SAN

Snapshot Storage

Backup

Archive

Secondary site costs

Page 26: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Backup and DR with AWS

Backup Lessons

Page 27: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Backup Lessons – My backup should be accessible

Source: http://www.abc.net.au/news/specials/qld-floods/

a.k.a. the pain of physical data transfer

Page 28: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

AWS Direct ConnectDedicated bandwidth between

your site and AWS

Amazon Storage GatewayShrink-wrapped gateway for volume

synchronization

AWS Import/ExportPhysical transfer of media into and

out of AWS

Getting data into the cloud

Page 29: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Simple Storage ServiceHighly scalable object storage

1 byte to 5TB in size

99.999999999% durability

Elastic Block StoreHigh performance block storage device

1GB to 1TB in size

Mount as drives to instances with

snapshot/cloning functionalities

GlacierLong term object archive

Extremely low cost per gigabyte

99.999999999% durability

Storage Options

Very fast ‘instance’ disks

Slow, rare accessFast web object storage

Page 30: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Cost ($/GB/month) Performance Durability

Amazon S3 0.095

Amazon Glacier 0.01

AWS Storage Gateway 0.095(+ 125/GW)

Amazon EBS 0.10

Amazon EBS (PIOPS) 0.125

*AWS Oregon pricingStorage Options

Page 31: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Snapshot of local volumes

Restoration from snapshots

Backup Lessons – My backup should be accessible

Page 32: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

VPN

Public / AWS Direct Connect

AWS Import/Export

z

Backup Lessons – My backup should be accessible

Page 33: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

GW-Cached volumes

GW-stored volumes

“Cool” storage

“Cold”

Backup Lessons – My backup should be accessible

Page 34: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Backup Lessons – My backup should be accessible

Page 35: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

• “Infinite” scale with Amazon S3 and Amazon Glacier

• Scale to multiple regions

• Seamless

• No need to provision

• Cost tiers (cheaper at scale)

Backup Lessons – My backup should be able to scale

Page 36: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

• SSL Endpoints (Amazon S3 and Amazon Glacier)

• Signed API calls

• Store encrypted files

• Server-side encryption

• Multiple copies across different data centers

• Local/cloud with AWS Storage Gateway

Backup Lessons – My backup should be safe

Page 37: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

• Easy to integrate within AWS or Hybrid

• AWS Storage Gateway: Run services on Amazon EC2 (DR)

• Clear costs

• Reduced costs

• I decide redundancy/availability in relation to costs

Backup Lessons – My backup should work with a DR policy

Page 38: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Backup Lessons – My backup should work with a DR policy

Page 39: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

• Clear ownership

• Permissions with IAM: Users, groups roles

• Logs

• AWS support

Backup Lessons – Someone should care about it

Page 40: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

1. My backup should be accessible

2. My backup should be able to scale

3. My backup should be safe

4. My backup should work with a DR policy

5. Someone should care about it

Backup Lessons

Page 41: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Backup and DR with AWS

DR Lessons

Page 42: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

DR is part of a wider set of policies and controls…

DR & business continuity

It’s not an all or nothing thingChoose what needs to failover and what does not

Some things more important than othersSome things will still be working

High availability Backup Disaster recovery

Keep your applications

running 24x7

Make sure your data is safe Get your applications and

data back after a major

disaster

Page 43: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Each set of IT assets will have different requirements…

DR & business continuity

Recovery Time Objective (RTO)

How quickly you need this asset to be

recovered?

e.g. 1min? 15min? 1hr? 4hrs? 1day?

Recovery Point Objective (RPO)

How ‘fresh’ the recovery must be for the

asset?

e.g. zero data loss, 15mins out of date?

Page 44: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Assets will sit on a spectrum of technical complexity…

DR & business continuity

Rebuild when required from offsite backup

Run hot-hot configuration with

auto-failover

Custo

mer

facin

g tra

nsac

tiona

l

web ap

plica

tion

Inte

rnal

com

mun

icatio

ns

syste

m

Daily

sche

duled

proc

esse

s & sy

stem

s

Back

end

repo

rting

syste

m

& dat

abas

e

Page 45: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

DR Lessons – You NEED a DR plan in place

DR with High Availability

Page 46: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

App DR with Standby

DR Lessons – You NEED a DR plan in place

Page 47: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

DR Lessons – Testing your DR

• Dev/test in the cloud is super easy

• Spin up capacity only for the test

• Regularly test your DR

• Cost is minimal

• What about data transfer speed?

s3cmd ls --recursive

s3://datasets.elasticmapreduce/ngra

ms/books/ | awk '{print $4;

sub(/s3:\/\/datasets.elasticmapredu

ce/, "/array", $4); print $4}' |

parallel -j0 -N2 --progress

/usr/bin/s3cmd --no-progress get

{1} {2}Copying 2.4 TB down from 48 hours to 9 hours (5x faster)

Page 48: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

DR Lessons – Reducing Costs

• Dev/test in the cloud is super easy

• Spin up capacity only for the test

• Regularly test your DR

• Cost is minimal

• What about data transfer speed?

Page 49: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

DR Lessons – You can have different DR solutions

• Easy to integrate existing vendors with DR on AWS

• Approach: One vendor/hybrid/multiple vendors

• One region/multi-regions (if you need geo-diversity)

• Different DR Architectures

Backup & Restore Pilot light

Warm standby in AWS

Multi-site solution in AWS & on-

premise

Page 50: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Pilot light

Stopped instances

Page 51: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Pilot light

Running instances

Page 52: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

1. You NEED a DR plan in place

2. Testing your DR

3. Reducing costs

4. You can have different DR solutions

DR Lessons

Page 53: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

aws.amazon.comget started on the free tier

Page 54: AWS Sydney Summit 2013 - Technical Lessons on How to do DR in the Cloud

Technical lessons on how to doBackup and Disaster Recovery in the Cloud