brainshare 2014 presentation final
TRANSCRIPT
CAS8247
Customer Experience: PlateSpin Forge Disaster Recovery November 2014
Tom Lockhart IT Systems Manager
Hastings & Prince Edward Counties Health Unit
Mike Robinson Senior Solutions Marketing Manager
NetIQ #BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 2
Hastings & Prince Edward Counties Health Unit
• 6 offices
• 175,000 residents served
• Health services include:
– Immunization
– Restaurant inspections
– Smoking cessation
– Dietician consultations
– and more
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 3
Backup and Disaster Recovery
What was working…
Daily file & database
backups
Recover deleted files
Archive data
What was not…
× Recovering whole servers
× Restoring servers took
too long
× Unacceptable RPO and
RTO
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 4
What is a Server Workload?
Server
Data
Applications
Operating System
A workload is the operating system, applications & middleware, and data that reside on a physical server or virtual host.
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 5
The 3 Phases of IT Service Recovery
IT Service Recovery means: 1. Backing up (replicating) entire server workloads (the
contents of a server, including the operating system, applications & middleware, and data),
2. Recovering workloads during an outage, and
3. Restoring workloads to their original production locations after the outage.
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 6
Key IT Service Recovery Concepts
RTO: Recovery Time Objective
– The target maximum allowable time to
recover from an outage.
– An RTO of 4 hours means systems have to
be back up and operational no more than
4 hours after an outage.
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 7
Key IT Service Recovery Concepts
RPO: Recovery Point Objective
– A measure of maximum acceptable data loss
in terms of time (minutes, hours, days).
– An RPO of 4 hours means that the most
recent backup has to be no more than
4 hours old at the time of an outage.
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 8
DIY Disaster Recovery
Requirements
• Putting together server +
storage, software takes
time and expertise
• Even if you have the
expertise, who has the
time?
• Licensing
Impacts
• Time required to build
and test
• Maintain and support
multiple systems
• Multiple vendors mean
potential support issues
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 9
Key DR Requirements
Protect Windows and Linux workloads
Protect physical and virtual workloads
Fail back to physical servers or virtual machines
Tiered RPO & RTO for different workloads
Easy to use, comprehensive test failover & failback
On-site & off-premises deployment options
Simple user management
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 10
PlateSpin Forge
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 11
PlateSpin Forge Appliances
Features
• All-in-one appliance
• Protect 40+ workloads
• Protect physical & virtual,
Windows & Linux
workloads
• “Plug in & Protect”
• Single pane of glass
management
Benefits
• No virtualization expertise
needed
• Single solution for all
workloads
• Lower RPO & RTO
• Rapid failover
• Flexible failback to
physical or virtual servers
• Easy testing
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 12
PlateSpin Forge Key Advantages
All-in-one appliance
Integrated storage and compute
Simple expandable licensing
Portable; can easily move off-site
Single SKU to buy
Single vendor for support
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 13
PlateSpin Forge Lineup
700 Series
• 10+, 25+, 40+ workload
models
• Two 8-core HT CPUs
• 128 GB RAM
• ~20 TB usable storage
• Starts at $54,000 with
standard maintenance
500 Series
• 10+ and 25 workload
models
• Two quad-core CPUs
• 32 GB RAM
• ~4.5 TB usable storage
• Starts at $44,200 with
standard maintenance
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 14
Setting Up an Appliance
Simple, easy setup
Setup poster & quick-
start guide
Unbox, plug it in
Started protecting
workloads the same day
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 15
Sandbox Testing
• Isolated from production
network
• No impact to production
workloads
• Easy non-intrusive testing
• Promotes more frequent
testing
• Automatically deleted
when testing is complete
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 16
Internal web server Repaired Web server
PlateSpin Forge in Theory
Rapid failover to virtual recovery
workload
Failback to dissimilar hardware
New web server
Virtual recovery (remote site)
Failback with sync to repaired hardware
OR
Block-based replication
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 17
PlateSpin Forge in Practice
• Decision time:
troubleshoot vs failover
• Time spent trouble-
shooting before failover
extends downtime
• Higher confidence
in PlateSpin Forge faster
decision to failover; less
production downtime
• PlateSpin Forge failover
is consistently 20-30
minutes
Time spent troubleshooting
after failover is much less
stressful!
Troubleshoot
Failover
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 18
Disaster Recovery Technologies
Availability %
90 “one nine”
95
99 “two nines”
99.9 “three nines”
99.99 “four nines”
99.999 “five nines”
Downtime per Year
36.5 days
18.25 days
3.65 days
8.76 hours
52.56 minutes
5.26 minutes
Downtime per Month
72 hours
36 hours
7.2 hours
43.8 minutes
4.32 minutes
25.9 seconds
Typical RTO/RPO
12 – 24 hours
15 – 60 minutes
<5 minutes
Cost
Solution
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation and its affiliates. All Rights Reserved. 19
Next Steps…
• Determine your tolerance
for downtime & data loss
• Establish recovery metrics
– Categorize server workloads into
tiers by RPO, RTO
• Match organizational
needs to technologies
– You can and will use multiple
technologies
– Balance budget with needs
#BrainShare #NetIQ8247
© 2014 NetIQ Corporation. All rights reserved. 21
Thank you.
Don’t miss the Identity-Powered Experience in IT Central.
This document could include technical inaccuracies or typographical errors. Changes are
periodically made to the information herein. These changes may be incorporated in new
editions of this document. NetIQ Corporation may make improvements in or changes to the
software described in this document at any time.
Copyright © 2014 NetIQ Corporation and its affiliates. All Rights Reserved.
ActiveAudit, ActiveView, Aegis, AppManager, Change Administrator, Change Guardian, Compliance Suite, the
cube logo design, Directory and Resource Administrator, Directory Security Administrator, Domain Migration
Administrator, Exchange Administrator, File Security Administrator, Group Policy Administrator, Group Policy
Guardian, Group Policy Suite, IntelliPolicy, Knowledge Scripts, NetConnect, NetIQ, the NetIQ logo, PSAudit,
PSDetect, PSPasswordManager, PSSecure, Secure Configuration Manager, Security Administration Suite,
Security Manager, Server Consolidator, VigilEnt, and Vivinet are trademarks or registered trademarks of NetIQ
Corporation or its subsidiaries in the United States.