protect enterprise data, achieve long-term data retention...the media server provides additional...

15
Technical white paper Protect enterprise data, achieve long-term data retention HP StoreOnce Catalyst and Symantec NetBackup OpenStorage Table of contents Introduction 2 Technology overview 3 HP StoreOnce Backup systems—key features and benefits 3 HP StoreOnce Catalyst and Symantec NetBackup OpenStorage (OST) 3 HP StoreOnce Catalyst—seamless data movement across the enterprise 4 HP StoreOnce Backup systems in small to large data centers 4 Advantages of using HP StoreOnce B6200 Backup systems with Symantec NetBackup 4 HP StoreOnce Backup systems for enterprise data backup using Symantec NetBackup 5 Symantec NetBackup infrastructure components 5 Capacity planning 6 Windows vs. Linux deduplication ratios with HP StoreOnce Catalyst 8 Symantec NetBackup schedule type characteristics with an HP StoreOnce B6200 Backup system 9 Symantec NetBackup block size effect on throughput and deduplication ratio 10 DR with local HP StoreOnce Backup system and remote replication 12 Symantec NetBackup replication using HP StoreOnce Catalyst 12 Recovery scenarios 13 Recommendations 14 Conclusion 14 Useful links 15 For more information 15

Upload: others

Post on 24-Apr-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Technical white paper

Protect enterprise data, achieve long-term data retention HP StoreOnce Catalyst and Symantec NetBackup OpenStorage

Table of contents

Introduction 2

Technology overview 3 HP StoreOnce Backup systems—key features and benefits 3 HP StoreOnce Catalyst and Symantec NetBackup OpenStorage (OST) 3

HP StoreOnce Catalyst—seamless data movement across the enterprise 4

HP StoreOnce Backup systems in small to large data centers 4 Advantages of using HP StoreOnce B6200 Backup systems with Symantec NetBackup 4

HP StoreOnce Backup systems for enterprise data backup using Symantec NetBackup 5

Symantec NetBackup infrastructure components 5 Capacity planning 6 Windows vs. Linux deduplication ratios with HP StoreOnce Catalyst 8 Symantec NetBackup schedule type characteristics with an HP StoreOnce B6200 Backup system 9 Symantec NetBackup block size effect on throughput and deduplication ratio 10

DR with local HP StoreOnce Backup system and remote replication 12 Symantec NetBackup replication using HP StoreOnce Catalyst 12 Recovery scenarios 13

Recommendations 14

Conclusion 14

Useful links 15

For more information 15

2

Introduction In today’s business environment, customers rely on the most efficient, high-performing, and reliable backup systems to protect critical business information. Customers need to protect increasing levels of data while keeping costs under control. HP StoreOnce Backup systems provide a disk-based data protection platform while addressing data growth by applying HP StoreOnce deduplication software for efficient, long-term data retention.

The HP StoreOnce B6200 Backup system, the latest deduplication appliance in the HP StoreOnce product line, provides a unique combination of features, including industry-leading performance (up to 100 TB per hour), high availability, and high capacity making the HP StoreOnce B6200 Backup system the industry leader in the enterprise deduplication sector.

HP StoreOnce Catalyst software was developed to dramatically improve the performance, function, and integration of backup applications such as Symantec NetBackup. HP StoreOnce Catalyst delivers deduplication on an appliance server, media server, or dedicated appliance. Since it uses the same deduplication algorithm globally, data can be moved between platforms without rehydration. HP StoreOnce Catalyst allows better utilization of advanced, disk-based storage solutions while increasing efficiency and performance.

This document describes the benefits of using HP StoreOnce B6200 Backup systems combined with HP StoreOnce Catalyst software and Symantec NetBackup to back up important enterprise data. This document also recommends backup and recovery implementations.

The following are key recommendations for backing up data to an HP StoreOnce B6200 Backup system utilizing HP StoreOnce Catalyst software:

• Increased backup speed: Use low bandwidth as the Primary Transfer Policy on the HP StoreOnce B6200 Backup Catalyst store to improve throughput performance.

• Improved backup storage utilization: Back up one server at a time (sequentially) for quickly rising deduplication ratios that are maintained over time.

• Trade-off between backup impact and ease of recovery: Configure a weekly full with daily incremental or initial full with daily incremental backup schedule to reduce the amount of end-to-end data and decrease the time required to run daily backups. HP StoreOnce Backup systems data deduplication will further reduce the end-to-end data size.

• Efficient and cost-effective movement of backup data offsite: Use the HP StoreOnce remote replication feature to seamlessly replicate all servers to an appliance in a remote facility for simpler recovery in the event of a disaster.

HP StoreOnce Backup systems are a disk-based backup system that delivers leading price-performance and deduplicates enterprise backup data. The HP StoreOnce Backup system can be used to automate and consolidate multiple backups onto a single, rack-mountable device while improving reliability by reducing errors caused by media handling. For business environments with remote offices, or a disaster recovery (DR) site, the HP StoreOnce Backup system can be used to replicate data to a central data center or remote facility.

HP StoreOnce Backup systems are ideal for mission-critical application backup data for small to large data centers running key business applications. Proper backup configurations with a data protection application to the HP StoreOnce Backup system provide the best backup throughput performance and data deduplication ratios. HP StoreOnce Backup systems integrate seamlessly into current IT environments and offer the flexibility of VTL and NAS targets, as well as Catalyst stores.

3

Technology overview

HP StoreOnce Backup systems—key features and benefits • HP StoreOnce deduplication—store more data on disk

HP StoreOnce deduplication reduces the disk space required to store backup data sets without impacting backup performance. Retaining more backup data on disk for longer enables greater data accessibility for rapid restore of lost or corrupt files and reduces downtime.

Deduplication ratios are strongly influenced by two factors—data change rate and backup data retention periods. Low data change rates and data retained for longer periods of time yield higher deduplication ratios.

• Deduplication-enabled replication HP StoreOnce Deduplication is the technology enabler for HP StoreOnce Deduplication-enabled replication which allows fully automated replication over low bandwidth links to a DR site, giving Remote Office/Branch Office (ROBO) and small data centers a cost effective DR solution for the first time.

• Rapid restore of data for dependable, worry-free data protection HP StoreOnce Backup systems offer immediate access to backups for rapid restores. HP StoreOnce deduplication allows more data to be stored closer to the data center for longer periods of time which offers immediate access for rapid restores.

• Automate, simplify, and improve the backup process HP StoreOnce Backup systems automate your backup processes allowing you to reduce the time spent managing your data protection. Implementing hands-free, unattended daily backup is especially valuable for environments with limited IT resources, such as remote or branch offices.

HP StoreOnce systems can backup multiple servers via a standard Ethernet or Fibre Channel network simultaneously to a disk-based solution at peak speeds of up to 100 TB per hour instead of sequentially to a tape drive or autoloader, meaning that you can substantially reduce your backup window.

HP StoreOnce systems can be intuitively managed and configured by using the built-in Web browser’s administrative interface. For larger deployments of replicating HP StoreOnce appliances, the HP StoreOnce Replication Manager can monitor multiple backup systems across geographies. HP StoreOnce systems are self-managing backup appliances that require little, if any, routine maintenance. Unlike other disk-based storage devices, HP StoreOnce systems do not require virus protection or LUN provisioning.

HP StoreOnce Catalyst and Symantec NetBackup OpenStorage (OST) • HP StoreOnce Catalyst

HP StoreOnce Catalyst is an object-based storage target on an HP StoreOnce Backup system that offers client side deduplication for use with backup and recovery applications.

• NetBackup OST

OST is a Symantec backup interface that allows intelligent storage devices, like the HP B6200 StoreOnce Backup system, to work with Symantec's NetBackup software. OST provides NetBackup administrators with advanced capabilities, such as optimized duplication and DR copy creation.

• HP OST plug-in for Symantec NetBackup

The HP OST plug-in is installed on the NetBackup media servers. It uses an HP StoreOnce Catalyst client-side library interface to interact with the HP StoreOnce Backup systems.

4

HP StoreOnce Catalyst—seamless data movement across the enterprise HP StoreOnce Catalyst brings the HP StoreOnce vision of a single, integrated enterprise-wide deduplication algorithm a step closer. It allows the seamless movement of deduplicated data across the enterprise to other HP StoreOnce Catalyst systems without rehydration. This means that you can benefit from:

• Simplified management of data movement from a single pane of glass: Tighter integration with your backup application to centrally manage file replication across the enterprise.

• Seamless control across complex environments: This supports a range of flexible configurations that enable the concurrent movement of data from one site to multiple sites, and the ability to cascade data around the enterprise (sometimes referred to as multi-hop).

• Enhanced performance: Distributed deduplication processing using HP StoreOnce Catalyst stores on the B6200 and on multiple servers can optimize loading and utilization of backup hardware, network links, and backup servers for faster deduplication and backup performance.

• Faster time to backup to meet shrinking backup windows: Up to 100 TB per hour aggregate throughput, 4X faster than backup to a NAS target.

Note: Actual performance is dependent upon configuration data set type, compression levels, number of data streams, number of devices emulated, and number of concurrent tasks, such as housekeeping or replication.

HP StoreOnce Backup systems in small to large data centers

Advantages of using HP StoreOnce B6200 Backup systems with Symantec NetBackup • Easier setup to protect, manage, and access data

• A single portable data deduplication algorithm that can run on an application server, media server, or dedicated appliance

• Faster backup and restoration of more data across more usage models

• Reduced workload on IT infrastructure, thus reducing CPU and memory requirements on the HP StoreOnce B6200 Backup system allowing for faster throughput performance

• Allows more backup data to be retained on disk for longer periods

• Improved functionality, performance, and total cost of ownership (TCO) while migrating data protection environments from disparate small systems into scalable HP StoreOnce Backup systems

• Improved performance, function, and integration of backup applications such as Symantec NetBackup

5

HP StoreOnce Backup systems for enterprise data backup using Symantec NetBackup An important part of server administration is maintaining a consistent set of backup data, which should be available for recovery. When data is lost due to user error, system failure, or catastrophic site failure, there is a need for complete server recovery along with application data recovery. Symantec NetBackup provides data protection for multiple, heterogeneous servers and operating systems. The backup data can be consolidated to a single HP StoreOnce Backup system leveraging 10GbE Ethernet and Fibre Channel speed. HP StoreOnce Backup systems integrated with a well-planned data protection strategy include regular backups to maintain a consistent set of data for recovery purposes.

Symantec NetBackup infrastructure components

Table 1. Symantec NetBackup architecture

Component Description

Master server The master server manages backups, archives, and restores and is responsible for media and device selection.

Media server The media server provides additional storage by allowing NetBackup to use the storage devices that are attached to it.

Client A client is any system that requires protection. A NetBackup client sends and receives data across the local area network (LAN) to and from a NetBackup Master/Media Server. A media server can also be considered a client.

SAN client A SAN client contains critical data that requires high bandwidth for backups. It connects to a NetBackup Media Server over Fibre Channel.

Figure 1 shows an example of Symantec NetBackup architecture connections with an HP StoreOnce Backup system.

6

Figure 1. Symantec NetBackup architecture with an HP StoreOnce Backup system

Capacity planning The required backup storage capacity for server backups depends on the following:

• Size of data and number of servers

• Backup retention policy (recovery points needed)

• Type of backups (full, incremental, differential)

• Frequency of backups

• Data rate of change

• The deduplication ratio achieved by the HP StoreOnce B6200 Backup system

7

HP StoreOnce B6200 Backup systems do not deduplicate across Catalyst stores. Each Catalyst store is an independent deduplication domain. To achieve better deduplication ratio a unique Catalyst store should be created specifically for similar data types. For larger environments, multiple Catalyst stores may work best with backups of similar type of operating system data stored in the same HP StoreOnce B6200 Backup system.

Note: The rate of change of data refers to the amount of data that would be contained in an incremental backup as a percentage of a full backup. A 100 GB full backup with a subsequent 5 GB incremental backup before the next full backup would be a five percent rate of change.

In performing these tests HP used a standard customer representative data set with realistic structure and content.

An administrator may desire to have two weeks of backup stored on the StoreOnce system for quick recovery access.

The HP OpenStorage plug-in for Symantec NetBackup has two write mode options for transferring data to HP StoreOnce B6200 Backup systems:

• Low bandwidth (client-side deduplication on)

• High bandwidth (client-side deduplication off)

Figure 2 shows the throughput performance comparison between HP StoreOnce Catalyst low bandwidth and high bandwidth transfer policies.

Figure 2. HP StoreOnce Catalyst throughput performance comparisons

Figure 3 shows the data rate of change effect on deduplication ratios when backing up data using HP StoreOnce Catalyst. The low bandwidth transfer policy was used and the deduplication is taking place at the media server (client-side deduplication).

1 2 3 4 5 6 7 8 9 10 11 12 13 14

HP StoreOnce Catalyst throughput—1% rate of change low bandwidth vs. high bandwidth

Low Bandwidth

High Bandwidth

Number of backups

Back

up th

roug

hput

Low bandwidth

High bandwidth

8

Figure 3. HP StoreOnce Catalyst rate of change effect on HP StoreOnce deduplication ratios

Windows vs. Linux deduplication ratios with HP StoreOnce Catalyst • Windows® and RHEL servers have comparable deduplication ratios

Figure 4 illustrates little deduplication difference between Windows and Linux operating system types (using low bandwidth). For the chart below, data was updated between each backup until the desired rate of change was reached.

Figure 4. Windows vs. Linux server backup deduplication ratios (using low bandwidth)

1% 2% 3% 4% 5%

Expe

cted

ded

uplic

atio

n ra

tio

Data rate of change between each backup

HP StoreOnce Catalyst deduplication ratios relative to the data rate of change

1 2 3 4 5 6 7 8 9 10 11 12 13 14

HP StoreOnce Catalyst deduplication ratios—5% rate of change

Windows

RHEL

Number of backups

Expe

cted

ded

uplic

atio

n ra

tio

9

Symantec NetBackup schedule type characteristics with an HP StoreOnce B6200 Backup system Many backup environments take advantage of Symantec NetBackup’s different backup schedule types such as:

• Daily full

• Weekly full with daily incremental

Table 2 lists some characteristics of full and incremental backups to an HP StoreOnce B6200 Backup system over a period of 20 days with a 30 GB data set.

Table 2. Comparison of different backup schedule types

Daily full Weekly full/Daily incremental

Description Daily backup of all files Weekly backup of all files, daily backup of files that have changed since the last backup

Backed up data All data Weekly full—all data

Daily incremental—changed data since the last backup

Relative backup time Long Short, except on days of full backup

Relative recovery time Short Long

Server load High Low, except on days of full backup

SAN/LAN bandwidth requirement High Low, except on days of full backup

Relative deduplication ratio on appliance High Low

Size on HP StoreOnce compared to non-deduplication

Small Very small

Figure 5 illustrates the end-to-end data compaction of full and incremental backups to an HP StoreOnce B6200 Backup system over a period of 20 days (using low bandwidth).

Note: Data compaction refers to the removal of redundant information from a backup set prior to storing on a backup device. Incremental backups, deduplication, and compression are all methods for removing redundant data from a backup set.

Each schedule shows the overall size of the backup data without deduplication vs. the size of the data on the HP StoreOnce B6200 Backup system after deduplication with Catalyst.

10

Figure 5. Data compaction comparison of different backup schedule types (using low bandwidth)

Symantec NetBackup block size effect on throughput and deduplication ratio Symantec NetBackup supports varying block sizes for backing up server data. Larger block sizes usually result in better throughput and an increase in data deduplication ratios. HP recommends keeping the block size set at 256 and above for better throughput.

Figure 6 illustrates how HP StoreOnce Catalyst backup throughput benefits from higher block size (using low bandwidth).

0.00

1.00

2.00

3.00

4.00

5.00

Tera

byte

s

HP StoreOnce Catalyst data compaction with different schedule types

Client backup

Size on StoreOnce

Deduplication

Weekly full + daily incremental Daily full

Size on HP StoreOnce

11

Figure 6. Varying block sizes vs. backup throughput (using low bandwidth)

Figure 7 illustrates StoreOnce Catalyst deduplication ratios when using different backup block sizes (using low bandwidth).

Figure 7. Varying block sizes vs. deduplication ratios (using low bandwidth).

1 2 3 4 5 6 7 8 9 10 11 12 13 14

HP StoreOnce Catalyst throughput—varying block sizes 1% rate of change

128KB

256KB

512KB

Number of backups

Back

up t

hrou

ghpu

t

1 2 3 4 5 6 7 8 9 10 11 12 13 14

HP StoreOnce Catalyst deduplication ratio—varying block sizes 1% rate of change

128KB

256KB

512KB

Number of backups

Expe

cted

ded

uplic

atio

n ra

tio

128 KB

256 KB

512 KB

128 KB

256 KB

512 KB

12

Other factors affecting throughput:

• Data multiplexing: When multiple backups run in parallel to a single device, multiplexing results in better utilization of the device which helps in better overall throughput and reduction of the backup window. With the help of multiplexing, it is possible to back up multiple servers simultaneously.

• Streams and data writers: A data stream can be thought of as a data channel that connects client file systems to the storage media. Multiple streams running in parallel through a single channel increases the rate at which data can be written to storage media.

Other factors affecting deduplication ratio:

• Backup policies: Retaining data for longer period of time improves the chance that common data will already exist in storage, resulting in greater storage savings and better deduplication ratio.

• Compression/Encryption: Software compression and encryption prevents optimal deduplication since data that is already compressed and encrypted cannot be efficiently deduplicated.

• Data types: To improve deduplication ratio, configure individual VTL/NAS devices as separate library/disk devices in Symantec NetBackup for individual server/type of operating systems.

DR with local HP StoreOnce Backup system and remote replication Most companies recognize the importance of a robust data protection strategy. Enterprise-level customers are likely to invest in local server’s recovery, as well as, site DR at remote site using replication. Many companies, large and small, are protecting servers in remote offices where untrained IT staff is expected to manage a daily backup process—generally involving the changing of physical tapes, which is a process prone to more time consumption, resource draining, inefficient and human error.

Replicating large volumes of data backup over a typical WAN is expensive. However, today's products with data deduplication have made it possible to replicate data over lower bandwidth links for a more cost effective, network-efficient replication solution that provides a practical DR solution and an ideal solution for centralizing the backup of remote offices.

Data deduplication shrinks the amount of backup data that needs to be replicated from the source HP StoreOnce appliance, and as a result significantly reduces replication bandwidth requirements. Once a replica of the data backup set has been created on a remote HP StoreOnce target appliance all that is required to keep the replica identical to the source is the automatic, periodic copying and movement of the new data segments which are created during each backup. With such small amounts of data being transmitted asynchronously, lower bandwidth networks offer sufficient performance and a much lower cost solution.

Note: Replication of data can only occur between devices within the same product family that is, HP StoreOnce B6200, but not VLS.

Symantec NetBackup replication using HP StoreOnce Catalyst One of the key features that HP StoreOnce Catalyst stores provide is allowing NetBackup to utilize low bandwidth duplication (or copy) jobs. NetBackup allows duplicate backup jobs to function between Catalyst stores, allowing complete control over to NetBackup the backup data lifecycle. This is accomplished by using NetBackup’s Storage Lifecycle Policies to control the lifecycle of backup data. Replicating backup data between HP StoreOnce Backup systems is accomplished by properly configuring a Storage Lifecycle Policy to contain a set of sequential jobs, one being the backup job and the other being a duplicate (or copy) job. Therefore, whenever the backup job completes, the duplicate (or copy) job is executed to the alternate HP StoreOnce Backup system.

There is flexibility in doing data recovery, depending upon the situation or type of failure (see figure 8). For instance:

• Clients can be recovered at the HP StoreOnce source site (original client location).

• In the event of a client source site disaster, the target site HP StoreOnce can be shipped to the source site or the backup data can be replicated back to the source site for complete client recovery.

• Clients can be recovered at the HP StoreOnce target site (remote location).

13

Recovery scenarios Figure 8 illustrates DR scenarios that may occur and the recovery path available when replicating between HP StoreOnce Backup systems.

Figure 8. Recovery scenarios

14

Recommendations

• Increased backup speed

Use low bandwidth as the Primary Transfer Policy on the HP StoreOnce B6200 backup store to improve –throughput performance.

If backup throughput performance is the highest priority, use Symantec NetBackup multiplexing to send multiple –streams and use multiple data readers/writers for client to send multiple server backups simultaneously to the HP StoreOnce Backup system; however, the data will be interleaved on the HP StoreOnce system, resulting in a decline in the deduplication ratio.

• Do not multiplex if you want increased deduplication ratios.

Back up one server at a time by disabling NetBackup multiplexing and client side deduplication to achieve the best –possible deduplication ratio.

• Back up block size

Use a larger block size for improved backup throughput performance –

The backup block size used by Symantec NetBackup may have a small effect on data deduplication ratios on the –HP StoreOnce Backup system.

• Daily full or weekly full with daily incremental backups

Daily full backups deduplicate at a much higher rate than weekly full with daily incremental, but require more –server and StoreOnce processing resources and SAN/LAN bandwidth.

End-to-end data compaction is greater for incremental backup schedules over an extended time period, which –means less storage space will be used on the StoreOnce Backup system.

• DR

HP StoreOnce remote replication offers a low bandwidth replication solution to and from remote sites, which is –ideal for server DR.

Backups through Symantec NetBackup to an HP StoreOnce pair configured with remote replication provides –recovery for local disk failure, complete server failure, or complete site failure by keeping server backup copies at local and remote sites.

Conclusion Enterprise customers demand an efficient, reliable data growth management backup system environment, while keeping costs under control. HP provides a variety of reliable data protection storage solutions that address such requirements. HP StoreOnce Catalyst, along with Symantec NetBackup OST, is one such solution. HP StoreOnce Backup systems offer high performance and reliability, while addressing data growth through HP StoreOnce data deduplication technology. In addition, Symantec NetBackup’s data protection solution brings together a full generation of traditional and next generation data protection from backup, to disk to replication management, to tape under one platform. In all, HP StoreOnce Backup systems integrate easily with Symantec NetBackup to protect important data for mission-critical applications.

15

Useful links HP StoreOnce Backup System user guide

http://bizsupport1.austin.hp.com/bc/docs/support/SupportManual/c02295179/c02295179.pdf

HP StoreOnce Backup systems—Linux and UNIX® configuration guide

http://h20000.www2.hp.com/bc/docs/support/SupportManual/c02299831/c02299831.pdf

HP StoreOnce Backup systems: best practices for VTL, NAS, and replication implementations

http://bizsupport2.austin.hp.com/bc/docs/support/SupportManual/c02511912/c02511912.pdf

Symantec NetBackup—backup and recovery documentation

http://www.symantec.com/netbackup

For more information Address your requirements for an efficient, reliable, data-growth management backup system in an enterprise server environment, while keeping costs under control, by combining HP StoreOnce Backup systems with Symantec NetBackup. To learn how, visit hp.com/go/StoreOnce.

Get connected hp.com/go/getconnected

Current HP driver, support, and security alerts delivered directly to your desktop

© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein.

Windows is a U.S. registered trademark of Microsoft Corporation. UNIX is a registered trademark of The Open Group.

4AA4-2021ENW, Created October 2012