data provisioning

Upload: rajat-agrawal

Post on 18-Dec-2015

DESCRIPTION

SAP HANA Data Provisioning options

TRANSCRIPT

How to Select the Appropriate SAP HANA Data Provisioning Tool

Learning points

What is HANA Database

Data Provisioning tools:
Log-based replication with Sybase Replication Server
Trigger-based replication with SAP LT Replication Server (SLT)
ETL-based replication with SAP BusinessObjects Data Services
SAP HANA Direct extract connection (DXC)

Comparison of the replication methods

What is HANA Database

SAP HANA is a data-source-agnostic database that facilitates a large volume of data storage. The data agnostic capability presents the opportunity to engage a number of data provisioning tools seamlessly with a range of data transformation capabilities.

When SAP HANA is used in a side-car data mart approach (that is, not as a database server), data has to be fed from the source system into SAP HANA by replication.

A single HANA instance can be fed by more than one data provisioning tool at the same time and can combine data from multiple sources into a single HANA view. The ability to merge large amounts of data in HANA provides a broader information capability and results in a more effective decision-making tool.

There are several replication methods available to accomplish this:
Sybase Replication Server
SAP LT Replication Server (SLT)
SAP Data Services
Direct Extractor Connection (DXC)

I'll illustrate the technical architecture of four data provisioning tools and the criteria for selecting the right tool for your needs.

Note: Other than these widely used data provisioning tools, you can also consider loading flat files with SAP HANA Studio or at the operating system level using Python scripts and SQL script uploads. These two options can be used as one-time activities or for test data loads.

Log-based and SLT Replication

Log-based replication with Sybase Replication Server

The log-based replication for SAP HANA is realized with Sybase Replication Server. It captures table changes from low-level database log files and transforms them into SQL statements that are in turn executed on the SAP HANA database. This is very similar to what is known as log shipping between two database instances.
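The step of turning captured log entries into SQL statements can be illustrated with a small sketch. This is not how Sybase Replication Server is implemented; the LogRecord structure, its fields, and the to_sql function are hypothetical names chosen for illustration, and a real database log is a binary format that needs vendor-specific decoding first.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Optional, Tuple


@dataclass
class LogRecord:
    """Hypothetical, simplified stand-in for one decoded database log entry."""
    op: str                                  # "INSERT", "UPDATE" or "DELETE"
    table: str
    key: Dict[str, Any]                      # primary-key columns of the changed row
    values: Optional[Dict[str, Any]] = None  # new column values (unused for DELETE)


def to_sql(rec: LogRecord) -> Tuple[str, List[Any]]:
    """Translate one log record into a parameterized SQL statement."""
    values = rec.values or {}
    if rec.op == "INSERT":
        row = {**rec.key, **values}
        cols = ", ".join(row)
        marks = ", ".join("?" for _ in row)
        return f"INSERT INTO {rec.table} ({cols}) VALUES ({marks})", list(row.values())
    where = " AND ".join(f"{col} = ?" for col in rec.key)
    if rec.op == "UPDATE":
        sets = ", ".join(f"{col} = ?" for col in values)
        params = list(values.values()) + list(rec.key.values())
        return f"UPDATE {rec.table} SET {sets} WHERE {where}", params
    if rec.op == "DELETE":
        return f"DELETE FROM {rec.table} WHERE {where}", list(rec.key.values())
    raise ValueError(f"unsupported operation: {rec.op}")


# Example: an update captured from the source log becomes one SQL statement
# that could be executed on the target database.
sql, params = to_sql(LogRecord("UPDATE", "ORDERS", {"ID": 7}, {"STATUS": "SHIPPED"}))
print(sql, params)
```

Because the statements are derived from the log rather than from the application, the source system does no extra work per change, which is consistent with the low processing overhead described for this method.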

This is very fast and consumes little processing power due to its closeness to the database system.

However, this mode of operation makes this replication method highly database dependent, and the source database system coverage is limited (only certain versions of IBM DB2 on AIX, Linux, and HP-UX are supported with this replication method).

Replication with Sybase Replication Server only supports Unicode source databases.

To set up replication, the tables must initially be copied from the source database to the SAP HANA database. During copying, the source tables must not be altered, which usually is only achievable with a planned downtime of the system.

This replication method is only recommended for SAP customers who have been invited to use it during the ramp up of SAP HANA 1.0.

Trigger-based replication with SAP LT Replication

SAP Landscape Transformation (LT) Replication is also based on tracking database changes, but on a much higher level. The efficient initialization of data replication is based on database triggers and a delta logging concept. It allows real-time or scheduled data replication of the tables that you choose.

SAP Landscape Transformation (LT) Replication allows for real-time data replication from SAP and non-SAP sources into HANA. It uses DBconnect because HANA is a database.

SAP LT Replication leverages proven System Landscape Optimization (SLO) technologies (such as Near Zero Downtime, Test Data Migration Server (TDMS), and SAP LT) and can handle both Unicode and non-Unicode source databases.

SLT Replication Server does not have to be a separate SAP system. However, a high replication load would impact the performance of the base system, so a separate Replication Server is recommended.

The SAP LT Replication Server is the ideal and recommended solution for all SAP HANA customers who need real-time (or scheduled) data replication sourcing from SAP ERP systems into SAP HANA.
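The combination of database triggers and delta logging described for SLT can be sketched with Python's built-in sqlite3 module standing in for the source database. This only illustrates the general technique, not SLT itself: the table names, the layout of the logging table (key and operation only), and the replicate_delta function are assumptions made for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for the source database
conn.executescript("""
CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT);
-- Logging table: records which row changed and how, in arrival order.
CREATE TABLE orders_log (seq INTEGER PRIMARY KEY AUTOINCREMENT, id INTEGER, op TEXT);
CREATE TRIGGER orders_ins AFTER INSERT ON orders
BEGIN INSERT INTO orders_log (id, op) VALUES (NEW.id, 'I'); END;
CREATE TRIGGER orders_upd AFTER UPDATE ON orders
BEGIN INSERT INTO orders_log (id, op) VALUES (NEW.id, 'U'); END;
CREATE TRIGGER orders_del AFTER DELETE ON orders
BEGIN INSERT INTO orders_log (id, op) VALUES (OLD.id, 'D'); END;
""")

target = {}  # stand-in for the replicated table on the target side


def replicate_delta(conn, target):
    """Apply all logged changes to the target in order, then clear them from the log."""
    rows = conn.execute("SELECT seq, id, op FROM orders_log ORDER BY seq").fetchall()
    for _seq, row_id, op in rows:
        if op == "D":
            target.pop(row_id, None)
            continue
        current = conn.execute(
            "SELECT status FROM orders WHERE id = ?", (row_id,)
        ).fetchone()
        if current is not None:  # the row may have been deleted by a later change
            target[row_id] = current[0]
    if rows:
        conn.execute("DELETE FROM orders_log WHERE seq <= ?", (rows[-1][0],))
        conn.commit()
    return len(rows)


conn.execute("INSERT INTO orders VALUES (1, 'NEW')")
conn.execute("INSERT INTO orders VALUES (2, 'NEW')")
conn.execute("UPDATE orders SET status = 'SHIPPED' WHERE id = 1")
conn.execute("DELETE FROM orders WHERE id = 2")
applied = replicate_delta(conn, target)
print(applied, target)
```

Calling replicate_delta continuously approximates real-time replication, while calling it on a timer corresponds to the scheduled mode. The triggers run inside the source database, which is why a heavy replication load can affect the source system.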

SLT installation options

ETL-based replication with SAP BusinessObjects Data Services

Data Services is an ETL tool designed to support complex, batch-based ETL data loads.

Data transfers are done in batch mode, which limits the real-time capabilities of this replication method.

Data Services provides several kinds of data quality and data transformation functionality. Due to the rich feature set available, implementation time for the ETL-based replication is longer than for the other replication methods.

The newer version of Data Services, 4.1, is a highly integrated tool that enables schema and table data definition management within Data Services.

You can connect Data Services through:
SAP ERP
    Remote Function Call (RFC) connection
    ABAP workflow
    Operational Data Provider (ODP) DataSource
SAP NetWeaver BW - Open Hub method
Non-SAP systems - connection is made directly to the database via the appropriate driver

An RFC connection is a relatively quick way to connect and extract data from the ERP system. Tables with a smaller data volume are the better candidates for it in terms of performance and usability. The RFC connection has a limit of 512 bytes per record.
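The 512-byte record limit can be checked before an extraction is designed. The sketch below shows one possible way to plan around it by splitting a wide table into several field groups that each repeat the key fields, so the partial rows can be rejoined later. This workaround is an assumption of the example and is not described in the slides; plan_field_groups, the field names, and the widths are all made up for illustration.

```python
MAX_RECORD_BYTES = 512  # per-record limit of the RFC connection, as stated above


def plan_field_groups(field_widths, key_fields, limit=MAX_RECORD_BYTES):
    """Split fields into groups whose total width, including key fields, fits the limit.

    field_widths: dict mapping field name -> width in bytes (order is preserved)
    key_fields:   names repeated in every group so partial rows can be rejoined
    """
    key_width = sum(field_widths[name] for name in key_fields)
    if key_width > limit:
        raise ValueError("key fields alone exceed the record limit")
    groups, current, used = [], list(key_fields), key_width
    for name, width in field_widths.items():
        if name in key_fields:
            continue
        if key_width + width > limit:
            raise ValueError(f"field {name} cannot fit next to the key fields")
        if used + width > limit:
            # Current group is full: close it and start a new one with the keys.
            groups.append(current)
            current, used = list(key_fields), key_width
        current.append(name)
        used += width
    groups.append(current)
    return groups


# Hypothetical table layout: two key fields and three payload fields.
widths = {"DOC_NO": 10, "ITEM_NO": 6, "LONG_TEXT_1": 300, "LONG_TEXT_2": 300, "MATERIAL": 18}
groups = plan_field_groups(widths, ["DOC_NO", "ITEM_NO"])
print(groups)
```

If the plan returns a single group, the table fits into one RFC extraction. A long list of groups is a hint that one of the other connection methods may be the better fit.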

For tables that have large amounts of data, or when it is necessary to load multiple tables, SAP has developed ABAP workflow. ABAP workflow enables these bulk file-based extractions.

The ODP method enables the reuse of existing BW extractors. This method enables the extraction of complex data in highly denormalized forms of operational data, such as billing document details. You can enable ODP for custom extractors as well as for specific standard extractors.

SAP HANA Direct extract connection (DXC)

DXC is an extractor-based replication technology for data replication from existing SAP DataSource extractors into SAP HANA.

DXC utilizes the DataSource extractors that exist in SAP to load data directly from the BW InfoPackage into HANA. It works with generic DataSources and with any SAP-delivered DataSource with customer extensions.

Unlike SLT, DXC provides the batch load option with minimal transformation capabilities. However, you can use custom user exits for additional transformations.

DXC does not require additional licensing; therefore, it is a more cost-effective method than Data Services or SLT.

DXC uses the embedded BW system (which runs within the SAP NetWeaver applications) to enable extraction. Embedded BW is used as a data transference tool rather than as a data storage or transformation engine.

DXC is used to create InfoPackages and process chains, as well as to monitor and schedule loads through embedded BW.

XSEngine, HANA's built-in Web service engine, enables DXC for HANA. Therefore, the connection between the source system and HANA is made as an HTTP-type connection. Once the extractor is enabled for DXC, HANA-controlled tables are generated to handle the loads. These tables are called In-Memory DataStore Objects (IMDSOs). They enable delta management and sequencing.

DXC Architecture

Comparison of the Replication Methods

Comparison of Sybase Replication Server and SLT System

Sybase Replication Server:
It provides the fastest replication from the source database to SAP HANA.
It is limited to Unicode-encoded source databases and does not support all table types used in SAP applications.
It provides no transformation functionality, and the source database system support is very limited.

SLT System:
It provides real-time replication while supporting a wide range of source database systems.
It can handle both Unicode and non-Unicode databases, and makes use of proven data migration technology.
It offers broader support of source systems, while providing nearly the same real-time capabilities.
It is the recommended choice for replication from SAP source systems.

Comparison of DXC and Data Services
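The statements made about the individual tools in the preceding sections can be condensed into a rough first-pass selection helper. This is a deliberate simplification and not an official SAP decision tree: the function name suggest_tool, its three yes/no inputs, and the order of the checks are assumptions of this sketch. Sybase Replication Server is left out because the slides recommend it only for customers invited during the SAP HANA 1.0 ramp up.

```python
SLT = "SAP LT Replication Server (SLT)"
DATA_SERVICES = "SAP BusinessObjects Data Services"
DXC = "SAP HANA Direct Extractor Connection (DXC)"


def suggest_tool(needs_real_time: bool,
                 needs_rich_transformation: bool,
                 has_datasource_extractors: bool) -> str:
    """Return a first-pass suggestion based on the criteria named in the slides."""
    if needs_real_time:
        # Data Services and DXC load in batch mode, so only SLT remains.
        return SLT
    if needs_rich_transformation:
        # Data Services offers data quality and transformation functionality.
        return DATA_SERVICES
    if has_datasource_extractors:
        # DXC reuses existing DataSource extractors without additional licensing.
        return DXC
    # SLT also supports scheduled (non-real-time) replication.
    return SLT


print(suggest_tool(needs_real_time=False,
                   needs_rich_transformation=False,
                   has_datasource_extractors=True))
```

A real selection would also weigh source system type, Unicode support, licensing, and implementation effort, all of which are covered in the comparison above but not modeled here.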

Q & A