Real-time Data Analysis with Hadoop and SAP HANA (TUT18851)

dr. Tamas Szirtes, Director Innovation & Technology, SOA People Nederland

Post on 29-Aug-2018

SOA People in numbers

Leading European consulting and services group specialized in SAP solutions

[Chart: one value per year, 2007 to 2015; data labels 4, 12, 19, 38, 40, 48, 50, 53, 100; metric and unit not stated]

• +400 customers in all kinds of industries and companies of all sizes

• +500 experts

• +8 years of success

• +8 regional offices in Europe

SOA People’s positioning

[Diagram labels: Customers’ challenges; Strategy; Organization; Business Processes; Tools]

Mobile · Analytics · Applications · Big data & real time · Cloud · Networks

Based on disruptive technology to gain higher competitiveness

Being your IT partner to develop your business strategy in full alignment with SAP leading solutions

Agenda

• Introduction

• Technologies covered

– Hadoop

– SAP HANA

• Typical solution architectures

• SUSE enabling technology

• Q&A

Introduction

The Big Data Opportunity

“Which types of data do you anticipate using in the next year?”

[Chart: survey responses by data type; surviving label: Transactions]

Source: Paradigm4 data scientist survey 2014, www.paradigm4.com/wp-content/uploads/2014/06/P4-data-scientist-survey-FINAL.pdf

SAP Big Data Applications

SAP Big Data Analytics

SAP Big Data IT Landscape Challenge

Technologies covered: Hadoop

What is Hadoop

Spark

CDH

Technologies covered: SAP HANA

What is SAP HANA

In Memory Technology

SAP HANA Live

SAP HANA Two-Tier Architecture

Predictive Analytics

SAP HANA Evolution

Typical solution architectures

Big Data Architectures

What does Hadoop bring to HANA?

• Batch processing: where fast response times are less critical than reliability and scalability

• Complex information processing at scale: enables heavily recursive algorithms, machine learning, and queries that cannot be easily expressed in SQL

• Massive store for low-value data: data stays available, though access is much slower

• Post-hoc analysis: mine raw data that is either schema-less or whose schema changes over time

• Cost-efficient data storage and processing for large volumes of structured, semi-structured, and unstructured data such as web logs, machine data, text data, call data records (CDRs), audio, and video data
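
To make the batch-processing and post-hoc analysis points concrete, here is a minimal sketch in plain Python of a map / shuffle / reduce pass over schema-less web-log lines. It does not use Hadoop itself; it only mimics the processing model on one machine. The sample lines, the field names ("page", "url") and the function names are all hypothetical and chosen purely for illustration.

```python
import json
from collections import defaultdict

# Hypothetical raw web-log lines; note that the fields differ between records.
RAW_LOG_LINES = [
    '{"ts": "2015-11-01T10:00:00", "page": "/home", "user": "u1"}',
    '{"ts": "2015-11-01T10:00:05", "page": "/cart", "user": "u2", "device": "mobile"}',
    '{"ts": "2015-11-01T10:00:09", "url": "/home", "session": "s9"}',
    'not valid json',
]


def map_record(line):
    """Map step: parse one raw line and emit (page, 1), or nothing if unusable."""
    try:
        record = json.loads(line)
    except json.JSONDecodeError:
        return []  # skip malformed lines so the batch keeps running
    if not isinstance(record, dict):
        return []  # valid JSON, but not a record we can interpret
    # Schema-on-read: assume older records use "page" and newer ones use "url".
    page = record.get("page") or record.get("url")
    return [(page, 1)] if page else []


def shuffle(pairs):
    """Shuffle step: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(grouped):
    """Reduce step: sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def page_hits(lines):
    """Run the three steps in sequence over a batch of raw lines."""
    pairs = [pair for line in lines for pair in map_record(line)]
    return reduce_counts(shuffle(pairs))


if __name__ == "__main__":
    print(page_hits(RAW_LOG_LINES))
```

The schema is applied only when the data is read, so records written before and after a format change can be mined in the same pass. An aggregate like this one is the kind of small, structured result that could then be loaded into an in-memory database for fast querying.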

Scenarios for implementations

SAP Big Data Acquisition

Data Aging + Data Tiering

Intel + Intenzz

http://www.intel.com/content/www/us/en/big-data/intenzz-xeon-e7-4880-big-data-brief.html

Hadoop integration

Hadoop Integration Roadmap

HANA Vora

SAP HANA Vora is an in-memory query engine which leverages and extends the Apache Spark execution framework to provide enriched interactive analytics on Hadoop.

• Drill Downs on HDFS

• Mashup API Enhancements

• Compiled Queries

• HANA-Spark Adapter

• Unified Landscape

• Open Programming

• Make Precision Decisions

• Democratize Data Access

• Simplify Big Data Ownership

• Any Hadoop Clusters

SUSE enabling technology

SUSE for SAP

• Optimized for fast deployment.

• Seamless integration.

• Optimized for business continuity.

• Integrated 24x7 support.

• Support for large memory intensive workloads.

• Extended service pack overlap support.

• Flexible cloud options.

• Optimized for SAP HANA.

• Optimized for performance.

SUSE Linux Enterprise + SAP HANA

SUSE Linux Enterprise Server for SAP

• Automate HANA System Replication

• Patch the OS kernel while it’s running

• Undo system changes

Q&A