architecture modernization sessions amsterdam 12 oct final

18
1 © Cloudera, Inc. All rights reserved. Architecture Modernization Frank Vullers Business Value Strategist EMEA

Upload: frank-vullers

Post on 13-Apr-2017

37 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Architecture Modernization Sessions Amsterdam 12 oct final

1© Cloudera, Inc. All rights reserved.

Architecture ModernizationFrank VullersBusiness Value Strategist EMEA

Page 2: Architecture Modernization Sessions Amsterdam 12 oct final

2© Cloudera, Inc. All rights reserved.

Page 3: Architecture Modernization Sessions Amsterdam 12 oct final

3© Cloudera, Inc. All rights reserved.

Our relationship with data is changingData is now a strategic asset, how you use it, your key differentiator

Page 4: Architecture Modernization Sessions Amsterdam 12 oct final

4© Cloudera, Inc. All rights reserved.

Evolution in use of data

Traditional BI Big Data Analytics

Fast Data Analytics

More(Different) Data

(near) Real time

Page 5: Architecture Modernization Sessions Amsterdam 12 oct final

5© Cloudera, Inc. All rights reserved.

Traditional BI

Business determine questions to ask

IT structure data to answer the questions

Describe outcomes

What’s Next?

(Outcome Driven)

“Capture only what’s needed”

Page 6: Architecture Modernization Sessions Amsterdam 12 oct final

6© Cloudera, Inc. All rights reserved.

Big Data Analytics

Business Explores Data for Questions Worth Answering

IT Delivers a Platform for Storing, Refining, and Analyzing All Data Sources

Explain Causes

Decisions/Action Plans

“Capture in case it’s needed”

(Process Driven)

Scalable Machine LearningTest, train and run on the same

environment

Page 7: Architecture Modernization Sessions Amsterdam 12 oct final

7© Cloudera, Inc. All rights reserved.

Fast Data Analytics

Real Time event requires Analysis in (near) real time

React in (near) real time with alert or offer

“Analyse and React fast within the time window”

(near) Real time

Recommendation Engine• Next Best Offer• Content and/or Services

Recommendation

Event Detection• Fraud/Risk Detection• Spam Filter• Marketing Alerts

Model scoring• Embedded Analytics• Analytic Aggregates• Reports

Real Time Events

Page 8: Architecture Modernization Sessions Amsterdam 12 oct final

8© Cloudera, Inc. All rights reserved.

Architecture view

Page 9: Architecture Modernization Sessions Amsterdam 12 oct final

9© Cloudera, Inc. All rights reserved.

Schema on Read is the Change Agent

©2014 Cloudera, Inc. All rights reserved.

Schema on Write• Determine Requirements• Design Schema• Collect & Transform Data• Validate Design

Schema on Read• Explore • Transform• Analyze• Iterate

Image source: “Business Process Analytics” by M. Zur Muhlen, Robert Shapiro, in Handbook ofBusiness Process Management 2, Springer Berlin Heidelberg, pp 137-157, 2010.

When storing data in Hadoop it is not necessary to declare its structure or association with any particular application

Page 10: Architecture Modernization Sessions Amsterdam 12 oct final

10© Cloudera, Inc. All rights reserved.

The logical architecture hasn’t changed *

*Ralph Kimball: The Future of Data Warehousing: ETL Will Never be the Same

Page 11: Architecture Modernization Sessions Amsterdam 12 oct final

11© Cloudera, Inc. All rights reserved.

The logical architecture hasn’t changed *

*Ralph Kimball: The Future of Data Warehousing: ETL Will Never be the Same

Page 12: Architecture Modernization Sessions Amsterdam 12 oct final

12© Cloudera, Inc. All rights reserved.

Modernizing (traditional) Architecture

EDW

ERP CRM …

BI

Traditional BI

Page 13: Architecture Modernization Sessions Amsterdam 12 oct final

13© Cloudera, Inc. All rights reserved.

Modernizing (traditional) Architecture

EDW

ERP CRM …

BI

Big Data Analytics

Page 14: Architecture Modernization Sessions Amsterdam 12 oct final

14© Cloudera, Inc. All rights reserved.

Modernizing (traditional) Architecture

EDW

ERP CRM …

BI

Fast Data Analytics

Page 15: Architecture Modernization Sessions Amsterdam 12 oct final

15© Cloudera, Inc. All rights reserved.

Summary Modernizing Architecture

EDW

ERP CRM …

BI

Trad

ition

al B

I Fa

st D

ata

Anal

ytics

Big

Data

An

alyti

cs

Page 16: Architecture Modernization Sessions Amsterdam 12 oct final

16© Cloudera, Inc. All rights reserved.

Logical Information Architecture (1/2)

Landing Zone / Staging Layer

Discovery Zone / Enriched Layer

Integrated Zone / Atomic Layer

Optimized Zone / Mart Layer

• Data from source • Separate directories• original format and

structure

• Still separate directories • Data sets “enriched”• Available for Discovery

and Exploration

• Data joined together• One atomic data model• Not optimized for

speed

• Organized to provide optimized performance

• Organized by use case• Deformalized, uses

optimized formats

Page 17: Architecture Modernization Sessions Amsterdam 12 oct final

17© Cloudera, Inc. All rights reserved.

Logical Information Architecture

Landing Zone Discovery Zone Integrated Zone Optimized Zone

Man

aged

User

Staging Layer Enriched Layer Atomic Layer Mart Layer

Raw Trusted

Z0-M

Z0-U

Z1-M

Z1-U

Z2-M

Z2-U

Z3-M

Ingest Validation &Verification Enrichment Transformation Routing

Logical Information Architecture (2/2)

Page 18: Architecture Modernization Sessions Amsterdam 12 oct final

18© Cloudera, Inc. All rights reserved.

Thank You!@[email protected]+31 6 83 94 6122