new analytical architectures for big data

30
New Analytical Architectures March 21, 2013 Casey Kiernan • [email protected] Blog • www.the-data-platform.com Why Classic Data Warehousing Approaches Miss the Mark with Big Data

Upload: casey-kiernan

Post on 07-Dec-2014

584 views

Category:

Technology


1 download

DESCRIPTION

Why Classic Data Warehousing Architectures miss the mark with the New Analytics

TRANSCRIPT

Page 1: New Analytical Architectures for Big Data

New Analytical Architectures

March 21, 2013

Casey Kiernan • [email protected]

Blog • www.the-data-platform.com

Why Classic Data Warehousing Approaches Miss the Mark with Big Data

Page 2: New Analytical Architectures for Big Data

Doug Cutting“Hadoop is the kernel of a new Distributed Data OS”

“The Future is Data”

Page 3: New Analytical Architectures for Big Data

Transactional

Communities

Personal

Data has Changed

> Trailing Indicators

> Reach/Influence

> Interactive

> Analytics has Changed

Can the Data Warehouse Architecture adapt?

Page 4: New Analytical Architectures for Big Data

The World as I See it

“Data” is the Platform

Page 5: New Analytical Architectures for Big Data

New DataClutch Analytics

Wink Eller

Page 6: New Analytical Architectures for Big Data

My Mountain Bike

Guidance

PerformanceRate of ClimbCalories BurnedMiles ObtainedTotal ClimbedElapsed Time

Current, Average, Max Values

Data CollectionSpeed / Trip Miles

Data CollectionCadence / RPM

Data Collection Heart Rate

Data Collection AltitudeTemperatureTime

Data Architecture - on a Local Wireless Network (ANT+ Protocol)

as a Data Platform

Page 7: New Analytical Architectures for Big Data

“Personal” Ride Analytics

…is this a Data Warehouse?

Page 8: New Analytical Architectures for Big Data

Beha

vior

s

Content

Progression of B

ehaviors

New Data Behaviors (individual actions) > Content > Time

Time Varia

nce

Page 9: New Analytical Architectures for Big Data

9

Gui

danc

e

Data

Meaningful

Massive

New Data More is Better…

Page 10: New Analytical Architectures for Big Data

BUSINESS INTELLIGENCEOLAP / DATA WAREHOUSE

OLTP / TRANSACTIONSDATA.

“Business” Analytics - Classic “DW”

Answers the question: What are our most profitable Products?

Page 11: New Analytical Architectures for Big Data

11

What will Happen?What did Happen?

StrategicTactical TrendingOperational Reporting

Months Weeks Weeks Months Years

Classic “Business” AnalyticsGood for Reporting, Forecasting

Descriptive/Trending Analytics

Page 12: New Analytical Architectures for Big Data

New“Personal” Analytics

Answers the question: Show me a good movie to watch!

DATA.

SELF-SERVICEGUIDANCE

BEHAVIOURS

Page 13: New Analytical Architectures for Big Data

StrategicTactical TrendingOperational Reporting

13

What will Happen?What did Happen?

Months Weeks Weeks Months Years

What is Happening RIGHT NOW!

“Personal” Analytics“Right Now” is a very important time-frame!

Predictive/Prescriptive Analytics

Page 14: New Analytical Architectures for Big Data

14

Page 15: New Analytical Architectures for Big Data

15

Page 16: New Analytical Architectures for Big Data

16

Ordering App

Data WarehouseOLTP to OLAP

Mapping

OLAP / ReportsFacts/DimensionsFinancial App

Master Data

BusinessAnalyst

What are our most Profitable Products?

Stag

ing

“Business” Analytical ArchitectureClassic “DW” Data Flow - Uni-Directional, Latent,…

Business Metrics, KPI, YTD ReportingFacts &

Dimensions

Page 17: New Analytical Architectures for Big Data

17

Application / UX

AnalyticsData

“Personal” Analytical Architecture

DataAnalysts

Analytical CapabilitiesScoring/Ranking, Recommendations,Natural Language Processing, Relevancy, Classification, Optimization, Collaborative Filtering,Personalization,Digital Attribution,…

“New” Data Flow - Iterative, Specialized, Extensible, plug & play Analytics, near real-time [Some components are open-source]

What movie should I watch tonight?

Page 18: New Analytical Architectures for Big Data

18

Published Analytics “Read” Performance

App Persistence“State” PersistencePersistence/Analytics

Mass Data StorageBehaviors / “Write” Performance

PersonalizedRecommendations

Personalization,Preferences, State

End-User ExperienceBrowser, Tablet,

Mobile,…Self-Service Application

“Personal Analytics” Data Architecture

Analytics EnginesPluggable

Data Scientists

“New” Data Flow – Detailed View of Components

Social SignalsRSS/Facebook/…

Page 19: New Analytical Architectures for Big Data

SALLY LIKES TACOS

HOW DO WE MODEL THIS DATA?

Let’s get personal…

Page 20: New Analytical Architectures for Big Data

Classic “DW” Data Model

Page 21: New Analytical Architectures for Big Data

OBJECT PREDICATE (Score) SUBJECT

SALLY LIKES (143) TACOS

MARY LIKES (200) TACOS

THE_TACO_SHOP MENU_ITEM TACOS

SALLY LIKES (125) THE_TACO_SHOP

SALLY CITY VENICE BEACH

THE_TACO_SHOP CITY VENICE BEACH

SALLY FRIEND (187) MARY

“Triples” - Directed (Weighted) Acyclic GraphModeling Social Data

Reach and Influence

Collaborative Filtering

Page 22: New Analytical Architectures for Big Data

Analyzing Relationships Reach and Influence

Page 23: New Analytical Architectures for Big Data

How important is Social?

Install ghostery.comShows you who is actively watching you surf the web! Lots of people!!!

Page 24: New Analytical Architectures for Big Data

Signals – The Core of New Data

SocialPersonalContent

Time

Mixture of Proprietary and Public Data

Page 25: New Analytical Architectures for Big Data

26

Published AnalyticsHbase

App PersistenceCassandra, Riak,…Persistence/Analytics

Data-Center or Cloud

Mass Data StorageHadoop

PersonalizedRecommendations

Personalization,Preferences, State

End-User ExperienceBrowser, Tablet,

Mobile,…Self-Service Application

Specialization of Data Technologies

AnalyticsR, Mahout, Pig

The New “Analytical Application” Architecture“New” Data Flow – Specialized Technology Choices

Page 26: New Analytical Architectures for Big Data

p. 27

Published Analytics

HBase

PersistenceRiak

Mass Data StorageBehaviors / “Write” Performance

Hadoop / AWS

Self-Service Application A

Analytics EnginePluggable

Data Scientists

Analytics EnginePluggableAnalytics Engine

Pluggable

Published Analytics

MySQL

PersistenceCassandra

Self-Service Application B

Servicing Multiple Analytical SystemsUsing Shared Analytical Mas- Storage

Page 27: New Analytical Architectures for Big Data

Integrating the Architectures

28

Data WarehouseOLTP to OLAP Mapping

OLAP / Reports

BusinessAnalystSt

agin

g

AppOnly Financial Events ($$$) cross the threshold(and are recorded into) the Data Warehouse

App

App

“Local” Events stay Local (they are analyzed locally)

“Personal” Analytics Stack + Classic “DW” Stack

Not all DATA Belongs in the Data Warehouse!

Page 28: New Analytical Architectures for Big Data

Classic DW New Analytics

Scope Enterprise Application

Analytics Trailing: OLAP Predictive: Machine LearningSentiment Analysis, Recommendations, Personalization, Natural Language Processing, Classification, Clustering, Optimization, Collaborative Filtering,Digital Attribution,…

Actionable? Loosely Coupled Tightly Coupled Analytics Embedded in Application

Data Structures Facts/Dimensions(Requires a DW)

Semantic Data, Graph / Triples, Observations, Direct Signals

Knowledge Expert Business Analyst Data Scientist

Technology Stack Vendor Driven ($$$) Open-Source

Architecture Scale-Up Scale-Out (or in the Cloud)

Classic DW Vs. the New AnalyticsThe Shift from “Business” Analytics to “Personal” Analytics

Page 29: New Analytical Architectures for Big Data

New Signals + New Analytics = New Scenarios

Data

Signals

Social

Location

Personal

Behaviors

Transactions

Content

Time

New Analytics

Recommendations,Natural Language

Processing, Relevancy,

Classification, Optimization, Collaborative

Filtering,Digital

Attribution,…

NewScenariosCustomer

Engagement, Customer Loyalty / Attrition / Retention, Fraud, Risk Analysis,

Intent, Customer Personalization

Page 30: New Analytical Architectures for Big Data

Thank You!

[email protected]: www.the-data-platform.com