turning your data lake into measurable business value
DESCRIPTION
7 ingredients to add value to HadoopTRANSCRIPT
Confidential © 2014 Actian Corporation1
Turning Your Big Data Lake Into
Measurable Business Value
Accelerating Big Data 2.0
John Santaferraro, Vice President, Marketing at ActianJim Walker, Director, Product Marketing at Hortonworks
May 20, 2014
Confidential © 2014 Actian Corporation2
Jim WalkerDirector of Product MarketingHortonworks@jaymce
John SantaferraroVP of Solutions & Product MarketingActian@santaferraro
Presenters
Confidential © 2014 Actian Corporation3
“In an on-demand world, consumers will judge brands by their ability to deliver heighted experiences –interactions, literally anywhere – that offer high levels of value and are radically customized and easy to access along the customer decision journey.”
McKinsey Quarterly, 2013
The Marketing Manifesto for the Digital World
Engage Customers
Create Conversations
Transform Brand Experiences
Confidential © 2014 Actian Corporation4
Big Data Creates Significant Opportunities for You
Personalized Experiences• Ad, offer, & content optimization
• Location-based content
• Pricing optimization by segment
Better Decision-Making• True marketing ROI
• Predictive modeling
• Holistic campaign planning
• Profitability by affinity groups
Deep Audience Analysis• Hyper-segmentation
• Full customer 360°understanding
• Buying signal analysis
• Customer lifetime value More Effective Spend• Omni-channel attribution
• Real-time ad & promo optimization
• Media mix optimization
• Brand monitoring
Confidential © 2014 Actian Corporation5
Big Data Creates $Billion Opportunities for You
Capture Share
Higher ReturnsHigher Profits
Risk
14%
Advanced marketing
mix analysis delivers
higher profits
$226B
Retail Purchases
Online
$200B
15-20% improvement in
return on overall
marketing spend 2017
Year when CMO spend
On IT surpasses CIO
Page 6 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Jim Walker
Director, Product Marketing – Hortonworks
Jim is a developer turned marketer with over twenty years experience building
products and developing emerging technologies. During his career, he has brought
multiple products to market in a variety of fields, including data loss prevention,
master data management and now big data. At Hortonworks, Jim is focused on
accelerating the development and adoption of Apache Hadoop.
Confidential © 2014 Actian Corporation7
Transformational ValueData Explosion
?Actian Analytics PlatformTM
Analyze ActConnect
Customer
Delight
Competitive
Advantage
World-Class Risk
Management
Disruptive New
Business Models
Turning Digital Data into Transformational Value
Discovery without limitations
Low latency at any scale
Reactive to predictive
Static to dynamic flow
Segment of 1
Best in Class Usage
Design-time & run-time optimization
Linear parallelism
Rich analytics DNA
Pipeline architecture
Affordable unlimited scale
Best in Class Capabilities
Confidential © 2014 Actian Corporation8
1. Visual Framework for connecting, blending, & enriching data, data science discovery, building and testing predictive models
7 Ingredients Added to Hadoop to Create Value
Connect Blend & Enrich Discover Build & Test Models
Confidential © 2014 Actian Corporation9
1. Visual Framework: connecting, blending, & enriching data, data science discovery, building and testing predictive models
2. 1500 KNIME Operators + R analytics running in parallel on HDFS + Hadoop = The Open Source Trifecta
7 Ingredients Added to Hadoop to Create Value
Gartner Magic
Quadrant for
Advanced Analytics
PlatformsSource: Gartner (February 2014)
Confidential © 2014 Actian Corporation10
Complete End-to-End Analytics on Hadoop
Source: 2013 Rexer Analytics Survey
Confidential © 2014 Actian Corporation11
1. Visual Framework: connecting, blending, & enriching data, data science discovery, building and testing predictive models
2. 1500 KNIME Operators + R analytics running in parallel on HDFS + Hadoop = The Open Source Trifecta
3. High-Performance, YARN-based data processing engine running on HDFS
7 Ingredients Added to Hadoop to Create Value
Confidential © 2014 Actian Corporation12
High Performance, Parallelized Processing on HDFS Without Any Programming
Actian Analytics Platform
Hadoop – Leader Node
Optimized, On-HDFS Processing
Query Pipelining
CPU Pipelining
Reuse and share all
components from
operators to
workflows
Optimize
Choose from five sets
of operators:
Connections
Transformation
Data Quality
Analytics
Data Science
Automatically detect
resources, plan
optimal utilization,
and parallelize all
workloads on Hadoop
Use dual pipeline
parallelism to
accelerate
performance 30X
Run fully optimized
processing directly on
the Hadoop node via
YARN
Take processing to
where the data lives,
runs natively on
Hortonworks
Visual Framework
Manage the entire
analytic process in a
visual framework with
no coding required.
≠ ☼ ≡ ∞ ∆ ∑ √ ≈ ∑ = ? # ~ ‰
Confidential © 2014 Actian Corporation13
1. Visual Framework: connecting, blending, & enriching data, data science discovery, building and testing predictive models
2. 1500 KNIME Operators + R analytics running in parallel on HDFS + Hadoop = The Open Source Trifecta
3. High-Performance, YARN-based data processing engine running on HDFS
4. High-Performance, vector processing engine as the pattern for SQL on Hadoop
7 Ingredients Added to Hadoop to Create Value
Confidential © 2014 Actian Corporation14
The Story of X100 - Vector
Peter Boncz
Performance
X100 Drives Stinger and Impala Aspirations!
Confidential © 2014 Actian Corporation15
1. Visual Framework: connecting, blending, & enriching data, data science discovery, building and testing predictive models
2. 1500 KNIME Operators + R analytics running in parallel on HDFS + Hadoop = The Open Source Trifecta
3. High-Performance, YARN-based data processing engine running on HDFS
4. High-Performance, vector processing engine as the pattern for SQL on Hadoop
5. Extreme-Performance, super-low latency, massively parallel analytics engine
7 Ingredients Added to Hadoop to Create Value
Confidential © 2014 Actian Corporation16
Libraries of Analytics
Ma
ss
ive
ly P
ara
lle
l
Inte
gra
tio
n
Hadoop
Sophisticated,
Low Latency
Analytics in
Database
Connections for Any Data
Actian Analytics PlatformTM
Enterprise Data
Machine Data
Social Data
Business
Processes
Users
Machines
Applications
Data Warehouse
Re
al-T
ime
An
aly
tic S
erv
ices
Visual Framework for Data and Analytic Workflows
SaaS Data
Actian Analytics Platform: Next Generation Big Data Analytics
Amazon
Redshift
High
Performance
Data Science
Natively on
Hadoop
Confidential © 2014 Actian Corporation17
1. Visual Framework: connecting, blending, & enriching data, data science discovery, building and testing predictive models
2. 1500 KNIME Operators + R analytics running in parallel on HDFS + Hadoop= The Open Source Trifecta
3. High-Performance, YARN-based data processing engine running on HDFS
4. High-Performance, vector processing engine as the pattern for SQL on Hadoop
5. Extreme-Performance, super-low latency, massively parallel analytics engine
6. Blueprints to accelerate analytics application development and value creation
7 Ingredients Added to Hadoop to Create Value
Confidential © 2014 Actian Corporation18
Big Data 2.0 Media Mix Modeling Blueprint
IMPACT
FORECAST
ANALYSIS
MARKETING
IMPACT
ANALYSIS
CRMdb
All Relevant
Account Info and
Demographics
CONNECT
BUILD
CUSTOMER
PROFILE
EDWdb
All Relevant Sales
Histories
ANALYZE ACT
MAXIMIZE
REVENUE
FROM
MARKETING
SENTIMENT
AND
CONTENT
ANALYSIS
AGGREGATE
SALES
DATA
Hadoop
Logs
Detailed ePOS
Receipts
SKU LEVEL
SALES DATA
BY GEO
JOIN
DERIVE
AGGREGATE
PREPARE
EDWdb
Marketing Vehicle
Details
CAPTURE
MARKETING
VEHICLES
MARKETING
MIX SALES
CONTRIBUTION
YEARLY CHANGE
REPORT
SALES
VOLUME,
EFFECTIVENES
S, EFFICIENCY
AND ROI
REPORT
NEW MEDIA
MIX
OPTIMIZATION
MINIMIZE
MARKETING
SPEND TO
REVENUE
RATIO
Hadoop
Text Files
Campaign Response
Notes
PREPARE
FOR TEXT
ANALYTICS
CUSTOMER
MATCH
WITH
CAMPAIGNS
VEHICLE
RESULTS
AT GEO,
STORE, SKU
AND
CUSTOMER
LEVEL
Confidential © 2014 Actian Corporation19
1. Visual Framework: connecting, blending, & enriching data, data science discovery, building and testing predictive models
2. 1500 KNIME Operators + R analytics running in parallel on HDFS + Hadoop= The Open Source Trifecta
3. High-Performance, YARN-based data processing engine running on HDFS
4. High-Performance, vector processing engine as the pattern for SQL on Hadoop
5. Extreme-Performance, super-low latency, massively parallel analytics engine
6. Blueprints to accelerate analytics application development and value creation
7. ??????
7 Ingredients Added to Hadoop to Create Value
Confidential © 2014 Actian Corporation20
Stay tuned for the announcement on June 3, 2014 at the Hortonworks Hadoop Summit in San Jose, California!
Join us for the webinar announcing “the seventh ingredient” on Tuesday, June 17, 2014
The Seventh Ingredient Adds Perfection
Confidential © 2014 Actian Corporation21
Libraries of Analytics
Ma
ss
ive
ly P
ara
lle
l
Inte
gra
tio
n
Hadoop
Extreme
Performance,
Low Latency
Analytics in
Database
Connections for Any Data
Actian Analytics PlatformTM
Enterprise Data
Machine Data
Social Data
Business
Processes
Users
Machines
Applications
Data Warehouse
Re
al-T
ime
An
aly
tic S
erv
ices
Visual Framework for Data and Analytic Workflows
SaaS Data
Questions and Answers
Amazon
Redshift
Confidential © 2014 Actian Corporation22
Jim WalkerDirector of Product [email protected]
John SantaferraroVP of Solutions & Product [email protected]
Thank You!