datameer6 for prospects - june 2016_v2

34
© 2016 Datameer, Inc. All rights reserved. John Morrell, Senior Director of Product Marketing Sean Anderson, Senior Product Marketing Manager Datameer 6 - The Modern BI Platform for Your Big Data Journey © 2016 Datameer, Inc. All rights reserved.

Upload: datameer

Post on 18-Jan-2017

281 views

Category:

Software


0 download

TRANSCRIPT

Page 1: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

John Morrell, Senior Director of Product MarketingSean Anderson, Senior Product Marketing Manager

Datameer 6 - The Modern BI Platform for Your Big Data Journey

© 2016 Datameer, Inc. All rights reserved.

Page 2: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

About the Speakers

John MorrellSenior DirectorProduct Marketing

Sean AndersonSenior Product Marketing Manager

Page 3: Datameer6 for prospects - june 2016_v2

© Cloudera, Inc. All rights reserved. 3

Spark will replace MapReduceTo become the standard execution engine for Hadoop

Page 4: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Data Processing

Data may require unique processing characteristics▪ Batch▪ Streaming▪ Real-time

Hadoop arose to address one and now the ecosystem is answering the rest.▪ “We’re doubling down on Spark. We invested earliest,

and we’ve invested most, in making Hadoop enterprise-grade” Doug Cutting

Data ProcessingLeverage the right processing for your job

Page 5: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Powerful Data Processing - The Most Apache Spark Experience

5

STRUCTUREDSqoop

UNSTRUCTUREDKafka, Flume

PROCESS, ANALYZE, SERVE

UNIFIED SERVICES

RESOURCE MANAGEMENTYARN

SECURITYSentry, RecordService

FILESYSTEMHDFS

RELATIONALKudu

NoSQLHBase

STORE

INTEGRATE

BATCHSpark, Hive,

Pig MapReduce

STREAMSpark

SQLImpala

SEARCHSolr

SDKKite

Spark: In-memory data processing for developers and data scientists• Easy development• Flexible, extensible API• Fast batch and stream processing

Cloudera: Most experience with Spark on Hadoop for instant success • First to ship and support• Most Spark users trained• Most customers running Spark• Most engineering resources (committers, contributors, support)• Only vendor focused on enterprise Spark

Page 6: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Apache SparkCloudera was the first Hadoop vendor to ship and support Spark

• Spark is a fully integrated part of Cloudera’s platform• Shared data, metadata, resource management, administration, security,

and governance• Complements specialized analytic tools for comprehensive big data platform

• Cloudera is the first Hadoop vendor to offer Spark training• Trained more customers than any other vendor• Most popular training course

• Cloudera has 5x the engineering resources of the next competitor• Most committers on staff and most changes contributed• Well-trained staff across the globe with expertise implementing a broad range

of Spark use cases

Page 7: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Cloudera’s Engineering Commitment to Spark

7

Page 8: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

The Spark Ecosystem & Hadoop

8

STRUCTUREDSqoop

UNSTRUCTUREDKafka, Flume

UNIFIED SERVICES

RESOURCE MANAGEMENTYARN

SECURITYSentry, RecordService

FILESYSTEMHDFS

RELATIONALKudu

NoSQLHBase

STORE

INTEGRATE

SQLImpala

SEARCHSolr

SDKKite

BATCH & STREAMSpark

Spark Streaming Spark SQL DataFrames MLlib …

Page 9: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Uniting Spark and Hadoop - The One Platform Initiative Investment Areas

9

ManagementLeverage Hadoop-nativeresource management.

SecurityFull support for Hadoop security

and beyond.

ScaleEnable 10k-node clusters.

StreamingSupport for 80% of common stream

processing workloads.

Page 10: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Community Initiative: Spark Supersedes MapReduce

10

Page 11: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Key Cloudera Contributions

11

• Spark-on-YARN integration• Dynamic Resource

Allocation• Kafka Integration• HBase Integration• Fixed operational issues at

scale

Integration with Hadoop Ecosystem Production-Ready Features Ongoing Initiatives

• Security• Kerberos Integration• HDFS Sync (Sentry)

• Governance• Cloudera Navigator integration

(audit & lineage)• Monitoring/

Troubleshooting• Improved debugging

• Zero Data Loss• Spark Streaming Resilience

• Standard Execution Engine• Hive on Spark • Pig on Spark• Crunch on Spark• Solr indexing on Spark

Page 12: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Cloudera Customers

12

• More customers running Spark than all other vendors combined• Over 170 customers• Spark clusters as large as 800 nodes• Diverse range of use cases across multiple industries

• Search personalization• Genomics research• Insurance modeling• Advertising optimization• Predictive modeling of disease conditions

Page 13: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Cloudera Enterprise, A New Way Forward

13

Page 14: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Key Cloudera Contributions

14

Download or Deploy in the Cloud

Signup for Training Contact us or a Partner to Start a POC

Getting Started is Easy

Page 15: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Datameer 6:The Modern BI Platform for Your Big Data Journey

Page 16: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Big Data Journey Challenges

Meeting Demand

(Productivity)

Putting Your

Insights to Work

Answering New

Questions

Using More of Your Data

Skill Gaps

Page 17: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Deep New-Age Questions

• What journey do customers take to purchase products?• What actions do customers take before they churn?• What attributes do customers with similar buying behavior have in

common?• Why do certain assets have a large impact on our overall risk? • What series of events occur before equipment fails?• Where are my network bottlenecks and how does this impact service?

Page 18: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

More Data Sources

Business Data

Digital Interactions

Machine Data

Marketing Call Center WebSocial Media

Devices IT Sensors Security Network

CRM Sales Financial

Page 19: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Datameer: Fastest Time to Insight

Time

Com

plex

ity

4 weeks 8 weeks 12 weeks 18 months

$$$

$$

$

Enterprise Data WarehouseETL Data Warehouse BI

UseCase#2

UseCase#3

UseCase#4

UseCase#5

Datameer Use Case #1

Use Case #1

UseCase#6

UseCase#7

UseCase#8

UseCase#9

6 weeks 10 weeks

Integrate Analyze Visualize

Use Case #2 Use Case #3

Custom Big DataFlume Hive Pig Sqoop Raw Data Hadoop

NO CODIN

G

Page 20: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

End-to-end Modern BI Platform

Integrate

• 70+ Connectors• Wizard-led• Unstructured data• High Performance

Prepare/Analyze

• 270+ Functions• Instant Profiling• Familiar Spreadsheet

UI• Advanced analytics• Smart Data

Discovery

Visualize

• 30+ Widgets• Infographics• HTML 5

Operationalize

• Security• Governance• Process integration

Page 21: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Agile Self-Service Analytics without Chaos

Spreadsheet Collaborative Governance & ControlDrag-n-Drop

Page 22: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Advanced Analytics & Smart Data Discovery

Clustering Decision Trees Dependencies Recommendations

Time Series Analytics Graph & Path Analytics Text Analytics

Page 23: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Augment Existing Analytics

Visualization & Exploration

Traditional BI

Page 24: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Enterprise Ready

Integrate, Don’t Replace

Intelligent Execution Framework

Flexible Deployment Options

Page 25: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

What’s New in Datameer 6?

Make Big Data Simple for Everyone

Speed-to-InsightEase-of-UseRe-imagined User Experience & Workflow Spark

Enable Citizen Data Scientists Abstract Complexity

Page 26: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Smart Execution with Spark

Page 27: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Differences in Spark Implementations

BI on Spark Spark Cluster

Embedded Spark

Smart Execution

Spark• Limited to

structured view of data

• Like using Hive or Impala in Hadoop

• Fully programmatic approach

• Need specialized skills $$

• May force you to code at points

• No future-proof story

• Limited execution frameworks

• Eliminates technical complexities

• Don’t need specialized skills

• Optimizes execution/performance

Page 28: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

New Backend Benefits

Future-proofFastest

processing, every time

Concentrate on analytics, not

backend

Abstract Complexity

Page 29: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Why does UI and Workflow Matter?“That is the way to learn the most, that when you are doing something with such enjoyment that you don’t notice the time passes.”

Page 30: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

The Evolution of Analytic Workflow

Integrate Transform Analyze Visualize

1st Generation:Multiple Steps

2nd Generation:Self-Service

3rd Generation:Iterative/Fluid

Page 31: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Datameer 6 User Experience

Parallel WorkflowImmediate insight into downstream

effects

Fresh, ModernUI/UX

Significant Time Savings Increased Productivity

Page 32: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Demonstration

Page 33: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Learn Morehttp://www.datameer.com/product/datameer-6/

Page 34: Datameer6 for prospects - june 2016_v2

© 2016 Datameer, Inc. All rights reserved.

Thank You!