why hadoop matters? - sas · cleanse and load data into hadoop and in-memory without writing code....

18
Copyright © 2014, SAS Institute Inc. All rights reserved. WHY HADOOP MATTERS? FEBRUARY 2016

Upload: others

Post on 24-May-2020

10 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

WHY HADOOP MATTERS?

FEBRUARY 2016

Page 2: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

AN ERA OF

ABUNDANCEWHERE WE ARE NOW

2005 2007 2009 2011 2013

ANALYTICSBIG DATA HADOOP

Lots of data Processing

Power

Accurate

Decisions

Page 3: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

SAS DATA

MANAGEMENTAGENDA

Market Drivers and Customers Challenges

Solution Overview

Benefits and Use Cases

Case Studies

Page 4: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2012, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

MARKET DRİVERS AND CUSTOMER CHALLENGES

Page 5: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

MARKET DRİVERS

AND CHALLENGESBİG DATA CHALLENGES

Source: Gartner (Sep 2014), Big Data Investment Grows but Deployments Remain Scarce in 2014 By Nick Heudecker, Lisa Kart

Page 6: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

EARLY USE CASES…SHOW ME THE MONEY

Dynamic Pricing

Page 7: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

Hadoop will soon become a replacement complement to:

Business Intelligence;

Data Warehousing;

Data Integration;

Analytics.

SOURCE: Integrating Hadoop with BI and DW - TDWI Best Practices Report

WHY HADOOP?

HADOOP IN PRODUCTION:

10%

28%

13%10%

12%

27%

YES

< 12

MONTHS

< 24

MONTHS

< 36

MONTHS

3+

YEARS

NEVER

Page 8: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

TWO STARTİNG

POİNTS

NOT MUTUALLY EXCLUSİVE… BUT OFTEN NOT SEEN TOGETHER!

Hadoop as a Data Platform(standalone or as part of a broader ecosystem)

Hadoop as a component of the next

generation of Business Analytics

.. to support innovative use cases.. to support an IT Transformation

TEXT

MANAGE

DATA

EX

PL

OR

E

DA

TA

DEVELOP

MODELS

DE

PL

OY

&

MO

NIT

OR

Page 9: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

SAS AND HADOOP OUR GOAL İS CLEAR

• Analytic workload of choice in Hadoop

• Data Integration toolset of choice for Hadoop

• Data exploration and reporting product of choice for

Hadoop

Page 10: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

SAS FOR HADOOP ENABLİNG THE DATA TO DECİSİON LİFECYCLE

Access & Manage DataAdvanced data management

capabilities (ELT, ETL, DQ,

virtualization) enabled for Hadoop

Interactively Explore & VisualizeQuickly Visualize Data in Hadoop, Discover New

Patterns, Publish Reports Via Web Reports, Mobile

Devices, MS Office Apps

Analyze & Model Uncover Patterns and trends in Hadoop data.

Interactive and visual environment for analytics.

Apply Domain specific high-performance analytics

Deploy & IntegrateAutomatically deploy and score analytic

models in the parallel environment.

Manage & analyze real time data

Page 11: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

THE KEY

CHALLENGE

CLOSİNG THE GAPS İN THE DATA TO DECİSİON

LİFECYCLE

BUSINESS

MANAGER

TIME TO DECISION

IT SYSTEMS /

MANAGEMENT

DATA SCIENTIST

/ STATISTICIAN

BUSINESS

ANALYST

VALUE CAPTURED

Page 12: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

SAS DATA

MANAGEMENTANALYSTS TAKE

Recommendation

“Use self-service interactive data preparation tools to enhance analyst productivity.” and

“improve the quality of data”

– Gartner, “Data Preparation Is Not an Afterthought”

Page 13: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2012, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

SAS DATA LOADER FOR HADOOP

Page 14: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

SAS DATA LOADER

FOR HADOOPNEW OFFERİNG!

Self-service big

data preparation

for business users

Certified by Hortonworks and Cloudera

Page 15: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

SAS DATA LOADER

FOR HADOOPCAPABİLİTİES

Simple Point-and-Click UI

• Web-based and wizard-driven

• Stop and start directives, check status and view logs.

• Run, view or edit saved directives for reuse.

Secure Big Data Access

• Secure access to Kerberos-enabled Hadoop clusters.

• Copy relational databases and SAS datasets to and from Hadoop

Load, Join and Transform Data

• Query a table or join multiple tables without knowing SQL.

• Filter and summarize rows

• Transpose and group selected columns.

Profile and Cleanse Data

• Standardize, de-duplicate, match, parse and run other DQ functions.

• Profile data to determine uniqueness, incompleteness, patterns and more.

• Query, sort, or de-duplicate data

Lift into memory for Visualization and Analytics

• Load data to SAS®

LASR™ Analytic Server

• Run a SAS® program

Page 16: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

SAS DATA LOADER

FOR HADOOPBENEFİTS

• Can be used with minimal training by business users to profile, transform, cleanse and load data into Hadoop and in-memory without writing code.

Business Users: Self-Service Big Data Access and Preparation

• The intuitive wizard-driven user interface reduces the pain and costs with finding, retaining and training talent with the specialized skills required to manage big data on Hadoop. Data provisioning tasks are shared with business users.

IT: Lower TCO, Productivity and Reduced Training

• SAS code and data quality functions are run in-cluster for improved performance. This reduces data movement for improved governance and security of trusted data.

Data Scientists/SAS Coders: Performance and Governance

Page 17: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .

SAS DATA LOADER

FOR HADOOPFOR MORE INFORMATION

• Download the Free Trial of SAS Data Loader for

Hadoop at:

• http://sas.com/dataloader

• Learn more about SAS and Hadoop:

• http://sas.com/hadoop

• Big Data Matters Webinar Series:

www.SAS.com/bigdatamatters

• Follow us on Twitter: @sasdatamgmt

• Like us on Facebook: SAS Software

DOWNLOAD THE

FREE TRIAL!

Page 18: WHY HADOOP MATTERS? - SAS · cleanse and load data into Hadoop and in-memory without writing code. Business Users: Self-Service Big Data Access and Preparation •The intuitive wizard-driven

Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .sas.com

Q&A