efficient approaches to high-scale apache hadoop processing cloud computing west - november 2012...

Post on 12-Jan-2016

218 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Efficient Approaches to High-Scale Apache Hadoop ProcessingCloud Computing West - November 2012

11/9/2012

Joey Jablonski

Practice Director, Analytic Services

Analytics | Looking for Actionable Data

Billions of Data

Points to Consider

Actionable Results• Consumer purchasing trends

• Product perception

• Drug Discovery

• Genomics

• Surveillance

• Financial Analysis

2

What our users want

• Delivering…– Content that is meaningful.

– Content that is timely.

– Content that evolves.

3

Customer Centric Decision Making

Improved Content, Directed at Consumers.

4

Data Info Insight Results

Value is Added Decisions are Made

Enabling Adoption

The Cloud

Em

pow

ered

Use

rs

Aw

are

Use

rs

Ena

bled

Use

rs5

6

DDN | The Complete Big Data Platform

Process

Ingest Distribute

Store

► Unleashing Data Access to Accelerate Insight► Minimizing Infrastructure, Management and Data Center TCO

Operations Lifecycle

Deploy

Manage

MonitorRespond

Upgrade

Software Platform Hardware Platform7

1 2 3 4 5 6 7 80

500

1000

1500

2000

2500

3000

3500

4000

4500

An Appliance-based approach to Apache Hadoop

8

Shared, Big-Data Storage with High Performance Networking Makes Hadoop Clusters More Efficient!

• 100% Storage Management Offload• End-End InfiniBand Networking with

RDMA Acceleration• Real-Time Data Delivery to Provide

MapReduce Process Consistency• Smaller Compute, Compact Storage

to Minimize Data Center ImpactReduce Compute Cluster Size by 40%

Reduce Disk Population by 60%Reduce Data Center Footprint by 75%Increase Responsiveness by 100%

DataDirect Networks, Information in Motion, Silicon Storage Appliance, S2A, Storage Fusion Architecture, SFA, Storage Fusion Fabric, Web Object Scaler, WOS, EXAScaler, GRIDScaler, xSTREAMScaler, NAS Scaler, ReAct, ObjectAssure, In-Storage Processing and SATAssure are all trademarks of DataDirect Networks. Any unauthorized use is prohibited.

Q & A

9

Who am I?

• Practice Director, Analytic Services at DataDirect Networks, Inc.

• 3+ years with Hadoop, 12+ with HPC• Contact Details

– @jrjablo– jjablonski@ddn.com/jrjablo@gmail.com– www.linkedin.com/in/joeyjablonski

10

top related