tut15981: deploying an elastic hadoop cluster - … · tut15981: deploying an elastic hadoop...

22
TUT15981: Deploying an elastic Hadoop cluster Alejandro Bonilla Sales Engineer [email protected]

Upload: doque

Post on 15-May-2018

234 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

TUT15981: Deploying an elastic Hadoop cluster

Alejandro BonillaSales Engineer

[email protected]

Page 2: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

2

Agenda

• Hadoop Overview

• SUSE OpenStack Cloud

• Manual Deployment

• Orchestration‒ Generic workload autoscaling

• Sahara‒ Dedicated for Hadoop Ecosystem

• QA

Page 3: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

HadoopWhat is it and how does it look like?

Page 4: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

4

OverviewHadoop

• Framework that allows for the distributed processing of large data sets across clusters of nodes

• The NameNode stores all metadata

• Datanodes carry the Data blocks and execute jobs locally.

• Store data that is relevant in bulk.

• Applications are adjusted to better mine unstructured data.

• Analyze marketing campaigns, fuel efficiency, weather, sequences and predictions.

Page 5: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

5

HadoopWhat can I do?

Leverage Data – Warm and Cold

• Attitudinal, Behavioral and Interactive (data)

For

• Account Intelligence

• Fraud Management

• Predictive Maintenance

• Customer Value, Sentiment, Social - Intelligence

• Responsive Manufacturing

• Risk Analysis

Page 6: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

6

Page 7: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

7

Page 8: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

SUSE OpenStack Cloud

Page 9: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

9

SUSE OpenStack Cloud

Enterprise OpenStack distribution that rapidly deploys and easily manages highly available, mixed hypervisor IaaS Clouds

• Increase business agility

• Economically scale IT capabilities

• Easily deliver future innovations

Page 10: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

10

OpenStack Distribution

Billing VM Mgmt Image Tool Portal App Monitor Sec & Perf

Cloud

Management

Orchestration(Heat)

Dashboard(Horizon)

Cloud APIs(OpenStack and

EC2)

Identity(Keystone)

Images(Glance)

Hypervisor

Compute(Nova)

Operating System

Physical Infrastructure: x86-64, Switches, Storage

OpenStack Juno

Object(Swift)

Network(Neutron)

Adapters

Telemetry(Ceilometer)

Block(Cinder)

SUSE Cloud Adds

RequiredServicesRabbitMQPostgresql

Ins

tall

Fra

me

wo

rk(C

row

ba

r, C

he

f, T

FT

P, D

NS

, D

HC

P)

SUSEManager

SUSEStudio

HypervisorXen, KVM

SUSE Linux Enterprise Server 11 SP3

SUSE Product

Rados

RBD

RadosGW

Ceph

Adapters

Highly Available Services

OverviewSUSE® OpenStack Cloud 5

Physical Infrastructure: x86-64, Switches, Storage

Billing Portal App Monitor Sec & Perf

Adapters Adapters VMware, Hyper-V

Partner Solutions

Page 11: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

11

OverviewWhy on OpenStack?

• Complex Event Processing, On Demand

• Advanced Resource Management, allocation.

• Flexibility, and ease on Deployment – Virtual or Physical.

• Easy for App Dev testing.

• KVM proves higher performance as it matures.

• After hours analytics of locally stored data / Resource re-utilization.

• Growing Community projects based on OpenStack.

Page 12: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

Manual Deployment Method

Page 13: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

13

Manual Deployment

• Requires a base image with Hadoop already pre-configured.

• Static Setup

• Appropriate for a mature Application Cycle stage.

• May require the Management framework “secret sauce” of Hadoop vendors like Cloudera, Hortonworks and MapR.

• Basic flexibility.

Page 14: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

Orchestration – Heat Method

Page 15: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

15

OrchestrationHeat

• Implements an orchestration engine to launch multiple composite cloud applications based on templates

• Native Resource Management - servers, floating ips, volumes, security groups, users

• Provides an autoscaling service that integrates with Ceilometer

• Requires a base image with the Hadoop components

• Actions can be triggered based on resource utilization, including auto scale-out

Page 16: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

The Sahara Way

Page 17: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

17

Elastic Data Processing facility - EDP

• Sahara’s Elastic Data Processing facility allows the execution of jobs on clusters created from Sahara. EDP supports:

• Hive, Pig, MapReduce, MapReduce.Streaming and Java job types on Hadoop clusters

• Spark jobs on Spark clusters

• Access and Storage of job binaries/output in Swift or Sahara’s own database

• Configuration of jobs at submission time

• Execution of jobs on existing clusters

Page 18: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

18

Sahara Architecture

Page 19: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

Thank you.

19

Demo!

Page 20: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

20

Page 21: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

Corporate HeadquartersMaxfeldstrasse 590409 NurembergGermany

+49 911 740 53 0 (Worldwide)www.suse.com

Join us on:www.opensuse.org

21

Page 22: TUT15981: Deploying an elastic Hadoop cluster - … · TUT15981: Deploying an elastic Hadoop cluster ... •Hadoop Overview •SUSE OpenStack Cloud •Manual Deployment ... What is

Unpublished Work of SUSE LLC. All Rights Reserved.This work is an unpublished work and contains confidential, proprietary and trade secret information of SUSE LLC. Access to this work is restricted to SUSE employees who have a need to know to perform tasks within the scope of their assignments. No part of this work may be practiced, performed, copied, distributed, revised, modified, translated, abridged, condensed, expanded, collected, or adapted without the prior written consent of SUSE. Any use or exploitation of this work without authorization could subject the perpetrator to criminal and civil liability.

General DisclaimerThis document is not to be construed as a promise by any participating company to develop, deliver, or market a product. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. SUSE makes no representations or warranties with respect to the contents of this document, and specifically disclaims any express or implied warranties of merchantability or fitness for any particular purpose. The development, release, and timing of features or functionality described for SUSE products remains at the sole discretion of SUSE. Further, SUSE reserves the right to revise this document and to make changes to its content, at any time, without obligation to notify any person or entity of such revisions or changes. All SUSE marks referenced in this presentation are trademarks or registered trademarks of Novell, Inc. in the United States and other countries. All third-party trademarks are the property of their respective owners.