Hadoop Essential Setup: A big data proposal

Upload: omid-mogharian

Post on 15-Apr-2017

Category: Data & Analytics


TRANSCRIPT

Page 1: Hadoop essential setup

Hadoop Essential Setup: A big data proposal

Page 2: Hadoop essential setup

Who is this weirdo?

Hiking, Table Tennis, Kicker, Traveling, Foods, Cultures, The Big Bang Theory, Family Guy, Sherlock, …

Page 3: Hadoop essential setup

What's Big Data Really?

Page 4: Hadoop essential setup
Page 5: Hadoop essential setup
Page 6: Hadoop essential setup

There are only two hard things in Computer Science: cache invalidation and naming things.

Phil Karlton

Page 7: Hadoop essential setup

Hadoop?

From Wikipedia

The genesis of Hadoop came from the Google File System paper ..... This paper spawned another research paper from Google – MapReduce: …. in January 2006. Doug Cutting, who was working at Yahoo! at the time, named it after his son's toy elephant.

Page 8: Hadoop essential setup

Map/Reduce

Shuffle (Transfer & Merge)
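
To make the map, shuffle, and reduce steps concrete, here is a minimal single-process sketch in Python using word count as the example. It is not Hadoop code: the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample input are assumptions made for illustration, and the in-memory grouping only stands in for the transfer and merge that a real cluster performs between nodes.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group values by key and sort the keys.

    This in-memory grouping is a stand-in for the transfer and merge
    that happens between mappers and reducers on a cluster.
    """
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(sorted(grouped.items()))


def reduce_phase(key, values):
    """Reduce: collapse all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in order over an iterable of text lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical sample input, chosen only for illustration.
counts = word_count(["big data big cluster", "data node"])
print(counts)
```

Each mapper sees only its own line, and each reducer sees only the values for one key, which is what lets the phases run in parallel across machines.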

Page 9: Hadoop essential setup

HDFS

Page 10: Hadoop essential setup

By the way: Hadoop 2.0

Page 11: Hadoop essential setup

Hadoop Ecosystem

Page 12: Hadoop essential setup

ETL or ELT?

[Diagram: Source and Target boxes]
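
The difference between the two orderings can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `transform` step, the `name` field, and the list-based "target" are hypothetical stand-ins, not part of the original slides or of any specific tool.

```python
def transform(record):
    """Hypothetical cleanup step: trim and upper-case the name field."""
    return {"name": record["name"].strip().upper()}


def etl(source):
    """ETL: transform each record first, then load the clean result."""
    target = []
    for record in source:                  # extract
        target.append(transform(record))   # transform, then load
    return target


def elt(source):
    """ELT: load the raw records first, then transform inside the target."""
    target_raw = list(source)                                   # extract + load
    target_view = [transform(record) for record in target_raw]  # transform later
    return target_raw, target_view


# Hypothetical sample records, chosen only for illustration.
source = [{"name": " alice "}, {"name": "bob"}]
clean = etl(source)
raw, view = elt(source)
print(clean)
print(raw, view)
```

Both orderings end with the same clean records. The ELT version also keeps the raw data in the target, so it can be transformed again later in a different way.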

Page 13: Hadoop essential setup

Essential Setup

[Diagram: Flume Agents, HDFS, Pig, Hive, Spark, HBase]

Page 14: Hadoop essential setup

Essential Setup

Page 15: Hadoop essential setup

Essential Setup

Page 16: Hadoop essential setup

Fast Data & Big Data

Lambda Architecture
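
As a rough sketch of the idea, a Lambda Architecture answers a query by combining a batch view, precomputed over the full historical data, with a speed view that covers only the events that arrived since the last batch run. The Python below is a toy illustration under that assumption; the page-view events, the function names, and the use of in-memory counters are all hypothetical.

```python
from collections import Counter


def batch_view(master_dataset):
    """Batch layer: recompute page-view counts from all historical events."""
    return Counter(event["page"] for event in master_dataset)


def speed_view(recent_events):
    """Speed layer: count only events not yet covered by the batch view."""
    return Counter(event["page"] for event in recent_events)


def serve(batch, speed, page):
    """Serving layer: merge both views to answer a query for one page."""
    return batch[page] + speed[page]


# Hypothetical events, chosen only for illustration.
history = [{"page": "/home"}, {"page": "/home"}, {"page": "/about"}]
recent = [{"page": "/home"}, {"page": "/pricing"}]

batch = batch_view(history)
speed = speed_view(recent)
print(serve(batch, speed, "/home"))
```

The batch view is slow to build but complete, and the speed view is fast but covers only recent data, so the merged answer is both current and complete.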

Page 17: Hadoop essential setup

Fast Data & Big Data

Page 18: Hadoop essential setup

Now it's your turn. What's your idea?