integrating with hadoop: couchbase connect 2014

30
Integrating with Hadoop: Import & Export Data with the Couchbase Sqoop Plugin Shane Johnson | Product Marketing, Couchbase Matt Ingenthron | Senior Director, Couchbase

Upload: couchbase

Post on 20-Aug-2015

1.047 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Integrating with Hadoop: Couchbase Connect 2014

Integrating with Hadoop: Import & Export Data with the Couchbase Sqoop Plugin

Shane Johnson | Product Marketing, Couchbase

Matt Ingenthron | Senior Director, Couchbase

Page 2: Integrating with Hadoop: Couchbase Connect 2014

©2014 Couchbase, Inc. 2

Who has big data?

Big Data

Page 3: Integrating with Hadoop: Couchbase Connect 2014

©2014 Couchbase, Inc. 3

What is it?

Big Data

Page 4: Integrating with Hadoop: Couchbase Connect 2014

VS.

VOLUME VELOCITY

VS.

OPERATIONAL ANALYTICAL

Page 5: Integrating with Hadoop: Couchbase Connect 2014

©2014 Couchbase, Inc. 5

Where is it?

Big Data

Page 6: Integrating with Hadoop: Couchbase Connect 2014
Page 7: Integrating with Hadoop: Couchbase Connect 2014

©2014 Couchbase, Inc. 7

Where is it coming from?

Big Data

Page 8: Integrating with Hadoop: Couchbase Connect 2014

CONSUMERS

Page 9: Integrating with Hadoop: Couchbase Connect 2014

ENERGY

Page 10: Integrating with Hadoop: Couchbase Connect 2014

TRANSPORTATION

Page 11: Integrating with Hadoop: Couchbase Connect 2014

DATAUSERSENSORGPSTELEMETRYACTIVITYEVENT

Page 12: Integrating with Hadoop: Couchbase Connect 2014

MORE MACHINESDATAUSERS

Page 13: Integrating with Hadoop: Couchbase Connect 2014

CONTINUOUSSTREAMOFDATA

Page 14: Integrating with Hadoop: Couchbase Connect 2014

14

Why Hadoop?

Big Data

Page 15: Integrating with Hadoop: Couchbase Connect 2014

PBs OF DATA

1000s OF SERVER

S

MAP REDUCE

Page 16: Integrating with Hadoop: Couchbase Connect 2014

16

How?

Big Data

Page 17: Integrating with Hadoop: Couchbase Connect 2014

HADOOPSTORMCOUCHBASEKAFKA

Page 18: Integrating with Hadoop: Couchbase Connect 2014

COMPLEX EVENT PROCESSING

RTREPOSITORY

PERPETUALSTORE

ANALYTICALDB

BUSINESSINTELLIGENCE

MONITORING

CHAT/VOICESYSTEM

BATCHTRACK

REAL-TIMETRACK

DASHBOARD

Page 19: Integrating with Hadoop: Couchbase Connect 2014

COLLECTION

ANALYSIS ANDVISUALIZATION

REST FILTER METRICS

Page 20: Integrating with Hadoop: Couchbase Connect 2014

20

What should I remember?

Big Data

Page 21: Integrating with Hadoop: Couchbase Connect 2014

NOKETCHUP!

Page 22: Integrating with Hadoop: Couchbase Connect 2014

Couchbase Hadoop Connector

Page 23: Integrating with Hadoop: Couchbase Connect 2014

©2014 Couchbase, Inc. 23

Sqoop == SQL to Hadoop

About Sqoop

Couchbase had the first certified sqoop connector back in 2012

But, it’s NoSQL?

• Sqoop runs by automatically generating a Hadoop M-R job that runs across the cluster. Input arguments generate code that is turned into a job and executed. In parallel.

Page 24: Integrating with Hadoop: Couchbase Connect 2014

Common Patterns

Page 25: Integrating with Hadoop: Couchbase Connect 2014

Example: Ad or Recommendation Targeting

Page 26: Integrating with Hadoop: Couchbase Connect 2014

©2014 Couchbase, Inc. 26

Example: Content Driven Site

Page 27: Integrating with Hadoop: Couchbase Connect 2014

©2014 Couchbase, Inc. 27

About the demonstration…• Couchbase Server 3.0 running locally on Mac OS X• Cloudera Quick Start VM as Hadoop Cluster

• Workload running against cluster• Start a sqoop job to “import” from couchbase

Demo

Page 28: Integrating with Hadoop: Couchbase Connect 2014

Alternatives

Page 29: Integrating with Hadoop: Couchbase Connect 2014

A similar bridge between Couchbase and Hadoop.

Couchdoop from Avira

• Uses Couchbase Views as a way to identify items for import from Couchbase to Hadoop.• Supports both import and

export--couchbase-view-keys ‘[“click”, “2014-09-16”]; [“click”, “2014-09-17”]’

• Allows for ranges on views

• Used in Avira’s Offers as part of their Antivirus Suite

Screenshot courtesy http://www.avira.com/en/avira-offers

Page 30: Integrating with Hadoop: Couchbase Connect 2014

Thanks! Q&A?

Matt Ingenthron – @ingenthr – [email protected]

Shane Johnson – @shane_dev – [email protected]

©2014 Couchbase, Inc. 30