gridgain vs hadoop - qcon london 2013 conference

12
GridGain vs Hadoop Why Elephants Can’t Fly GridGain System 1065 East Hillsdale Boulevard Suite 230 Foster City, CA 94404 www.gridgain.com

Upload: others

Post on 12-Sep-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: GridGain vs Hadoop - QCon London 2013 Conference

GridGain vs HadoopWhy Elephants Can’t Fly

GridGain System1065 East Hillsdale Boulevard

Suite 230 Foster City, CA 94404

www.gridgain.com

Page 2: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide

GridGain In A Glance

2

GridGain is Java based open source middleware for transactional real time big data processing that scales up from one server to thousands of machines.

Unlike complex, decade-old Hadoop MapReduce systems which use stale data for batch offline analytics, our platform allows companies to harness live data for smarter, faster real time processing.

Page 3: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide

GridGain History

> GridGain  Systems  founded  in  2005> VC  funded> Headquarter  in  Foster  City,  California,  USA> 12  product  releases:

> GridGain 1.x, Jul 2007> GridGain 2.x, Feb 2008> GridGain 3.0, Aug 2010

> Current  release  is  GridGain  3.6

3

Page 4: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide

GridGain Facts

Over  8,000,000  starts  worldwide1000  unique  IP/month400  acIve  projects/month4000  forum  views/month

GridGain  starts  every  10  seconds  around  the  globe

Page 5: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide

GridGain Users

GridGain Partners

Page 6: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide

GridGain Technology

> Fully integrated cloud middleware: Compute Grid + Data Grid> Real Time Transactional Big Data> Zero Deployment

> Two editions:> Community Edition: License: GPLv3 + Basic Features> Enterprise Edition: Commercial License + Enterprise Features

> Language support:> Java 1.6> Scala 2.9.1

6

Page 7: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide

GridGain - Compute Grid

> Direct support for MapReduce> Auto discovery> Checkpoints for long running tasks > Load Balancing> Affinity co-location with data grids > Automatic fault tolerance

7

Page 8: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide

GridGain - Data Grid

8

> Replication & Partitioning

> Pessimistic & Optimistic Tx

> Read-Through and Write-Through

> Pluggable data overflow storage

> Distributed Queries

> Distributed Queues and Latches

> Distributes Java Atomics

Page 9: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide

Hadoop Processing

9

> Very large data sets, BUT...

> Not Real Time

> Mandatory data snapshots

> HDFS instead of live databases

> Analytics based on offline data

Page 10: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide

GridGain Processing

10

> Large data sets

> Near Real Time Processing

> Online databases

> In-memory data caching

> Co-location of analytics and data

> Business analytics on Live Data

Page 11: GridGain vs Hadoop - QCon London 2013 Conference

Cloud  CompuIng  with  Scala  and  GridGain Slide  

Live Coding - Real Time Word Count

> Real time uploading of books into Cache> Real time updates of word counts> Real time SQL queries for popular words> Real time print-outs of most popular words> ... using Scala & GridGain

11

Page 12: GridGain vs Hadoop - QCon London 2013 Conference

GridGain Real Time Big Data Slide 12

Thank You!