how to build a real time analytics enterprise with open source

12
Building enterprise analy0cs from Open Source Lecole Cole, Founder & CEO 1

Upload: ecobold

Post on 30-Nov-2014

340 views

Category:

Technology


4 download

DESCRIPTION

Given the ease of scaling from zero to millions of users with very little capital investment and availability of managed infrastructure, building startups or enterprise grade products has become easier, faster and cheaper then ever before. We will go over what it would take to build an enterprise big data real time analytics system, with insights onto architecture, open source and closed source available software, support requirements and alternatives around managed services. We will explore the use of open source tools (using cloud providers such as AWS, Google and Microsoft) to build these big data applications. From zero to enterprise will be the theme of this meetup, come join us to learn, explore and contribute.

TRANSCRIPT

Page 1: How to Build a Real Time Analytics Enterprise with Open Source

Building  enterprise  analy0cs  from    Open  Source  

Lecole  Cole,  Founder  &  CEO    1  

Page 2: How to Build a Real Time Analytics Enterprise with Open Source

•  My  name  is  Lecole  Cole  •  Worked  in  data  analysis  for  +15  years  •  TwiEer:  @lecole  •  Email:  [email protected]  •  Company  Skydera  Inc.  •  Projects:  Chartleaf  

(2)  

A little about me.

Page 3: How to Build a Real Time Analytics Enterprise with Open Source

Interes0ng  Packages  

•  Real-­‐0me  Stream  – AWS  Kinesis  – TwiEer  Storm  – Hadoop  v2?  

(#)  

•  Analy0cs  package  – Apache  Mahout  – R-­‐project  – Pandas  (Python)  – pyBrain  (Python)  – Custom  Python  

•  Visualiza0on  – D3.js  – R-­‐project  

Page 4: How to Build a Real Time Analytics Enterprise with Open Source

Interes0ng  Packages  

•  Batch  Processors  – Hadoop  V1  – EMR  (AWS)  

•   NoSQL:  – MongoDB  – DynamoDB  (AWS)  – BigTable  (Google)  – Cassandra  

(#)  

Page 5: How to Build a Real Time Analytics Enterprise with Open Source

Example  Stack  

•  Compute:  – Google  Compute  

•  Database  – MySQL  – BigTable  

•  Analy0cs  – Hadoop      

(#)  

•  Language:  –  Java  applica0on  for  Hadoop  

•  Data  access  – Apache  Pig  – Apache  Hive  

Page 6: How to Build a Real Time Analytics Enterprise with Open Source

Example  Stack  

•  Compute:  – AWS  EC2  

•  Database  – MySQL  – DynamoDB  

•  Data  warehouse  – Redshib  

(#)  

•  Analy0cs  Batch  – EMR    

•  Analy0cs  Real-­‐0me:  – AWS  Kinesis  

Page 7: How to Build a Real Time Analytics Enterprise with Open Source

Screen  Shots  

(#)  

Page 8: How to Build a Real Time Analytics Enterprise with Open Source

Screen  Shots  

(#)  

Page 9: How to Build a Real Time Analytics Enterprise with Open Source

Screen  Shots  

(#)  

Page 10: How to Build a Real Time Analytics Enterprise with Open Source

Screen  Shots  

(#)  

Page 11: How to Build a Real Time Analytics Enterprise with Open Source

Screen  Shots  

(#)  

Page 12: How to Build a Real Time Analytics Enterprise with Open Source

Screen  Shots  

(#)