datastax geeknet webinar - apache cassandra: enterprise nosql

31
Apache Cassandra Jonathan Ellis, CTO DataStax Dec, 7 2011 Real NoSQL Applications in the Enterprise Today.

Upload: datastax

Post on 26-Jan-2015

126 views

Category:

Technology


3 download

DESCRIPTION

 

TRANSCRIPT

Page 1: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

ApacheCassandra Jonathan Ellis, CTO DataStax

Dec, 7 2011

Real NoSQL Applications in the Enterprise Today.

Page 2: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Today’s Database Challenge

Page 3: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Navigating the NoSQL waters

 Distributed  Horizontally scalable  Eventually consistent  Non-relational

  Column store   Document stores   Key-value   Graph   … and more

Page 4: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Cassandra: the best for “big data”

 Elegant architecture  Operational flexibility   Industry-leading performance

 You should be using Cassandra for applications requiring   high-performance, realtime queries   scalability past one machine   bulletproof reliability

Page 5: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Bigtable, 2006 Dynamo, 2007

OSS, 2008

Incubator, 2009 TLP, 2010 1.0, October 2011

Page 6: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Cassandra Highlights  Multi-master, multi-DC  Linearly scalable  Larger-than-memory datasets  High performance  Full durability   Integrated caching  Tuneable consistency

Page 7: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

A single four-core machine; one million inserts + one million updates

Performance

Page 8: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
Page 9: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

The Cassandra Difference

Scalable Performance

Oracle Exadata ✖ ✔ ✔

MySQL ✖ ✔ ✔

Sharding ✔ ✔ ✖

MongoDB ✔

Operational Ease

Cost Effective

Cassandra ✔ ✔ ✔

HBase ✔ ✖ ✔

And when it comes to Performance, we’re unmatched.

*

*

✖ ✔

Page 10: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

“With Cassandra, we get better business agility, and we don’t have to plan capacity in advance, we don’t need to ask permission of other people to build things for us, and we don’t worry about running out of space or power.”

 Adrian Cockcroft, Cloud Architect

Page 11: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Netflix’s problems  Could not build datacenters fast enough  Made decision to go to cloud (AWS)  Cassandra on AWS is a key infrastructure

component of its globally distributed streaming product.

 Applications include Netflix’s subscriber system, AB testing, and viewing history service (including positions at which members stopped watching a streaming program).

Page 12: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Netflix on Cassandra TODO  Fast  Cheap  Scalable  Flexible  No SPOF

Page 13: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

“Without Cassandra, our engineers would’ve had to create something that could scale to our needs, that would’ve prevented us from focusing on building product and solving problems for Backupify’s users, which are far more important tasks.”

 Matt Conway, VP Engineering

Page 14: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Backupify’s problem  Cloud-based utility that enables

businesses and consumers to backup, search and restore the content of popular online applications such as Google Apps, Gmail, Facebook, Twitter, and Blogger

 Needs:   Horizontal scaling   Ability to handle high write loads   Elasticity with no manual sharding

Page 15: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Backupify on Cassandra  Ease of scale enabled engineers to focus

on building great applications  DataStax OpsCenter made it easy to

monitor the health and perf of their cluster   Reliable, redundant and scalable low-

balance data storage helped eliminate down-time

 Ability to offer both backup and storage, but also analysis of data eventually

Page 16: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

“You can seamlessly add new nodes and expand your total capacity without deteriorating the performance of the data store. Cassandra has allowed us to scale very effectively.”

 Harry Robertson, Tech Lead

Page 17: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Ooyala’s problem  Ooyala provides a suite of technologies

and services that support content owners in managing, analyzing and monetizing the digital video they publish online

 Needs:   Elasticity, to respond to spikes in data scale   Ability to respond to increasingly

sophisticated analytic needs of customers

Page 18: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Ooyala on Cassandra  Classic “Big Data” problem did not require

re-architecting  Application agility was enabled –

developers spend time building cool apps, not figuring out how to scale

 Enabled more powerful and granular analytics to their customers

Page 19: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

“Cassandra has allowed us to build bigger features faster and more reliably, while using less money and without needing to expand our staff.”

 Kyle Ambroff, Sr. Engineer

Page 20: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Formspring’ problem  Users of Formspring engage with and learn

more about each other by asking and responding to questions. With close to 4B responses in the system and 30M unique users, they needed:   To support explosive growth   To seamlessly syndicate user content   To avoid sharding   Application flexiblity

Page 21: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Formspring on Cassandra  No sharding needed – just add nodes to

scale  Performance – the popular users with

many followers saw no speed reduction.  No more memcached!  Flexibility of a schema-optional

architecture is very developer friendly

Page 22: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Why DataStax? DataStax delivers database products and services based on Apache Cassandra from experts who are at the forefront of today's data revolution.

Database Software & Tools Support & Services

  DataStax Enterprise   DataStax Community   DataStax OpsCenter   Drivers & Connectors

  Production Support   Consultative Help   Professional Training   Online Documentation

Page 23: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

DataStax Overview   Founded in April 2010   Commercial leader in Apache Cassandra™, the popular

open-source “big data” database   Headquartered in San Francisco Bay area   100+ customers   35+ employees (split between San Fran and Austin)   Home to Apache Cassandra Chair & most committers   Secured $11M in Series B funding in Sep 2011

Page 24: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

DataStax Value  The simplest way to get started with Apache

Cassandra: DataStax Community Edition  A smart, integrated platform that provides

Analytics and Real-Time capabilities in the same database, without any resource contention: DataStax Enterprise

 The backing of the Cassandra Experts

Page 25: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

100+ customers

Page 26: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

DataStax Enterprise 1.  DataStax Enterprise

Database Server

2.  OpsCenter Enterprise Management solution

3.  Expert production support & consultative services

Page 27: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Enterprise Database Server

  Leverages resources on-premise or in the cloud

  Guarantees uptime with a master-less distributed architecture

  Allows for fast application changes via flexible schemas

  Handles structured, semi-structured, and unstructured data

  Provides advanced security   Eliminates the need for separate analytics

system

1

2 3

4

6 5

Real-Time

Analytics

Repl

icat

ion

Enterprise-class database built to handle today’s big-data needs in a cost-effective, easy, and reliable way.

Page 28: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

OpsCenter Enterprise

  Visual, browser-based user interface

  Administration tasks carried out in point-and-click fashion

  Allows for visual rebalance of data across a cluster when new nodes are added

  Proactive alerts that warn of impending issues

  Built-in external notification abilities

OpsCenter Enterprise supplies management, monitoring, and control over DataStax Enterprise

Page 29: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Expert Production Support DataStax Enterprise includes production support and consultative services from the Cassandra experts.

 Support service level agreements that range from business hours to 24x7x365

 Consultative support for assistance on architecture, design, and tuning

 Certified quarterly service packs

 Hot-fix support

Page 30: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

DataStax Enterprise Compared

Scalable Performance

Oracle Exadata ✖ ✔ ✔

MySQL ✖ ✔ ✔

Sharding ✔ ✔ ✖

MongoDB ✔

Operational Ease

Cost Effective

DataStax Enterprise ✔ ✔ ✔

HBase ✔ ✖ ✔

✖ ✔

Real-Time + Analytics

Oracle NoSQL DB ✔ ✔ ? ✖

Page 31: DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

DataStax – Your One-Stop Shop  DataStax Enterprise and Community Editions  Professional Training, Expert Consulting  Documentation and Dev Center

  http://www.datastax.com/docs   http://www.datastax.com/dev

 Whitepapers, Case Studies, FAQ’s and more   http://www.datastax.com/resources/whitepapers   http://www.datastax.com/resources/casestudies

Thank you!