hbasecon 2012 | overcoming data deluge with hbase to help save the environment - opower

27
HBase to Save the Planet Alex Newman [email protected] Architect, Drawn to Scale Strategic Advisor, Opower

Upload: cloudera-inc

Post on 18-Jun-2015

1.115 views

Category:

Technology


0 download

DESCRIPTION

Opower is a fast moving energy management SaaS company that collects sensor data from nearly all of the major utilities in the United States–meaning from more than 45 million American households–along with major utilities in 5 countries throughout Europe and AsiaPac. Opower manages more than 100 billion meter reads, ranging from high frequency power data (AMI), smart thermostats data, and weather data. Currently all data at Opower is stored in HBase or Hadoop (and is notably not security sensitive). This discussion will discuss Opower’s HBase architecture, highlight potential and current uses of data in HBase, share the vision of Opower’s future projects and directions, and reveal how Opower’s big data management has allowed the company to help its utility clients save enough energy to power a city of nearly 200,000 people and save utility customers more than $70 million since only 2008!

TRANSCRIPT

Page 1: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

HBase to Save the Planet

Alex [email protected]

Architect, Drawn to ScaleStrategic Advisor, Opower

Page 2: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

My life with HBase

Drawn to Drawn to ScaleScale

Drawn to Drawn to ScaleScale OpowerOpowerOpowerOpowerClouderaClouderaClouderaClouderaFactsetFactsetFactsetFactset

Page 3: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

About Opower

Opower is a customer engagement platform for the utility industry

Page 4: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

About Opower

Home energy reportsCustomized utility bills

Energy efficiency programs for utilities

Page 5: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

About Opower

Opower runs on analyticsAnalytics run on Hadoop + HBase

Page 6: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Opower analysis relies on datafrom a variety of sources

» Electric Utility Usage Data

» Gas Utility Usage Data

2

4

3 1

Data Storage & Processing

Disaggregation Algorithms

Shared Energy Signature

Repository

OPOWER Platform

» Thermostat data

» Weather data

Page 7: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Opower’s first architecture could not support their analytic vision

MySQLScalability?

Performance? Data integration?

Page 8: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Opower’s first architecture could not support their analytic vision

Analytic workflow instead of analytic apps:

SQL -> CSV -> R -> too little, too slow

Page 9: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Problem #1 Data Lake Cost

Usage AMI Regional AMI Sensor Data Data Lake

Page 10: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Problem #2 Slower and slower queries

Smart-grid-scale dataLots of supporting data: weather, demographics, etc.

Page 11: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Problem #3 It was taking lots of “magic”

Intense analyticsStrange schemas

Segmented queries

Page 12: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Hadoop + HBase at Opower

Opower determined that they needed an entirely new data architecture

Page 13: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

NexGen Architecture @ Opower

Page 14: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Hadoop + HBase at Opower

Early success: HBase AMI

Page 15: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

What rocked

Endless, cheap scalability

Page 16: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

What rocked

The analytics team loved it!

Page 17: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

What sucked

Hard on the ops team – still trying to grok it

Page 18: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

What suckedNoSchema p1.

Creating SchemaManaging MetaData

Schema <=> Performance

Page 19: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

What sucked

HAFailover

Snapshots

Page 20: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

What sucked

No secondary indexAggregation is slow (Rollup/OLAP)

Poor Client Performance

Page 21: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

It would be better if only …

Developers were not forced to know how the data is stored, indexed, etc.

Page 22: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

It would be better if only …

There were nicer APIs and better query languages (SQL?)

Page 23: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

It would be better if only …

Version migrations were easyHierarchical Tables

Page 24: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

It would be better if only …

Real-time tuning

Page 25: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

It would be better if only …

Did I mention HA?

Page 26: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

In summary

HBase has helped Opower achieve their analytic vision

But they’ve still got a long way to goHBase still has a long way to go

Page 27: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Questions?

Alex [email protected]

Architect, Drawn to ScaleStrategic Advisor, Opower