keys to balancing explosive data growth · 2014. 8. 14. · 12 –sybase confidential –may 5,...

14
Keys to Balancing Explosive Data Growth JOSEPH SHAFFNER DIRECTOR OF TECHNICAL SALES, SYBASE May 5, 2009

Upload: others

Post on 06-Mar-2021

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

Keys to Balancing Explosive Data Growth

JOSEPH SHAFFNERDIRECTOR OF TECHNICAL SALES, SYBASE

May 5, 2009

Page 2: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

2 – Sybase Confidential – May 5, 2009

Can’t get the insights about the business

With Explosive Data Growth, Challenges Arise

Source: The Toxic Terabyte, IBM

2007

2008

2009

2010

2011

= Annual doublings of total data worldwide

Time

An

nu

al D

ou

blin

gs o

f Data

25

400

800

What you don’t know is costing you….in risk, resources, and knowledge

Page 3: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

3 – Sybase Confidential – May 5, 2009

Need For Knowledge

• Business Intelligence ranked top technology priority in 2009, 4th year in a row. (Source: Gartner Executive Programs survey 1,500+ CIOs)

• Leverage & expand existing BI investments

– More pervasive

– Actionable

Page 4: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

4 – Sybase Confidential – May 5, 2009

Why are Businesses Not Fixing the Problem

• Cost of delivery is too high

– Resources

– Infrastructure

• Inconsistency in information across the data silos

• Using manual processes to integrate data into one version of truth

• Time to deliver is to slow

How difficult to get relevant corporate information to make business decisions

Aware of bad decisions managers have made due to insufficient information

Page 5: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

5 – Sybase Confidential – May 5, 2009

The Gap

Page 6: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

6 – Sybase Confidential – May 5, 2009

Bridging the Gap: Something Has to Change

• Relational emerged in 1980 and mainstream 1990– Transactional

– Data warehouse, Business Intelligence and Analytics

• Relational builds on transactional architecture for DW– More infrastructure

– More complexity

– More skills needs

– “one size fits all”

• OLAP Emerge– Still requires specialized skills

– Hard to scale

Page 7: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

7 – Sybase Confidential – May 5, 2009

Emerging Column-based DBMS can Bridge the Gap

Column-based Analytic Engines Can Manage the Explosive Data Growth

Page 8: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

8 – Sybase Confidential – May 5, 2009

Why Column-based DBMS for Analytics

• Data is stored in rows (horizontally)

• Need indexes, partitions, etc for performance

• Much of the database space is taken up for indexes & growth

• 3NF vs. Dimensional

• Aggregations, Materialized Views & Denormalization

Relational Database

54321 …9876

r1

r2

r3

r4

r5

Columnar Database

54321 …9876

r1

r2

r3

r4

r5

• Data is stored in columns

(vertically)

• The data is the indexes

• Little to no resources

required

• Retrieve only columns used in

the query

• Reduce I/O

• Data Compression up to

70%

Page 9: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

9 – Sybase Confidential – May 5, 2009

Shared Disk vs. MPP Architecture

© 2008 Sybase, Inc. and IBM Corporation

AnalyticEngine

CPU CPU

Mem CPU

CPU Mem

Analytic Engine

CPU CPU

CPU Mem

CPU Mem

Analytic Engine

CPU CPU

Mem Mem

Mem Mem

Analytic Engine

CPU CPU

CPU CPU

Mem Mem

AnalyticEngine

CPU Mem

CPU Mem

CPU Mem

SAN

Shared Disk Benefits• One data store

• Shared Across Multiple • Computer Nodes

• No Partitioning of Data

• No Distributed Lock Management

• Individual nodes are Independent of Other Nodes

• Add servers and CPUs – terabytes of disk – with little or no loss in performance

One copy of data, backup is compressed, large savings in power, cooling and storage costs.

Reader NodesWriter NodeShared

Disks

Multiplexing architecture – concurrency and scalability

Page 10: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

10 – Sybase Confidential – May 5, 2009

Replicate Data Assets Without Intensive Resources and Additional Storage

Look for…

• Non-intrusive transaction capture

• Flexible transformation of data

• Efficient routing across networks

• Real-time synchronization across heterogeneous DBs

• Oracle• Microsoft• DB2 on all platforms including mainframe and AS/400

Replication Agents

Replicate

Connect

Data Sources

• ASE

• SQL Anywhere

• Oracle• Microsoft SQL Server

• DB2 UDB (Unix, Windows)

• DB2 on mainframe (Z/OS) – available as an separately sold

option in DI Suite

Analytics DBMS

Page 11: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

11 – Sybase Confidential – May 5, 2009

Everybody was pretty amazed. We were hoping for a 50% decrease in query time. But we saw many times more than that when we started up the system. The Sybase IQ implementa-tion far exceeded our expectations. The results were stunning.

Simon Falconer

DBA

TelstraClear

Business Issues� Huge data volume – 3 billion rows of

information, growing by 6 million rows per day

� Goal – Develop a reporting system across the enterprise to provide on demand reporting and business analytics against enough historical data to reflect historical trends

Results� Not able to solve need with existing technology� Call reporting system stored and managed 13

months of data – at no additional storage cost� Complex report times reduced from two

hours to two minutes� Business analysts able to create and request

their own reports – without the need for IT intervention

� Increased capacity without additional hardware� Low maintenance and DBA costs

TELSTRACLEAR

Page 12: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

12 – Sybase Confidential – May 5, 2009

Remarkable Gains

“We have seen remarkable improvements in productivity,

and big decreases in training, support, and hardware

costs. Sybase IQ allows the IRS to better analyze data, and

because of that, we are able to administer the tax code

more fairly and equitably.”

Jeff ButlerDirector of Research Databases

Office of Research, US Internal Revenue Service

Page 13: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

13 – Sybase Confidential – May 5, 2009

Apply A New Approach to Analytics

BE SMARTER about the problem

• Leverage existing investments by using replication for heterogeneous integration

• Continue using relational databases for your transactional systems

• Think about using columnar databases for analytics and reporting

• Evaluate compression technology to help manage the data volume

Page 14: Keys to Balancing Explosive Data Growth · 2014. 8. 14. · 12 –Sybase Confidential –May 5, 2009 Remarkable Gains “We have seen remarkable improvements in productivity, and

14 – Sybase Confidential – May 5, 2009

THANK YOU!