normalization in a mixed oltp and olap workload scenario · olap workload scenario anja bog 1, kai...

17
Normalization in a Mixed OLTP and OLAP Workload Scenario Anja Bog 1 , Kai Sachs 2 , Alexander Zeier 1 , and Hasso Plattner 1 1 Hasso Plattner Institute, University of Potsdam, 14482 Potsdam, Germany 2 SAP AG, 69190 Walldorf, Germany

Upload: others

Post on 19-Aug-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Normalization in a Mixed OLTP and OLAP Workload Scenario

Anja Bog1, Kai Sachs2, Alexander Zeier1, and Hasso Plattner1

1Hasso Plattner Institute, University of Potsdam, 14482 Potsdam, Germany

2SAP AG, 69190 Walldorf, Germany

Page 2: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Motivation

■ Analytics on up-to-date data vs. separated analytical and

transactional systems

How? Increase update frequency, use synced operational data stores

■ Flexible/ad hoc analyses vs. preparation of a data subset

for analytics

How? Use transactional data directly to answer analytical queries

■ But analytics on transactional data immensely intensifies

the load on the transactional system?

Evaluate impact of adding analytics to the transactional workload

Study variants of database design for mixed workloads

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

2

Page 3: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Database Design Steps

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

3

Page 4: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Database “Schemas” in Analytical and Transaction Processing

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

4

Page 5: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Case Study

■ Normalization as a key element that differs in OLTP and OLAP

database design

■ Analyze the interplay of workload mix and differently normalized

database schemas wrt. query performance

■ Dynamic database design in varying mixed workload scenarios?

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

5

Page 6: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Workload Scenario for the Case Study

■ Simulation on the basis of real enterprise data

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

6

Page 7: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Sales DB Schema – 1NF

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

7

Page 8: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Workload Components

Type Action Share/Type

Profile

wOLTP

Sales Order 30% 7+ Master Data (MD) selections, 5+ Transaction Data (TD) inserts

Shipping 27% 2+ TD selections, 2+ TD inserts

Billing 25% 4+ TD selections, 5+ TD inserts

Payment 18% 1 TD selection, 1+ TD update

rOTLP

Sales Order by Key

Ran-dom/Con-fig-urable

TD Selection on key, header item join

Sales Order by Period Range selection, no join

Open Bills (Items) Selection by non-key attributes, header item join, order by FK attribute and PK attribute

Customer Details MD Selection on key, 2-table join

Product Details MD Selection on key, 2-table join

OLAP

Daily Flash

Ran-dom/Con-fig-urable

TD aggregate range selection, header item join, group by 3 sales hierarchy attributes, order by aggregate

Avg. Order Processing Time TD aggregate select on independent range sub-select (4-table header item join), group by sales hierarchy, order by aggreg.

Order Delivery Fulfillment TD aggregate range select, 4-table header item join, group by sales hierarchy, order by aggregate, dependent aggregate range sub-select with header item join

Days Sales Outstanding TD aggregate range select, header item join, group by currency, order by aggregate, dependent aggregate range sub-select with header item join

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

8

Page 9: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Schema Variant l – Document Style

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

9

Page 10: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Schema Variant ll – Star Schema Style

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

10

Page 11: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Experimental Setup

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

11

■ Database types in test (chosen according to their usage in

productive enterprise systems)

□ In-memory column-oriented

□ Disk-based column-oriented

□ Disk based row-oriented

■ Workload simulation

□ 100 concurrent clients sending requests to the database

□ Varying share of OLTP and OLAP clients

Page 12: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Results – Disk-based row-oriented

12

Type Doc/1NF Snow/1NF 1NF/Doc Snow/Doc

wOLTP +8% +54% -10% - -5% +78% - +114%

rOLTP +85% +279% -19% - +17% +240% - +327%

OLAP N/A N/A +3681% - +6220% +305% - +1041%

0

10

20

30

40

50

60

70

80

90

100

100,0 80,20 75,25 60,40 50,50 40,60 25,75 20,80 0,100

Averag

e r

esp

on

se t

ime

(n

orm

alized

)

Client configuration [#OLTP,#OLAP]

wOLTP 1NF

wOLTP Doc

wOLTP Snow

rOLTP 1NF

rOLTP Doc

rOLTP Snow

OLAP 1NF

OLAP Doc

OLAP Snow

Favored Design: 1NF Document

Page 13: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Results –Disk-based column-oriented (I)

13

0.00

10.00

20.00

30.00

40.00

50.00

60.00

70.00

80.00

90.00

100.00

100,0 80,20 75,25 60,40 50,50 40,60 25,75 20,80 0,100

Averag

e r

esp

on

se t

ime

(n

orm

alized

)

Client configuration [#OLTP,#OLAP]

wOLTP 1NF

wOLTP Doc

wOLTP Snow

rOLTP 1NF

rOLTP Doc

rOLTP Snow

OLAP 1NF

OLAP Doc

OLAP Snow

Favored Design: 1NF Document

Type Doc/1NF Snow/1NF 1NF/Doc Snow/Doc

wOLTP +79% +692% -44% - -7% +314% - +812%

rOLTP +91% +760% -58% - -31% +196% – +760%

OLAP N/A N/A +342% – +619% -7% - +61%

Page 14: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Results –Disk-based column-oriented (II)

14

0.00

10.00

20.00

30.00

40.00

50.00

60.00

70.00

80.00

90.00

100.00

100,0 80,20 75,25 60,40 50,50 40,60 25,75 20,80 0,100

Averag

e r

esp

on

se t

ime

(n

orm

alized

)

Client configuration [#OLTP,#OLAP]

wOLTP 1NF

wOLTP Doc

wOLTP Snow

rOLTP 1NF

rOLTP Doc

rOLTP Snow

OLAP 1NF

OLAP Doc

OLAP Snow

Favored Design: 1NF Document

Type Doc/1NF Snow/1NF 1NF/Doc Snow/Doc

wOLTP +32% +47% -0.2 - +16% -19% - +47%

rOLTP -18% +47% +9% - +75% +0.4 – +48%

OLAP N/A N/A +99% - +109% +5% - +9%

Page 15: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Results –In-memory column-oriented

15

Type Doc/1NF Snow/1NF

wOLTP +80% - +143% +153% - +251%

rOLTP +28% - +127% +69% - +143%

OLAP +121% - +418% +72% - +498%

0

10

20

30

40

50

60

70

80

90

100

100,0 80,20 75,25 60,40 50,50 40,60 25,75 20,80 0,100

Averag

e r

esp

on

se t

ime

(n

orm

alized

)

Client configuration [#OLTP,#OLAP]

wOLTP 1NF

wOLTP Doc

wOLTP Snow

rOLTP 1NF

rOLTP Doc

rOLTP Snow

OLAP 1NF

OLAP Doc

OLAP Snow

Favored Design: 1NF

Page 16: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Conclusion

■ Manually created schema variants, tested via benchmark

framework that simulates real company order-to-cash composite

analytical and transactional workloads

■ Not necessarily beneficial to have a static database design in a

mixed workload scenario when the workload varies

■ In Future: Utilize strategies from database design advisors for

autonomous schema variation to adapt normalization in schemas

to changing workloads?

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

16

Page 17: Normalization in a Mixed OLTP and OLAP Workload Scenario · OLAP Workload Scenario Anja Bog 1, Kai Sachs2, Alexander Zeier , and Hasso Plattner1 1Hasso Plattner Institute, University

Thanks!

Please visit

http://epic.hpi.uni-potsdam.de

to find out more about our research.

TPC TC 2011 | Normalization in a Mixed OLTP and OLAP Workload Scenario | Anja Bog

17