when the rules change - nyougnyoug.org/.../uploads/2016/06/when-the-rules-change.pdf• no...

56
© James Morle, June 2016 WHEN THE RULES CHANGE Next Generation Oracle Database Architectures using Super-fast Storage James Morle, EMC DSSD 1

Upload: others

Post on 20-Apr-2020

10 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

WHEN THE RULES CHANGENext Generation Oracle Database Architectures using Super-fast Storage

James Morle, EMC DSSD

1

Page 2: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

INTRO

1993 20152001

thar be dragons

2010

founded in

2

Page 3: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

Disclaimer: I work for EMC these days, and use some of the corporate content, but all opinions here are my own - this is not an official company presentation.

3

Page 4: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

“I/O certainly has been lagging in the last decade”- Seymour Cray 1976

4

Page 5: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

THE ACCESS TIME GAP

Cos

t ($

/GB)

0

1

10

100

1,000

10,000

100,000

1,000,000

Access time (ns)

1 10 100 1,000 10,000 100,000 1,000,000 10,000,000 100,000,000

Access time gap

Disk

DRAM

NAND3D XPoint

5

Page 6: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

"Bandwidth is the work of man, latency is the realm of <insert deity here>"

Jeff Bonwick, CTO and Founder, DSSD

6

Page 7: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

BIG PIPES ARE EASY7

Page 8: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

LATENCY

PERFORMANCE

8

Page 9: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

WHAT MATTERS WITH ORACLE WORKLOADS?

• DW/BI Workloads:

• Multiblock read bandwidth

• Sequential write bandwidth and latency

• OLTP Workloads:

• Single block read latency

• Sequential write latency

9

Page 10: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

SO WHAT'S THE PROBLEM?• Delivery of low latency I/O requires low latency transport in addition

to low latency media

• We have the media, currently NAND flash, but…

• Fibre Channel often adds up to 200 microseconds of latency

• This needs something new, and fit for purpose… let’s start with the software

10

Page 11: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2015 EMC Corporation. All rights reserved. MENU

DSSD Block Device Access to DSSDA bit more latency due to kernel overhead

SOFTWAREApplication

Libraries

KERNEL

System Call

POSIX File System

Volume Mgr.

Device Driver

HARDWARE

PCIe HBA

SAS/SATA

Device Controller

Disk/NAND

PCIe Client Card

DSSD I/O Module

DSSD Flash Module

Application

Libraries

User DMA Port

libflood

DSSD Block Driver

300µS to 5,000µS

<120µS

11

Page 12: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2015 EMC Corporation. All rights reserved. MENU

DSSD FM VS OTHER FLASH STORAGESimpler and Faster Flash Modules

• DSSD has simple, fast Flash Modules • Control Module with rich resources

implements advanced global algorithms

DSSD FM DSSD FM DSSD FM

DSSD CM

ECC

ECC

ECC

Wear Leveling Flash Physics Garbage Collection

Defect MgmtCubic RAID FTL

Vaulting

Vaulting

Vaulting

ASIC ASIC ASIC

D5

ECC

Wear Leveling

Flash Physics

Garbage C

ollection

Defect M

gmt

FTL

Vaulting

ASIC

SSD Drive

ECC

Wear Leveling

Flash Physics

Garbage C

ollection

Defect M

gmt

FTL

Vaulting

ASIC

SSD Drive

ECC

Wear Leveling

Flash Physics

Garbage C

ollection

Defect M

gmt

FTL

Vaulting

ASIC

SSD Drive

• Complex firmware, limited power • Independently managed media

Standard Flash Devices

12

Page 13: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

MENU © Copyright 2015 EMC Corporation. All rights reserved.

HARDWARE + SOFTWARE RESILIENCE

System Wide Data Protection

• Cubic RAID has 2x greater reliability of other RAID but has the same overhead (17%)

• Cubic RAID Grid is an interlocked, multi-dimensional array of multi-page “cells” of NAND die

• High performance – always on

Always On Cubic Raid

5013

Page 14: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2015 EMC Corporation. All rights reserved. MENU

36 Flash Modules (FMs) 18 Flash Modules when Half Populated

2TB/4TB Flash Modules today Larger FMs on the roadmap

Dual Ported PCIe Gen 3 x4 per FM

Dual-Redundant Control Modules (CMs) PCIe Gen 3 Connected

Dense and Shared FlashDSSD D5 - 5U RACK SCALE FLASH PLATFORM

FLASH AND CMs

14

Page 15: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2015 EMC Corporation. All rights reserved. MENU

Redundant Power Supplies x4

Dual-Redundant IO Modules (IOMs) PCIe Gen 3 Connected

48 PCIe Gen 3 x4 Client Ports Per IOM

Total of 96 PCIe Gen 3 x4 Client Port Connections per D5

Redundant Fan Modules x5

Dense and Shared FlashDSSD D5 - 5U RACK SCALE FLASH PLATFORM

IOMs, Fans, Power Supplies

15

Page 16: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

NOISY NEIGHBORS• In all other (non-D5) storage solutions, data is

served by CPUs

• CPUs execute the code to service HBA requests, check caches, request data from media, and so on

• CPU is a relatively scarce resource, and prone to abuse by certain sessions/systems/users – the noisy neighbors

• When CPU resource is unavailable, response times degrade rapidly and exponentially

CPUNetwork

HBAFC or IP

CPU Memory

RequestMediaHBA Persistent Media

9

2, 3

4 5

1. Request arrives, 2. CPU accepts interrupt, checks CPU memory for cached copy3. If found, skip to 8. If not, continue 4. CPU forwards request to Media HBA5. HBA makes request from persistent media6. Media locates data and responds7. HBA forwards data to CPU8. CPU forwards data to Network HBA9. Return data to host

67

8

8Response

1

16

Page 17: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

NOISY NEIGHBOURS• In DSSD D5, data is self-service

• Hosts have full access to 18,432 flash chips, a much less scarce resource

• Data is spread thinly across those chips, minimizing contention

• All data transfers, read and write are direct DMA between host and flash

• The D5 has so much performance capacity, compared to other platforms, that the likelihood of a single errant system affecting others is greatly reduced

CPU

RequestFlash Media

2

1. Request arrives (as DMA write of requested LBA)2. CPU writes DMA directly to appropriate Flash Module3. Flash Module returns data via DMA write to host

3Response

1

Much less prone to Noisy Neighbor Syndrome!

17

Page 18: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2015 EMC Corporation. All rights reserved. MENU

Performance Oriented ArchitectureI/O Module PCIe ports

I/O Module PCIe ports

Control Module CPUsFlash Modules

18

Page 19: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

WHAT DOES ALL THIS GIVE US?• Marketing ‘hero’ numbers (real, but

using artificial tools):

• 100TB Usable

• 100GB/s bandwidth

• 100µs latency

• 10 million IOPs (4KB)

• 5U rack space

• Proven Oracle numbers

• 100TB Usable

• 60GB/s bandwidth into Oracle

• 140µs latency

• 5.3 million IOPs (8KB, SLOB)

• 5U rack space19

Page 20: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

AND THERE’S MORE…

• Up to two D5s are currently supported on a single system

• Proven Oracle numbers

• 200TB Usable

• 120GB/s bandwidth into Oracle

• 140µs latency

• 10.6 million IOPs (8KB, SLOB)

• 10U rack space

20

Page 21: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

NEW RULES

• D5 has NO cache - Everything is fast

• You just have a full 100TB usable ‘working set’

21

Page 22: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

TRADITIONAL STORAGE

Persistent Storage

CacheFAST

SLOW

Data M

otion

22

Page 23: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

Persistent Storage

D5 STORAGE

FAST

Entire Dataset

23

Page 24: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

WHAT DOES IT LOOK LIKE TO A DBA?• Familiar block-driver interface:

• i.e.: /dev/dssdXXXX devices

• Fully shared disk

• Multipathing is fully automatic and invisible

• No child devices exposed, no tunables

• Udev rules recommended to create friendly names

• Reference documentation is the “Oracle Databases on DSSD D5 – Best Known Methods” paper

24

Page 25: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

WHAT DOES IT LOOK LIKE TO A DBA?# ls -l /dev/asmdisks

total 0

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraOCR000441_00 -> ../dssd0030

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraOCR000441_01 -> ../dssd0031

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraOCR000444_00 -> ../dssd0028

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraOCR000444_01 -> ../dssd0029

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraRedo000441_00 -> ../dssd0000

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraRedo000441_01 -> ../dssd0001

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraRedo000444_00 -> ../dssd0026

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraRedo000444_01 -> ../dssd0027

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraVol000441_00 -> ../dssd0032

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraVol000441_01 -> ../dssd0033

lrwxrwxrwx 1 root root 11 Feb 11 20:19 OraVol000441_02 -> ../dssd003425

Page 26: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

WHAT DOES IT LOOK LIKE TO A DBA?SQL> l

1* select group_number,path,name,failgroup,mount_status from v$asm_disk order by 1,4,3

SQL> /

GROUP_NUMBER PATH NAME FAILGROUP MOUNT_S

------------ ---------------------------------------- -------------------- -------------------- -------

0 /dev/asmdisks/OraFRA000441_03 CLOSED

0 /dev/asmdisks/OraVol000441_11 CLOSED

0 /dev/asmdisks/OraVol000444_06 CLOSED

0 /dev/asmdisks/OraVol000444_03 CLOSED

0 /dev/asmdisks/OraRedo000444_00 CLOSED

0 /dev/asmdisks/OraVol000444_09 CLOSED

0 /dev/asmdisks/OraVol000444_01 CLOSED

… etc

26

Page 27: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

ELIMINATION OF COMPLEXITY• No dm-multipath or Powerpath

• Purpose built, high performance multipathing integral in client drivers

• Only a single device name is exposed, all detail is handled by the driver

• No manipulation of I/O elevators

• NOOP is forced

• Everything is 4KB anyway (blkdev)

• DMA access and separate submission and completion queues

• No queue tuning - DMA enqueues so fast that it is largely unnecessary - but we make an exception for redo

27

Page 28: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

WHICH DIMENSION MATTERS?

• Bandwidth?

• Latency?

• IOPs?

• Nobody actually needs 5.3M IOPs, but they are a side effect of the bandwidth and low latency - which people DO need!

28

Page 29: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

ANALYSIS OF DB TIME

• Low latency storage dramatically alters the split of time for a process

• Using SLOB:

• Traditional storage: ~200µs CPU, 6000µs single block read. 30:1 ratio

• D5: ~200µs CPU, ~200µs (at high load) single block read. 1:1 ratio

29

Page 30: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

LATENCY: SYNCHRONOUS I/O• Oracle workloads are most frequently dependent on synchronous I/O

• Index traversal and Nested Loop joins (serial I/O pathology)

• Log writer (redo bandwidth is proportional to write latency)

• Latency is now so low that the returns are diminishing after this:

• Reducing disk latency from 6ms->3ms was almost 2x speedup

• But now the compute time is similar to the I/O time - halving I/O latency is 25% speedup

• OMG - if we eliminate I/O altogether, we can only go 2x faster. Where did orders of magnitude go?!

30

Page 31: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

BANDWIDTH: BIG QUERIES

• It is rare that ‘adhoc query’ exists in reality:

• Sure, submit the query

• But it might not come back until next Tuesday

• Oh, and everyone else will suffer while it runs

31

Page 32: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

THE REALITY: THE DBA’S PLIGHT!

• Physical schema mitigations are adopted to minimize the data scan volume:

• Materialized Views

• Secondary Indexes

• Fine grain subpartitioning

• Even Smart Scan - a non-deterministic workaround

32

Page 33: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

AN EXPERIMENT

• Exam Question: How much do Materialized Views actually help with runtimes when you have next-generation I/O horsepower?

33

Page 34: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2016 EMC Corporation. All rights reserved.

DISSECTING THE QUERYTOP LEVEL

ALL_SALES (cur year)

ALL_SALES (prev year)

WHERE year=2000 WHERE year=1999

Filte

r

Filte

r

Join

cur_year_sales_cnt, prev_year_sales_cnt, sales_count_diff, sales_amount_diff

34

Page 35: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2016 EMC Corporation. All rights reserved.

MAIN BLOCKDISSECTING THE QUERY

Sales (ie WEB_SALES,

CATALOG_SALES or STORE_SALES)

Returns

7-14 billion rows

2-6 billion rows

date_dim item

WHERE category=‘Shoes’

Filte

r

Join

year, brand_id, class_id , category_id, manufact_id, cs_quantity-COALESCE(cr_return_quantity,0) AS sales_cnt, cs_ext_sales_price - COALESCE(cr_return_amount,0.0) AS sales_amt

‘NET SALES’ Query Block

35

Page 36: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2016 EMC Corporation. All rights reserved.

UNIONDISSECTING THE QUERY

Net Sales (Store)

Net Sales (Catalog)

Net Sales (Web)Union Union

ALL_SALES Inline View

36

Page 37: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2016 EMC Corporation. All rights reserved.

• Materialize the main query block of the three sales channels

THE TEST

WEB STORE CATALOG

WEB STORE CATALOG

Mat

eria

lize

Mat

eria

lize

Mat

eria

lize

37

Page 38: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2016 EMC Corporation. All rights reserved.

RESULTS

0

1.3

2.5

3.8

5

Data Volume Scanned (TB) Query Runtime (min)

3.4

1.3

4.54.6

Full Query MV Optimized Query

72

% less data

But only 24% runtime reduction

38

Page 39: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2016 EMC Corporation. All rights reserved.

• DSSD D5 makes the I/O portion of the query much less significant in the total runtime

• Remaining work, such as CPU compute, serialization, and inter-node communication remain constant

WHY ONLY A SMALL SPEEDUP?

39

Page 40: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2015 EMC Corporation. All rights reserved. MENU

D5 Versus a Typical All-flash Array

Complex Query Runtime (Shorter is Better)

Runt

ime

(M

inut

es)

0

7.5

15

22.5

30

DSSD D5 All Flash ArrayFull Query MV Optimized Query

24.4%

64.2%

40

Page 41: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2015 EMC Corporation. All rights reserved. MENU

D5 Full Query vs. AFA Materialized View

Complex Query Runtime (Shorter is Better)Ru

ntim

e (M

inute

s)

0

3

6

8

11

All Flash Array MV Optimized DSSD D5 Full Query

55.5%

41

Page 42: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2016 EMC Corporation. All rights reserved.

• Full query running on DSSD D5 (with gas left in the tank):

BANDWIDTH MATTERS

42

Page 43: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

SO WHAT?

• “Extreme Performance” is not just for “Extreme Workloads”

• As a DBA, you have only been able to deliver that which the hardware allows

• “Extreme Performance” is an enabler to business transformation

Page 44: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

SOFTWARE: ALGORITHMS

• Until now, a cache miss meant certain death…

• At least 50x slower, including code path

• Net result: algorithms carefully maximize cache hit, and optimizer aggressively favors cached access paths

44

Page 45: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

SOFTWARE: ALGORITHMS

• Next-Gen Storage:

• Cost of cache miss is much, much less

• But algorithms remain largely the same

• Algorithms could be significantly more speculative in approach

45

Page 46: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

SQL OPTIMIZER

• Should push more I/O out as large physical I/O requests

• Large index joins will become less relevant - synchronous/serial pathology and inefficient join algorithm at scale

• Large PIO is async and parallel, and hash joins are highly effective (if you can spill to disk at a decent rate)

46

Page 47: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

WHAT’S MISSING?• Things that will probably

never come:

• Data Services

• Compression

• Dedupe

• Things that are coming:

• Data Services (probably)

• At-rest Encryption

• Snapshots

• Replication

• Full Non-disruptive Operations support (definitely, and soon)

47

Page 48: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

ARCHITECTURES

• Tiering with D5

• Preferred Read Failure Group

48

Page 49: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016 49

Page 50: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016 50

Page 51: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

HADOOP/HDFS SUPPORT

• There is also an HDFS Datanode Plugin

51

Page 52: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2016 EMC Corporation. All rights reserved.

FLASH OPTIMIZED HDFS

TRADITIONAL HADOOP (3 COPIES OF DATA)SERVER 1 SERVER 2 SERVER 3

• HDFS uses a replication factor of at least 3 for availability

• Results in 3x+ data on persistent media

• Not economical for flash

• Stores just 1 copy of data regardless of replication factor

• Use entire flash capacity for data • Increase data locality without

using more capacitySERVER 1 SERVER 2 SERVER 3

HADOOP WITH DSSD D5 (1 COPY OF DATA)

52

Page 53: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© Copyright 2016 EMC Corporation. All rights reserved.

SIMPLIFIED ARCHITECTUREINDEPENDENT SCALING

Compute

Sto

rag

e Increased Cluster Nodes Required to Increase

Compute

Add D5 to Increase Storage

HDFS on DSSD • Scale compute independent

of storage

• Achieve optimal asymmetric high performance balance

• Add additional performance as hardware evolves

53

Page 54: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

HADOOP/HDFS SUPPORT• Elimination of Replication

• Storage savings make the D5 price competitive with local SSDs

• Local data access is possible for every attached host without storage multiplication

• Eliminates any Key Hashing hotspots

• Run all of this, Oracle, Hadoop, Filesystems, on the same storage platform54

Page 55: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

NEXT STEPS

• Moore's Law++ <—12mo doubling in storage density

• Controller CPU and memory is also subject to Moore's Law - balanced growth

• Optane/3DXpoint - another order of magnitude

55

Page 56: When The Rules Change - NYOUGnyoug.org/.../uploads/2016/06/When-The-Rules-Change.pdf• No dm-multipath or Powerpath • Purpose built, high performance multipathing integral in client

© James Morle, June 2016

THANK YOU!

• Any Questions?

[email protected]

• @jamesmorle

56