understanding write behaviors of storage backends in ceph ...understanding write behaviors of...

54
Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo Kim, Joo-Young Hwang and Sangyeun Cho

Upload: others

Post on 26-Aug-2021

10 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Understanding Write Behaviors of Storage Backendsin Ceph Object Store

Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo Kim,

Joo-Young Hwang† and Sangyeun Cho†

Page 6: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

How Ceph Amplifies Writes

6

client

DataCeph

metadataCeph

journalDataCeph

metadataCeph

journalDataCeph

metadataCeph

journal

File system metadata

File system journal

File system metadata

File system journal

File system metadata

File system journal

Single write can make several hidden I/Os What about in long-term situation?

Page 7: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

7

0 2 4 6 8 10 12 14 16WAF

DataCeph

metadataCeph

journalFile system metadata

File system journal

Page 8: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

8

0 2 4 6 8 10 12 14 16WAF

DataCeph

metadataCeph

journalFile system metadata

File system journal

Page 9: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

9

0 2 4 6 8 10 12 14 16WAF

DataCeph

metadataCeph

journalFile system metadata

File system journal

Page 10: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

10

0 2 4 6 8 10 12 14 16WAF

DataCeph

metadataCeph

journalFile system metadata

File system journal

Page 11: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

11

0 2 4 6 8 10 12 14 16WAF

DataCeph

metadataCeph

journalFile system metadata

File system journal

Page 12: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

How Much Does Ceph Amplifies Writes

12

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

WAF

DataCeph

metadataCeph

journalFile system metadata

File system journal

Page 13: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

How Much Does Ceph Amplifies Writes

13

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

DataCeph

metadataCeph

journalFile system metadata

File system journal

3 0.702 5.994 0.419 3.635Writes amplified by over 13x

Page 14: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Our Motivation & Goal

How/Why are writes highly amplified in Ceph?• With the exact numbers

Why do we focus on write amplification?• WAF(Write amplification Factor) affects the overall performance

• When using SSDs, it hurts the lifespan of SSDs

• Larger WAF → smaller effective bandwidth

• Redundant journaling of journal may exist

Goals of this paper• Understanding of write behaviors of Ceph

• Empirical study of write amplification in Ceph

14

Page 15: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Outline

Introduction

Background

Evaluation environment

Result & Analysis

Conclusion

15

Page 17: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Architecture of Ceph RADOS Block Device (RBD)

17

librbd, librados

/dev/rbd

publicnetwork

MONOSD

#0OSD

#1

OSD#2

OSD#3

• • •

I/O Stack

CEPH RADOS Block Device

D

Page 18: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Architecture of Ceph RADOS Block Device (RBD)

18

librbd, librados

/dev/rbd

MONOSD

#0OSD

#1

OSD#2

OSD#3

• • •

I/O Stack

D

Object (4MB)

publicnetwork

Page 19: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Architecture of Ceph RADOS Block Device (RBD)

19

librbd, librados

/dev/rbd

MONOSD

#0OSD

#1

OSD#2

OSD#3

• • •

I/O Stack

D

Gets which OSD the object should be sent to

(CRUSH algorithm)

publicnetwork

Page 20: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Architecture of Ceph RADOS Block Device (RBD)

20*Figure Courtesy: Sébastien Han https://www.sebastien-han.fr

BlueStoreKStore

Storage

XFS

FileStore

*

OSD #2 OSD #10OSD #4OSD #1OSD #0

• • •• • • • • •

OSD Internal layersData

AttributesKey-Value Data

Replication 3x

D

Page 21: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Architecture of Ceph RADOS Block Device (RBD)

21

BlueStoreKStore

Storage

XFS

FileStore

*

OSD #2 OSD #10OSD #4OSD #1OSD #0

• • •• • • • • •

OSD Internal layersD Ceph Storage Backends (Object Store)

*Figure Courtesy: Sébastien Han https://www.sebastien-han.fr

Transaction

Page 22: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Ceph Storage Backends: (1) FileStore

Manages each object as a file

Write flow in FileStore1. Write-Ahead journaling

− For consistency and performance

2. Performs actual write to file after journaling− Can be absorbed by page cache

3. Calls syncfs + flush journal entries for every 5 seconds

NVMe SSD HDDs or SSDs

Ceph Journal XFS file system

ObjectsMetadataAttributes

Ceph journalCeph data Ceph metadata

FS metadata FS journal

Write-AheadJournaling

DB WALLevelDB

22

<Breakdown of FileStore>

Page 23: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Ceph Storage Backends: (2) KStore

Using existing key-value stores• Encapsulates everything to key-

value pair

Supports LevelDB, RocksDB and Kinetic Store

Write flow in KStore1. Simply calls key-value APIs with

the key-value pair

23

<Breakdown of KStore>

HDDs or SSDs

XFS file system

Objects, Metadata, Attributes

Ceph data Ceph metadataCompaction

FS metadataFS journal

DB WALLevelDB or RocksDB

Page 24: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Ceph Storage Backends: (3) BlueStore

To avoid limitations of FileStore• Double-write issue due to

journaling− Directly stores data to raw device

Write flow in BlueStore1. Puts data into raw block device

2. Sets metadata to RocksDB on BlueFS (user-level file system)

24

<Breakdown of BlueStore>

HDDs or SSDs NVMe SSD

Raw Device BlueFS

ObjectsMetadataAttributes

Ceph dataZero-filled data

RocksDB DB

DB WALRocksDB

NVMe SSD

BlueFS

RocksDB WAL

Page 25: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Outline

Introduction

Background

Evaluation environment

Result & Analysis

Conclusion

25

Page 26: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

H/W Evaluation Environment

26

OSD Servers (x4)

Model DELL R730

Processor Intel® Xeon® CPU E5-2640 v3

Memory 32 GB

Storage HGST UCTSSC600 600 GB x4Samsung PM1633 960 GB x4Intel® 750 series 400 GB x2

Admin Server / Client (x1)

Model DELL R730XD

Processor Intel® Xeon® CPU E5-2640 v3

Memory 128 GB

Switch (x2)

Public Network

DELL N4032 10Gbps Ethernet

StorageNetwork

Mellanox SX6012 40Gbps InfiniBand P

ub

lic Netw

ork

Sto

rage

Net

wo

rk

Page 27: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

S/W Evaluation Environment

Linux 4.4.43 kernel • Some modifications to collect and classify write requests

Ceph Jewel LTS version (v10.2.5)

From client side• Configures a 64GiB KRBD

− 4 OSDs per OSD server (total 16 OSDs)

• Generates workloads using fio while collecting trace and diskstats

Perform 2 different workloads• Microbenchmark

• Long-term workload

27

Page 28: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Outline

Introduction

Background

Evaluation environment

Result & Analysis

Conclusion

28

Page 29: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Key Findings in Microbenchmark

FileStore• Large WAF when write request size is small (over 40 at 4KB write)

• WAF converges to 6 (3x by replication, 2x by Ceph journaling)

KStore• Sudden WAF jumps due to memtable flush

• WAF converges to 3 (by replication)

BlueStore• Because the minimum extent size is set to 64KB

− Zero-filled data for the hole within the object

− WAL (Write-Ahead Logging) for small overwrite to guarantee durability

• WAF converges to 3 (by replication)

29

Page 30: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Long-Term Workload Methodology

We focus on 4KB random writes• In VDI, most of the writes are random and 4KB in size

Workload scenario1. Install Ceph and create a krbd partition

2. Drop page cache, call sync and wait for 600 secs

3. Issue 4KB random writes with QD=128 until the total write amount reaches 90% of the capacity (57.6GiB)

Run tests with 16 HDDs first and repeat with 16 SSDs

Calculate and breakdown WAF for given time period30

Page 31: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Long-term Results: (1) FileStore

31

0

1000

2000

3000

4000

5000

0

1

2

3

4

0 2025 4051 6077 8132

IOP

S (o

ps/

sec)

Wri

te A

mo

un

t (G

B/1

5se

c)

Time (sec)

0

15000

30000

45000

0

2

4

6

8

0 93 186 280 379

IOP

S (o

ps/

sec)

Wri

te A

mo

un

t (G

B/3

sec)

Time (sec)

Ceph data + Ceph metadata Ceph journal File system metadata File system journal IOPS

<HDD>

<SSD>

Large fluctuation due to repeated throttling

No throttling: SAS SSDs with XFS are enough fast

Performance throttling due to slow HDDs

Ceph journal first absorbs random writes

Page 32: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0

1000

2000

3000

4000

5000

0123456789

0 4711 9422 14134 18905

IOP

S (o

ps/

sec)

Wri

te A

mo

un

t (G

B/1

5se

c)

Time (sec)

0

15000

30000

45000

0

2

4

6

8

10

12

0 424 848 1272 1700

IOP

S (o

ps/

sec)

Wri

te A

mo

un

t (G

B/3

sec)

Time (sec)

Ceph data Ceph metadata Compaction File system metadata File system journal IOPS

Long-term Results: (2) KStore (RocksDB)

32

<HDD>

<SSD>

Huge Compaction Traffic in both cases

Huge compaction traffic in both casesFluctuation gets worse

Page 33: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

<HDD>

<SSD>

0

15000

30000

45000

0

5

10

15

20

0 105 210 316 421

IOP

S (o

ps/

sec)

Wri

te A

mo

un

t (G

B/3

sec)

Time (sec)

Ceph data RocksDB RocksDB WAL Compaction Zero-filled data IOPS

0

1000

2000

3000

4000

5000

0

1

2

3

4

5

6

7

0 2251 4501 6752 9032

IOP

S (o

ps/

sec)

Wri

te A

mo

un

t (G

B/1

5se

c)

Time (sec)

Long-term Results: (3) BlueStore

33

Since allocating extents is in sequential order,first incoming writes become sequential writes

Zero-filled data traffic is huge at fistZero-filled data traffic is huge at fist

SSDs have superior random write performance

Page 34: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Overall Results of WAF

34

0 5 10 15 20 25 30 35 40 45

SSD

HDD

SSD

HDD

SSD

HDD

Blu

eSt

ore

KSt

ore

(Ro

cksD

B)

File

Sto

re

Ceph data Ceph metadata Ceph journal Compaction

Zero-filled data File system metadata File system journal

60 70 80 WAF

Page 35: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Overall Results of WAF

35

0 5 10 15 20 25 30 35 40 45

SSD

HDD

SSD

HDD

SSD

HDD

Blu

eSt

ore

KSt

ore

(Ro

cksD

B)

File

Sto

re

Ceph data Ceph metadata Ceph journal Compaction

Zero-filled data File system metadata File system journal

60 70 80 WAF

WAF for Ceph journal is about 6 in both cases, not 3→ Ceph journal triples the write traffic

FS metadata + FS journal = 4.2 (HDD), 4.054 (SSD)

FS metadata + FS journal = 4.2 (HDD), 4.054 (SSD)

Page 36: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Overall Results of WAF

36

0 5 10 15 20 25 30 35 40 45

SSD

HDD

SSD

HDD

SSD

HDD

Blu

eSt

ore

KSt

ore

(Ro

cksD

B)

File

Sto

re

Ceph data Ceph metadata Ceph journal Compaction

Zero-filled data File system metadata File system journal

60 70 80 WAF

Huge compaction traffic matters

Page 37: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Overall Results of WAF

37

0 5 10 15 20 25 30 35 40 45

SSD

HDD

SSD

HDD

SSD

HDD

Blu

eSt

ore

KSt

ore

(Ro

cksD

B)

File

Sto

re

Ceph data Ceph metadata Ceph journal Compaction

Zero-filled data File system metadata File system journal

60 70 80 WAF

WAF is still larger than FileStore’seven without zero-filled data

Compaction traffic for RocksDB is not ignorable

Page 38: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Conclusion

Writes are amplified by more than 13x in Ceph• No matter which storage backend is used

FileStore• External Ceph journaling triples write traffic

• File system overhead exceeds the original data traffic

KStore• Suffers huge compaction overhead

BlueStore• Small write requests are logged on RocksDB WAL

• Unignorable zero-filled data & compaction traffic

38

Thank you!

Page 39: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Backup Slides

39

Page 40: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Microbenchmark Methodology

Purpose• To see basic behaviors of each storage backend

Methodology• Calculate WAF while doubling write size (4KiB, 8KiB, 16KiB, …)

• Reinstall Ceph for each write size

40

1ST WRITE 2ND WRITE

OVERWRITE

0 4MiB2MiB

3RD WRITE

krbd

Object (Default: 4MiB)

Page 41: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Microbenchmark Result

41

0

10

20

30

40

50

60

70

80

4K 8K 16K 32K 64K 128K 256K 512K 1M 2M 4M

WA

F

Request Size

1st write 2nd write 3rd write Overwrite

0

10

20

30

40

50

60

4K 8K 16K 32K 64K 128K 256K 512K 1M 2M 4M

WA

F

Request Size

1st write 2nd write 3rd write Overwrite

Overwrite smaller than 64KB- WAL in the case of overwrite

smaller than the minimum extent size for consistency

<FileStore> <BlueStore>

Large WAF at 4KB- Range 41 to 67- Due to large hidden I/Os

3rd write smaller than 32KB- Due to the minimum extent

allocation size is 64KB- Zero-filling the write hole

WAF converges to 3- 3x by replication

WAF converges to 6- 3x by replication - 2x by external

journaling

Page 42: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Microbenchmark Result (cont’d)

42

0

5

10

15

20

25

30

35

40

45

4K 8K 16K 32K 64K 128K 256K 512K 1M 2M 4M

WA

F

Request SIze

1st write 2nd write 3rd write Overwrite

0

2

4

6

8

10

12

14

16

18

4K 8K 16K 32K 64K 128K 256K 512K 1M 2M 4M

WA

F

Request Size

1st write 2nd write 3rd write Overwrite

<KStore (LevelDB)> <KStore (LevelDB)>

WAF converges to 3- 3x by replication

Sudden jumps- Due to memtable flush

Page 43: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Why Ceph?

It’s cloud computing era• We need a virtual block device to support a VM

43

Block Device

Ummm, it should be stable, reliable plus high-performance

Page 44: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Why Ceph?

It’s cloud computing era• We need lots of virtual block devices to support VMs

44

Stable, reliable, high-performance, consistent, scalable, fault-tolerance

Page 45: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Why Ceph?

It’s cloud computing era• We need lots of virtual block devices to support VMs

45

Stable, reliable, high-performance, consistent, scalable, fault-tolerance

Page 47: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

Overall Results of WAF

47

Page 48: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

48

0 2 4 6 8 10 12 14 16

DataCeph

metadataCeph

journal

File system metadata

File system journal

WAF

Page 49: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

49

0 2 4 6 8 10 12 14 16

DataCeph

metadataCeph

journal

File system metadata

File system journal

WAF

Data : 3

Page 50: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

50

0 2 4 6 8 10 12 14 16

DataCeph

metadataCeph

journal

File system metadata

File system journal

WAF

Data : 3 Ceph metadata : 0.702

Page 51: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

51

0 2 4 6 8 10 12 14 16

DataCeph

metadataCeph

journal

File system metadata

File system journal

WAF

Data : 3 Ceph metadata : 0.702 Ceph journal : 5.994

Page 52: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

How Much Does Ceph Amplifies Writes

52

0 2 4 6 8 10 12 14 16

DataCeph

metadataCeph

journal

File system metadata

File system journal

WAF

Data : 3 Ceph metadata : 0.702 Ceph journal : 5.994 File system metadata : 0.419

Page 53: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

How Much Does Ceph Amplifies Writes

53

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

DataCeph

metadataCeph

journal

File system metadata

File system journal

WAF

Data : 3 Ceph metadata : 0.702 Ceph journal : 5.994 File system metadata : 0.419 File system journal : 3.635

Page 54: Understanding Write Behaviors of Storage Backends in Ceph ...Understanding Write Behaviors of Storage Backends in Ceph Object Store Dong-Yun Lee, Kisik Jeong, Sang-Hoon Han, Jin-Soo

How Much Does Ceph Amplifies Writes

54

0 2 4 6 8 10 12 14 16

File

Sto

re

Ceph data Ceph metadata Ceph journal File system metadata File system journal

DataCeph

metadataCeph

journal

File system metadata

File system journal

WAF

Data : 3 Ceph metadata : 0.702 Ceph journal : 5.994 File system metadata : 0.419 File system journal : 3.635

Writes amplified by over 13x