grid checkpointing · xtreemos ip project is funded by the european commission under contract...

47
XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf, Germany XtreemOS Summer School, Günzburg, Germany, 2010 XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576

Upload: others

Post on 03-Aug-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576

1

Grid CheckpointingJohn Mehnert-Spahn

Heinrich-Heine University Duesseldorf, Germany

XtreemOS Summer School, Günzburg, Germany, 2010

XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576

Page 2: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Checkpointing

XtreemGCP

Communication channel checkpointing with heterogeneous checkpointers

( Adaptive Checkpointing – incremental grid cp )

Overview

2

Page 3: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Grid Jobs

Paris London Duesseldorf Barcelona

Job A running in a VOJob unit A1 Job unit A2 Job unit A3 Job unit A4

3

Page 4: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Faults

Paris London Duesseldorf Barcelona

Job A running in a VOJob unit A1 Job unit A2 Job unit A3 Job unit A4

4

Fault tolerance needed

Page 5: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Fault tolerance

Replication

Forward error recovery

Backward error recovery

5

Page 6: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576

6

Checkpointing & Restart

Checkpointing: The application state is saved periodically to stable storage.

Restart: The application gets reestablished from a recent checkpoint. Thus, no fall back to the initial state will occur.

Page 7: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576

7

Checkpointing & Restart

Checkpointing: Saving periodically the state of the application in stable storage

Restart: In case of a fault we can restart from a checkpoint and do not fall back to the initial state

Challenges: Trade-off between costs during fault-free execution and costs at recovery

Size of the distributed state may be very large

Checkpointing images must be replicated

Heterogeneity of checkpointer packages

Page 8: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Many Checkpointers exist

CoCheck

Condor

DCR

DMTCP & MTCPBLCR

LAM/MPI&BLCRzap

CLIP

libckpt

Dynamite

LinuxSSI

Linux-native

OpenVZ

tmPVM

VMWare player

Ckpt

CHPOX

CRAK

UCLiK

Epckpt

MCR

SCore

TICK

VMADump

8

KMU

CP/R

Page 9: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Workflow: Coordinated CP

9

XtreemGCPcheckpointing service

Page 10: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576

10

XtreemGCP

A grid service integrated within AEM implementingjob migration and job fault tolerance for grid jobs

Integrates existing checkpointer packages

Supports transparent and application-level checkpointing

Security

Page 11: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Grid-Checkpointing Architecture

11

Page 12: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

12

Grid-Checkpointing Architecture

Page 13: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

13

Grid-Checkpointing Architecture

Page 14: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

14

Grid-Checkpointing Architecture

Page 15: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

15

Grid-Checkpointing Architecture

Page 16: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

16

Grid-Checkpointing Architecture

Page 17: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Grid-Checkpointing Architecture

17

Page 18: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Uniform Checkpointer Interface

Uniform access to different checkpointer packages implemented by a translib (shared library)

Translations• function signatures• job-to-Linux process group• grid user id-to-local user id• callback management• checkpoint image dependencies• checkpointer-to-checkpointer• application-checkpointer-compatibility

18

Page 19: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

To which extent must existing checkpointers be adapted to support various checkpointing protocols?

We need the following sequences Stop Checkpoint Resume_cp

Rebuild Resume_rst

Uniform Checkpointer Interface

19

Checkpoint

Restart

Page 20: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Currently, supported checkpointer packages

BLCR

OpenVZ

MTCP

LinuxSSI

(Linux native)

Uniform Checkpointer Interface

20

Page 21: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Checkpoint files

Must be replicated

And accessible from each grid node

Stored in XtreemFS, providing: Stripping Automatic replication Location-transparent access Access control via XtreemOS user accounts

21

Page 22: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Coordinated Checkpointing Workflow

22

Translation Library

LinuxSSI Checkp.

Job-unit Checkpointer

Translation Library

BLCR

Job-unit Checkpointer

Job Checkpointer

LinuxSSI cluster Linux

Page 23: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Coordinated Checkpointing Workflow

23

Translation Library

LinuxSSI Checkp.

Job-unit Checkpointer

Translation Library

BLCR

Job-unit Checkpointer

Job Checkpointer

LinuxSSI cluster Linux

Page 24: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Coordinated Checkpointing Workflow

24

Translation Library

LinuxSSI Checkp.

Job-unit Checkpointer

Translation Library

BLCR

Job-unit Checkpointer

Job Checkpointer

LinuxSSI cluster Linux

Page 25: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Coordinated Checkpointing Workflow

25

Translation Library

LinuxSSI Checkp.

Job-unit Checkpointer

Translation Library

BLCR

Job-unit Checkpointer

Job Checkpointer

LinuxSSI cluster Linux

job meta­datajob­unit meta­datacheckpointer images

sync/split/replicate

Page 26: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Independent Checkpointing Workflow

26

Translation Library

LinuxSSI Checkp.

Job-unit Checkpointer

Translation Library

BLCR

Job-unit Checkpointer

Job Checkpointer

LinuxSSI cluster Linux

Page 27: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Independent Checkpointing Workflow

27

Translation Library

LinuxSSI Checkp.

Job-unit Checkpointer

Translation Library

BLCR

Job-unit Checkpointer

Job Checkpointer

LinuxSSI cluster Linux

Page 28: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Independent Checkpointing Workflow

28

Translation Library

LinuxSSI Checkp.

Job-unit Checkpointer

Translation Library

BLCR

Job-unit Checkpointer

Job Checkpointer

LinuxSSI cluster Linux

job meta­datajob­unit meta­datacheckpointer images

sync/split/replicate

Page 29: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Translation Library

LinuxSSI Checkp.

Job-unit Checkpointer

Translation Library

BLCR

Job-unit Checkpointer

Job Checkpointer

LinuxSSI cluster Linux

Independent Restart Workflow (during application runtime)

receive determinants (create dependency graph)

wrappers for send, recv, etc. (LD_PRELOAD)

29

Page 30: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Job Checkpointer

Independent Restart Workflow

calculate recovery line from received determinants

30

Page 31: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Translation Library

LinuxSSI Checkp.

Job-unit Checkpointer

Translation Library

BLCR

Job Checkpointer

LinuxSSI cluster Linux

Independent Restart Workflow

restart from CP1

rollback to CP2

Job-unit Checkpointer

31

Page 32: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Measurements

Checkpoint Restart

32

Page 33: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Callback Management

Implemented in generic part of translib

Called before and after a checkpoint and after restart

Common API for application callback registration

Useful for: Application-level checkpointing Application-level enhancements/optimizations System-level checkpointing of communication channels

33

Page 34: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Workflow: Coordinated CP

34

Channel checkpointing with heterogeneous checkpointers

Page 35: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Consistent Checkpoints- in-transit messages -

orphan message

lost message:

35

Page 36: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Soluition save in-transit messages

Marker-based approach

Challenges in the grid context

Node A Node B

Marker

Page 37: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Marker-based approach

Challenges • incompatible checkpointers must cooperate• migration support• transparency (application, checkpointer, operating system)

Node ACheckpointer X

Node B

„This is my marker.“

Challenges in the grid context

Page 38: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Marker-based approach

Challenges • incompatible checkpointers must cooperate• migration support• transparency (application, checkpointer, operating system)

Node ACheckpointer X

Node BCheckpointer Y

„This is my marker.“ „What's that? A normal paket with no specific meaning.“

Challenges in the grid context

Page 39: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Architecture

Page 40: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Gridkanalsicherung- Messungen -

Nachrichtenlänge und Sendefrequenz ohne Auswirkungen

40

Page 41: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Workflow: Coordinated CP

41

Adaptive checkpointing

Page 42: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Incremental Checkpointing• write-bit• reflect dynamical memory layout changes• mprotect und jsdl

Adaptive Checkpointing- Incremental Checkpointing -

Page 43: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Adaptive Checkpointing- Incremental Checkpointing -

Page 44: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Adaptive Checkpointing- Incremental Checkpointing -

Common Checkpoint Incremental Checkpoint

Page 45: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Adaptive Checkpointing- Incremental Checkpointing -

Common Restart Incremental Restart

Page 46: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Summary

XtreemGCP offers migration and fault tolerance in grids by

providing checkpointing and restart

It is designed for heterogeneous setups integrating

existing checkpointer packages

Future work:

virtual machine support & adaptive checkpointing

46

Page 47: Grid Checkpointing · XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 1 Grid Checkpointing John Mehnert-Spahn Heinrich-Heine University Duesseldorf,

Acknowledgment

EC for funding XtreemOS

XtreemOS- GCP contributors: Heinrich-Heine Universität Düsseldorf

John Mehnert-Spahn, Eugen Feller INRIA, Rennes, France

Christine Morin, Thomas Ropars, Surbi Chitre, Stefania Costache

47