4 oct 04storage resource manager, timur perelmutov, don petravick, fermilab 1 storage resource...

28
4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi National Accelerator Lab

Upload: dayna-mccoy

Post on 13-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1

Storage Resource Management at Fermilab

Timur PerelmutovDon Petravick

Fermi National Accelerator Lab

Page 2: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 2

Fermilab Storage

● We have covered:– Enstore – permanent tape storage

● Scales to several petabytes● Flexible media selection● Encp access protocol

– Dcache – distributed disk storage ● Scales to hundreds of terabytes● Commodity hardware● Pluggable transfer protocols

– PNFS – namespace and file metadata server for Enstore and Dcache

● Next: SRM – Storage Resource Manager

Page 3: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 3

SRM Collaboration

Jefferson LabBryan HessAndy KowalskiChip Watson

Fermilab

Don PetravickTimur Perelmutov

LBNL

Arie ShoshaniAlex SimJunmin Gu

EU DataGrid WP2Peter KunsztHeinz StockingerKurt StockingerErwin Laure

EU DataGrid  WP5

Jean-Philippe BaudStefano OcchettiJens JensenEmil KnezoOwen Synge

Page 4: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 4

SRM Motivation (1)

Grid Architecture promises Reservation and Scheduling of the Following Shared Resources

– Computing Resources– Network Resources– Storage Resources (often neglected)

SRM provides Reservation and Scheduling of the Storage Resources

Page 5: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 5

SRM Motivation (2)

High Energy Physics Collaborations span multiple institutions where

– A large variety of types of Storage Systems exist● Single Disk / Raid● Robotic Tape Storage System (Enstore, HPSS)● Distributed Disk Cache (dCache)● Hierarchical Storage System (dCache - Enstore)

– Heterogeneous environments and proliferation of custom Mass Storage Systems (MSSs) exist

– User applications often need to access data at multiple institutions on multiple MSSs

SRM provides Standardized Uniform Access to Heterogeneous Storage

Page 6: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 6

Access to Multiple MSS

HPSS Client

Enstore Client

Jasmineclient

Enstore JASMine

ClientUSER/APPLICATIONS

Dcacheclient

DCache

CastorClient

CASTOR

Page 7: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 7

Uniform Access via SRM

SRM SRM SRM

Dcache/Enstore

JASMine

ClientUSER/APPLICATIONS

Grid Middleware

SRM

DCache

SRM

CASTOR

Page 8: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 8

Storage Classification

Storage systems can be classified by:– Persistence of data

● Permanent ● Temporary

– Data access availability● Data immediately available● Data needs advanced reservation before utilization (tapes need to be

mounted, files need to migrate to disks, etc.)– Supported transfer protocols

● File transfer protocols● POSIX like access protocols

Need management interface that supports all of the above

Page 9: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 9

Storage Resource Managers

SRMs are middleware components that manage shared storage resources on the Grid and provide:

●Uniform access to heterogeneous storage●File Transfer Protocol negotiation●Dynamic Transfer URL allocation●Access to permanent and temporary types of storage●Advanced space and file reservation●Reliable transfer services

Page 10: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 10

Data life cycle on the grid overview

●Collection or generation – Data is collected from the CMS detector– Data generated by the analysis

●Storage– Data is stored on a shared storage system

●Registration– Data is registered in a Replica Catalog by

via a Replica Registration Service (RRS)●Replication

– Data is replicated to a storage geographically close to the consumer

●Replica registration– RRS

●Delivery and consumption

Globally Unique Identifier

Logical File Name 1

Logical File Name 2

Logical File Name N

Physical File Name 1

Physical File Name 2

Physical File Name M

Replica Catalogs Mappings

Page 11: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 11

Illustration US-CMS DATAGRID

Compact Muon solenoid (CMS) is an experiment at the lepton Hadron Collider (LHC) at CERN in Geneva, Switzerland.●US-CMS is building a data grid to facilitate physics data analysis at academic institutions across the United States.●Data grid 3 tier architecture.

–Tier 0 CERN, Geneva, Switzerland.–Tier 1 consists of 5 regional centers, FERMILAB in Batavia, IL is a North American Center.–Tier 2 consists of 25 centres, 5 of these are in North America.

●SRM copy is used as a management protocol and reliable replication service for movement of data from tier 0 to tier 1 centers and from tier 1 to tier 2.

Page 12: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 12

Data life cycle on the grid and a role of SRM on the GRID

SRM-Client

SRM

cache

SRM

dCache

6.GridFTP ERET (pull mode)

EnstoreCASTOR

Replica Catalog

Network transferof DATA

1.DATA Creation 2. SRM-PUT

Network transfer

3. Register(via RRS)

CERNTier 0

Replica Manager

FNALTier 1

archive filesstage files

4.SRM-COPYTier0 to Tier1

5.SRM-GET

archive files

SRM

Tier2Storage

Tier 2Center

Network transfer

9.GridFTP ESTO (push mode)

8.SRM-PUT

7.SRM-COPYTier1 toTier2

SRM-Client

Retrieve data for analysis10.SRM-GET

UsersSRM-Client

Network transferof DATA

Page 13: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 13

●Two SRM Interface specifications – SRM v1.1provides

• Data access/transfer • Implicit space reservation

– SRM v2.1 adds • Explicit space reservation • Namespace discovery and manipulation • Access permissions manipulation

●Fermilab SRM implements SRM v1.1 specification ●SRM v2.1 by the end of 2004

Storage Resource Manager versions

Page 14: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 14

SRM Protocols andGroups of Functions

SRM interface consists of the following groups of functions:

● Space Management Functions – v2.1● Data Transfer Functions – v1.1 and v2.1● Directory Functions – v2.1● Permission Functions – v2.1● Status Functions – v1.1 and v2.1SRM v2.1 in the rest of this talk

Page 15: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 15

SRMFile and Space Types

Volatile Durable Permanent

Volatile Durable Permanent SpaceTypes

FileTypes

BlahBlah BlahBlahVolatileFile Type PermanentDurable

BlahBlah BlahBlahNoArchived YesNo

BlahBlah BlahBlahYesLifetime NoYes

BlahBlah BlahBlahYesAutoDeleted NoNo

Property

Page 16: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 16

SRM Interface Details

Space Management Functions●SrmReserveSpace●SrmReleaseSpace●srmUpdateSpace●srmCompactSpace●srmGetSpaceMetaData●srmChangeFileStorageType●srmGetSpaceToken

Directory●SrmMkdir●srmRmdir●srmRm●srmLs●srmMv

Data transfer functions●srmPrepareToGet●SrmPrepareToPut●srmCopy●SrmRemoveFiles●srmReleaseFiles●srmPutDone●srmAbortRequest●srmAbortFiles●srmSuspendRequest●srmResumeRequest

Status Functions●srmStatusOfGetRequest●srmStatusOfPutRequest●srmStatusOfCopyRequest●srmGetRequestSummary●srmExtendFileLifeTime●SrmGetRequestID

Permission●srmSetPermission●srmReassignToUser●srmCheckPermission

Page 17: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 17

SRM Interface Details

Space Management Functions●SrmReserveSpace●SrmReleaseSpace●srmUpdateSpace●srmCompactSpace●srmGetSpaceMetaData●srmChangeFileStorageType●srmGetSpaceToken

Directory●SrmMkdir●srmRmdir●srmRm●srmLs●srmMv

Data transfer functions●srmPrepareToGet●SrmPrepareToPut●srmCopy●SrmRemoveFiles●srmReleaseFiles●srmPutDone●srmAbortRequest●srmAbortFiles●srmSuspendRequest●srmResumeRequest

Status Functions●srmStatusOfGetRequest●srmStatusOfPutRequest●srmStatusOfCopyRequest●srmGetRequestSummary●srmExtendFileLifeTime●SrmGetRequestID

Permission●srmSetPermission●srmReassignToUser●srmCheckPermission

Page 18: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 18

Srm Example 1 - srmPrepareToPut

User

SRMNode

dCache HeadNode

dCache PoolNode

findPoolNode reserveSpace

spaceToken

srmPrepareToPut(SpaceToken,size,gsiftp)

srmReserveSpace

findDoor(gsiftp)

Transferurl=gsiftp://doorNode:port/path doorNode,port

dCache DoorNode

Gridftp ESTOR

Where is user space

poolNodeDATA

PutDone

Page 19: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 19

Srm Example - Robust Replication

Anywhere

cachedCache

HRM-COPY(thousands of files)

SRM-GET (one file at a time)

SRM-Client

SRM(performs writes)

SRM(performs reads)

FNALGridFTP GET (pull mode)

archive files

Network transfer

Enstore CASTOR

stage files

CERN

Page 20: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 20

SRM Interface Details

Space Management Functions●SrmReserveSpace●SrmReleaseSpace●srmUpdateSpace●srmCompactSpace●srmGetSpaceMetaData●srmChangeFileStorageType●srmGetSpaceToken

Directory●SrmMkdir●srmRmdir●srmRm●srmLs●srmMv

Data transfer functions●srmPrepareToGet●SrmPrepareToPut●srmCopy●SrmRemoveFiles●srmReleaseFiles●srmPutDone●srmAbortRequest●srmAbortFiles●srmSuspendRequest●srmResumeRequest

Status Functions●srmStatusOfGetRequest●srmStatusOfPutRequest●srmStatusOfCopyRequest●srmGetRequestSummary●srmExtendFileLifeTime●SrmGetRequestID

Permission●srmSetPermission●srmReassignToUser●srmCheckPermission

Page 21: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 21

Page 22: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 22

Grid File Access Library and SRM

Physics Application

Replica ManagerClient

SRMClient Local

File I/O

rfio I/Oopen()read()

etc.

dCap I/Oopen()read()

etc.

Grid File Access Library (GFAL)

SRMService

dCapService

rfioService

RMServices

MSSService

LocalDisk

POSIX I/O

GridFTPService

Wide AreaAccess

Michael Ernst’s slide

Page 23: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 23

Client Access to CMS-DATA

Michael Ernst’s slide

Client Client

LogicalQuery

Request Interpreter

RequestExecuter

Logical Files

site-specificfile requests

site-specific files

SRM SRM

SRMrequest

planning

pinning & filetransfer requests

propertyindex

ReplicaCatalog

MDS

Site hosting the Application

Network

Tier1 Tier2

? (dCache, dFarm, DRM, NeST)

Page 24: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 24

Status of Fermilab SRM Implementation

● SRM Interface to dCache Storage System– Data Transfer Functions (get, put and copy) – Load balancing, throttling, fairness– Scalable replication mechanism via gridftp– Automatic directory creation– Fault tolerance and reliability achieved by providing persistent

storage for transfer requests and retries on failures● SRM interface as a standalone product, adaptable to

work on top of another storage system through a SRM-Storage interface

● A reference implementation of the SRM-Storage interface to a Unix File System

Page 25: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 25

Fermilab SRM Implementation Plans (1)

● Space Management – First implicit as a part of SRM Version 1.1– Than explicit, through the SRM V2.1

● Full implementation of SRM Version 2.1 interface● International Lattice QCD (ILDG)● Research utilization of Lambda Station Interface by a

Storage System. Lambda Station gives optical path allocation and per flow routing

● Monitoring, Administration and Accounting interfaces

Page 26: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 26

Fermilab SRM Implementation Plans (2)

● Integration of SRM to SAMGrid Data Handling System as the universal

interface to a storage

Page 27: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 27

Fermilab SRM Implementation Plans (3)

● Integration with Virtual Organization based Authorization Services,

development of Storage Specific fine grain Authorization Services

Page 28: 4 Oct 04Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 1 Storage Resource Management at Fermilab Timur Perelmutov Don Petravick Fermi

4 Oct 04 Storage Resource Manager, Timur Perelmutov, Don Petravick, Fermilab 28

Resources

●The Storage Resource Manager Collaboration, http://sdm.lbl.gov/srm-wg/●Fermilab SRM Project , http://www-isd.fnal.gov/srm●Patrick Fuhrmann, dCache, Grid Storage Element and enhanced use cases,http://indico.cern.ch/contributionDisplay.py?contribId=233&sessionId=10&confId=0●DCache, Disk Cache Mass Storage System, http://www.dcache.org/●US-CMS, http://www.uscms.org/●Don Petravick, Lambda Station Proposal, http://hppc.fnal.gov/wawg/omnibus-text.pdf●Arie Shoshany, Replica Registration Service,http://www.ppdg.net/mtgs/28jun04-wb/slides/PPDG-AH-0406-RMS-RRS.ppt●Robert Kennedy, SAMGrid Integration of SRMs, http://indico.cern.ch/contributionDisplay.py?contribId=460&sessionId=7&confId=0●Michael Ernst, US-CMS Grid File Access Proposal,http://www.uscms.org/sandc/reviews/doe-nsf/2003-07/docs/GFA-Proposal-Short-v1.0.pdf●Michael Ernst, Managed Data Storage and Data Access Services for Data Grids,http://indico.cern.ch/contributionDisplay.py?contribId=190&sessionId=7&confId=0●Philip DeMar, LambdaStation: A forwarding and admission control service to interface production network facilities with advanced research network paths http://indico.cern.ch/contributionDisplay.py?contribId=359&sessionId=11&confId=0