osg information services, vo monitoring services and resource selection services

28
Gabriele Garzoglio, Rob Quick, Chris Green July 25, 2007 1/21 OSG Information Services OSG Information Services, VO Monitoring Services and Resource Selection Services Gabriele Garzoglio, Chris Green, Computing Division, Fermilab Rob Quick, Indiana University OSG User Meeting & OSG Site Administrators Meeting July 2007 OSG Information Services Architecture The VO Resource Service (VORS) The OSG Resource Selection Service (ReSS) ClassAd Matchmaking How these affect the Sites How these affect the User

Upload: wendi

Post on 12-Jan-2016

34 views

Category:

Documents


0 download

DESCRIPTION

OSG Information Services, VO Monitoring Services and Resource Selection Services. OSG Information Services Architecture The VO Resource Service (VORS) The OSG Resource Selection Service (ReSS) ClassAd Matchmaking How these affect the Sites How these affect the User. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 1/21

OSG Information Services OSG Information Services, VO Monitoring Services and Resource Selection Services

Gabriele Garzoglio, Chris Green, Computing Division, Fermilab

Rob Quick, Indiana University

OSG User Meeting & OSG Site Administrators Meeting July 2007

• OSG Information Services Architecture• The VO Resource Service (VORS)• The OSG Resource Selection Service (ReSS)• ClassAd Matchmaking• How these affect the Sites• How these affect the User

Page 2: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 2/21

OSG Information Services

Context

• The OSG Information Services have 4 goals:– Provide static and “real-time” (where real-time is still

evolving) information about Resource configurations and state.

– Feed OSG-wide monitoring tools and provide interfaces to this information for Grid operations, VOs and Users.

– Provide information for interoperation of OSG and EGEE for LHC Experiments and WLCG operations.

– Provide information for resource selection by OSG VOs and Users.

Page 3: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 3/21

OSG Information Services

Please ask Questions During this talk

We are looking for input, feedback and guidance.

Page 4: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 4/21

OSG Information Services

OSG IS Architecture

Grid / Site Interface

VO / Grid Interface

Sit

eG

rid

VO

Static Info (LDIF)Info Providers Config…

Configuration

Info FormattingInfo Publishing Site Info Publisher (CEMon)

Generic Info Providers (GIP) Info Gathering

Info CollectionJob / Res. Match ReSS

Info CollectionBDII

Info Coll.WLCG BDII

Job Queue CondorSchedd

Job/Res. Match CondorMatchmaker

Job QueueJob/Res. Match

EGEE ResourceBroker (RB)

Info CollectionInfo DisplayVORS

Info GatheringVORSProbes

Instantiate…

LDIF

Classad

Classad

LDIF

LDIF

Page 5: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 5/21

OSG Information Services

VORS in OSG IS

Grid / Site Interface

VO / Grid Interface

Sit

eG

rid

VO

Static Info (LDIF)Info Providers Config…

Configuration

Info FormattingInfo Publishing Site Info Publisher (CEMon)

Generic Info Providers (GIP) Info Gathering

Info CollectionJob / Res. Match ReSS

Info CollectionBDII

Info Coll.WLCG BDII

Job Queue CondorSchedd

Job/Res. Match CondorMatchmaker

Job QueueJob/Res. Match

EGEE ResourceBroker (RB)

Info CollectionInfo DisplayVORS

Info GatheringVORSProbes

Instantiate…

LDIF

Classad

Classad

LDIF

LDIF

Page 6: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 6/21

OSG Information Services

What VORS does for you…

• Allows VO users to pick which sites support their VO

• Provides critical site info to a VO user

• Gives users a snapshot of current grid and site status

• Will provide a facility for users to look at other Grids from an OSG PO

Page 7: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 7/21

OSG Information Services

Page 8: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 8/21

OSG Information Services

Page 9: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 9/21

OSG Information Services

Page 10: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 10/21

OSG Information Services

ReSS in OSG IS

Grid / Site Interface

VO / Grid Interface

Sit

eG

rid

VO

Static Info (LDIF)Info Providers Config…

Configuration

Info FormattingInfo Publishing Site Info Publisher (CEMon)

Generic Info Providers (GIP) Info Gathering

Info CollectionJob / Res. Match ReSS

Info CollectionBDII

Info Coll.WLCG BDII

Job Queue CondorSchedd

Job/Res. Match CondorMatchmaker

Job QueueJob/Res. Match

EGEE ResourceBroker (RB)

Info CollectionInfo DisplayVORS

Info GatheringVORSProbes

Instantiate…

LDIF

Classad

Classad

LDIF

LDIF

Page 11: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 11/21

OSG Information Services

ReSS Motivations

• Implement a light-weight cluster selector for push-based job handling services

• Enable users to express requirements on the resources in the job description

• Enable users to refer to abstract characteristics of the resources in the job description

• Provide soft-registration for clusters• Use the standard characterizations of the

resources via the Glue Schema

Page 12: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 12/21

OSG Information Services

ReSS Technology

• ReSS basis its central services on the Condor Match-making service– Users of Condor-G naturally integrate their scheduler servers

with ReSS– Condor information collector manages resource soft registration

• Resource characteristics is handled at sites by the EGEE gLite CE Monitor Service (CEMon)– CEmon registers with the central ReSS services at startup– Info is gathered by CEMon at sites running Generic Information

Prividers (GIP)– GIP expresses resource information via the Glue Schema model– CEMon converts the information from GIP into old classad

format. Other supported formats: XML, LDIF, new classad– CEMon publishes information using web services interfaces

Page 13: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 13/21

OSG Information Services

A case study:VO Schedd to interact with ReSS

Grid / Site Interface

VO / Grid Interface

Sit

eG

rid

VO

Static Info (LDIF)Info Providers Config…

Configuration

Info FormattingInfo Publishing Site Info Publisher (CEMon)

Generic Info Providers (GIP) Info Gathering

Info CollectionJob / Res. Match ReSS

Info CollectionBDII

Info Coll.WLCG BDII

Job Queue CondorSchedd

Job/Res. Match CondorMatchmaker

Job QueueJob/Res. Match

EGEE ResourceBroker (RB)

Info CollectionInfo DisplayVORS

Info GatheringVORSProbes

Instantiate…

LDIF

Classad

Classad

LDIF

LDIF

Page 14: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 14/21

OSG Information Services

VO Condor-Schedd interacts with ReSS

CondorMatch Maker

InfoGatherer

classads

classads classads classads

CondorScheduler

jobWhat Gate?

Gate 3

job

CEMon

CE

Gate1

job-managersjob-managersjob-managers

jobs info

CLUSTER

GIP

CEMon

CE

Gate2

job-managersjob-managersjob-managers

jobs info

CLUSTER

GIP

CEMon

CE

Gate3

job-managersjob-managersjob-managers

jobs info

CLUSTER

GIP

ReSS

• Info Gatherer is the Interface Adapter between CEMon and Condor

Grid / Site Interface

VO

/ G

rid

Inte

rfa

ce

Gri

dS

ite

VO

Page 15: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 15/21

OSG Information Services

User Interacts with Schedd and ReSSuniverse = globusglobusscheduler = $$(GlueCEInfoContactString) requirements = TARGET.GlueCEAccessControlBaseRule == "VO:DZero"executable = /bin/hostnamearguments = -fqueue

MyType = "Machine"Name = "antaeus.hpcc.ttu.edu:2119/jobmanager-lsf-dzero.-1194963282"Requirements = (CurMatches < 10)ReSSVersion = "1.0.6"TargetType = "Job"GlueSiteName = "TTU-ANTAEUS"GlueSiteUniqueID = "antaeus.hpcc.ttu.edu"GlueCEName = "dzero"GlueCEUniqueID = "antaeus.hpcc.ttu.edu:2119/jobmanager-lsf-dzero"GlueCEInfoContactString = "antaeus.hpcc.ttu.edu:2119/jobmanager-lsf"GlueCEAccessControlBaseRule = "VO:dzero"GlueCEHostingCluster = "antaeus.hpcc.ttu.edu"GlueCEInfoApplicationDir = "/mnt/lustre/antaeus/appsGlueCEInfoDataDir = "/mnt/hep/osg"GlueCEInfoDefaultSE = "sigmorgh.hpcc.ttu.edu"GlueCEInfoLRMSType = "lsf"GlueCEPolicyMaxCPUTime = 6000GlueCEStateStatus = "Production"GlueCEStateFreeCPUs = 0GlueCEStateRunningJobs = 0GlueCEStateTotalJobs = 0GlueCEStateWaitingJobs = 0GlueClusterName = "antaeus.hpcc.ttu.edu"GlueSubClusterWNTmpDir = "/tmp"GlueHostApplicationSoftwareRunTimeEnvironment = "MountPoints,VO-cms-CMSSW_1_2_3"GlueHostMainMemoryRAMSize = 512GlueHostNetworkAdapterInboundIP = FALSEGlueHostNetworkAdapterOutboundIP = TRUEGlueHostOperatingSystemName = "CentOS"GlueHostProcessorClockSpeed = 1000GlueSchemaVersionMajor = 1…

Resource DescriptionResource Description

Job DescriptionJob Description

Abstract ResourceCharacteristic

ResourceRequirements

Page 16: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 16/21

OSG Information Services

• Does this sound like something you need to do ? (Users)

• Does this sound reasonable to you? (Site Admins)

Page 17: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 17/21

OSG Information Services

ReSS Deployment on OSG

Click here for live URL

Page 18: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 18/21

OSG Information Services

Status of ReSS

• ReSS is a lightweight Resource Selection Service for push-based job handling systems

• ReSS is deployed on OSG 0.6.0 as a general service: talk to us if you are interested!

• DZero and Engagement VO use ReSS on OSG• ReSS is used by FermiGrid for campus-wide

resource selection• More info at

https://twiki.grid.iu.edu/twiki/bin/view/ResourceSelection/

Page 19: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 19/21

OSG Information Services

What Sites Need to Do

• Configure GIPs correctly so show Green on GIP monitor http://gip-validate.grid.iu.edu/production/index.html

• Make sure VORS reports correct info for your site http://vors.grid.iu.edu/cgi-bin/index.cgi

• Make sure CEMon reports info from your site http://home.fnal.gov/~garzogli/ReSS/ReSS-prd-History.html

• Ask for help from [email protected] if you have any questions or problems!

Page 20: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 20/21

OSG Information Services

What VOs and Users need to do

• Understand parameters needed to select resource where your applications can run

• Interface the Information services to your application

• AND/OR

• use one of the OSG provided resource selectors (details in hidden slides).

Page 21: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 21/21

OSG Information Services

Conclusions

• OSG Information Services exist and are used in patches but the information provided is not yet complete nor uniform.

• We need the Sites to pay attention to the information content and configurations.

• We support Users who want to use any or all of the tools.

• OSG has a focus on Usability and Robustness over the next 12 months

Page 22: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 22/21

OSG Information Services

Additional Slides for More Detailed Information

Page 23: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 23/21

OSG Information Services

User Interaction with ReSS• The ReSS exposes information via condor collector interfaces

– Programmatically:• via a Web Service interface

– Command line, via condor_status• Examples:

https://twiki.grid.iu.edu/twiki/bin/view/ResourceSelection/ReSSUserInterfaceTools

• The Engagement VO gets OSG info from ReSS and does match making via a VO Match Making Service: https://twiki.grid.iu.edu/twiki/bin/view/ResourceSelection/ReSSForEngagementVO

– Condor scheduler interaction with ReSS• See how to connect a scheduler directly to the OSG ReSS (à la DZero):

https://twiki.grid.iu.edu/twiki/bin/view/ResourceSelection/SystemDeployment• See how FermiGrid uses ReSS for campus-wide resource selection:

http://fermigrid.fnal.gov/matchmaking.html– Glue Schema Attributes definition: http://fermigrid.fnal.gov/attributes.html– FermiGrid classads: http://fermigrid.fnal.gov/classads.html

Page 24: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 24/21

OSG Information Services

Glue Schema to old classad Mapping

Site Cluster

CE1

SubCluster1

SubCluster2

CE2

VO1

VO2

VO2

VO3

Mapping the Glue Schema “tree” intoa set of “flat” classads:all possible combination of(Cluster, Subcluster, CE, VO)

Page 25: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 25/21

OSG Information Services

Glue Schema to old classad Mapping

Site Cluster

CE1

SubCluster1

SubCluster2

CE2

VO1

VO2

VO2

VO3

Site

Cluster

SubCluster1

CE1VO1

classad

Mapping the Glue Schema “tree” intoa set of “flat” classads:all possible combination of(Cluster, Subcluster, CE, VO)

Page 26: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 26/21

OSG Information Services

Glue Schema to old classad Mapping

Site Cluster

CE1

SubCluster1

SubCluster2

CE2

VO1

VO2

VO2

VO3

Site

Cluster

SubCluster1

CE1VO1

classad

Site

Cluster

SubCluster2CE1

VO1

classad

Mapping the Glue Schema “tree” intoa set of “flat” classads:All possible combination of(Cluster, Subcluster, CE, VO)

Page 27: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 27/21

OSG Information Services

Glue Schema to old classad Mapping

Site Cluster

CE1

SubCluster1

SubCluster2

CE2

VO1

VO2

VO2

VO3

Site

Cluster

SubCluster1

CE1VO1

classad

Site

Cluster

SubCluster2CE1

VO1

classad

Site

Cluster

SubCluster1CE1

VO2

classad

Mapping the Glue Schema “tree” intoa set of “flat” classads:All possible combination of(Cluster, Subcluster, CE, VO)

Page 28: OSG Information Services,  VO Monitoring Services and  Resource Selection Services

Gabriele Garzoglio, Rob Quick, Chris Green

July 25, 2007 28/21

OSG Information Services

Glue Schema to old classad Mapping

Site Cluster

CE1

SubCluster1

SubCluster2

CE2

VO1

VO2

VO2

VO3

Site

Cluster

SubCluster1

CE1VO1

classad

Site

Cluster

SubCluster2CE1

VO1

classad

Site

Cluster

SubCluster1CE1

VO2

classad

Site

Cluster

SubCluster2CE1

VO2

classad

Site

Cluster

SubCluster1CE2

VO1

classad

Site

Cluster

SubCluster2CE2

VO1

classad

Mapping the Glue Schema “tree” intoa set of “flat” classads:All possible combination of(Cluster, Subcluster, CE, VO)