osg information services, vo monitoring services and resource selection services
DESCRIPTION
OSG Information Services, VO Monitoring Services and Resource Selection Services. OSG Information Services Architecture The VO Resource Service (VORS) The OSG Resource Selection Service (ReSS) ClassAd Matchmaking How these affect the Sites How these affect the User. - PowerPoint PPT PresentationTRANSCRIPT
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 1/21
OSG Information Services OSG Information Services, VO Monitoring Services and Resource Selection Services
Gabriele Garzoglio, Chris Green, Computing Division, Fermilab
Rob Quick, Indiana University
OSG User Meeting & OSG Site Administrators Meeting July 2007
• OSG Information Services Architecture• The VO Resource Service (VORS)• The OSG Resource Selection Service (ReSS)• ClassAd Matchmaking• How these affect the Sites• How these affect the User
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 2/21
OSG Information Services
Context
• The OSG Information Services have 4 goals:– Provide static and “real-time” (where real-time is still
evolving) information about Resource configurations and state.
– Feed OSG-wide monitoring tools and provide interfaces to this information for Grid operations, VOs and Users.
– Provide information for interoperation of OSG and EGEE for LHC Experiments and WLCG operations.
– Provide information for resource selection by OSG VOs and Users.
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 3/21
OSG Information Services
Please ask Questions During this talk
We are looking for input, feedback and guidance.
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 4/21
OSG Information Services
OSG IS Architecture
Grid / Site Interface
VO / Grid Interface
Sit
eG
rid
VO
Static Info (LDIF)Info Providers Config…
Configuration
Info FormattingInfo Publishing Site Info Publisher (CEMon)
Generic Info Providers (GIP) Info Gathering
Info CollectionJob / Res. Match ReSS
Info CollectionBDII
Info Coll.WLCG BDII
Job Queue CondorSchedd
Job/Res. Match CondorMatchmaker
Job QueueJob/Res. Match
EGEE ResourceBroker (RB)
Info CollectionInfo DisplayVORS
Info GatheringVORSProbes
Instantiate…
LDIF
Classad
Classad
LDIF
LDIF
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 5/21
OSG Information Services
VORS in OSG IS
Grid / Site Interface
VO / Grid Interface
Sit
eG
rid
VO
Static Info (LDIF)Info Providers Config…
Configuration
Info FormattingInfo Publishing Site Info Publisher (CEMon)
Generic Info Providers (GIP) Info Gathering
Info CollectionJob / Res. Match ReSS
Info CollectionBDII
Info Coll.WLCG BDII
Job Queue CondorSchedd
Job/Res. Match CondorMatchmaker
Job QueueJob/Res. Match
EGEE ResourceBroker (RB)
Info CollectionInfo DisplayVORS
Info GatheringVORSProbes
Instantiate…
LDIF
Classad
Classad
LDIF
LDIF
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 6/21
OSG Information Services
What VORS does for you…
• Allows VO users to pick which sites support their VO
• Provides critical site info to a VO user
• Gives users a snapshot of current grid and site status
• Will provide a facility for users to look at other Grids from an OSG PO
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 7/21
OSG Information Services
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 8/21
OSG Information Services
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 9/21
OSG Information Services
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 10/21
OSG Information Services
ReSS in OSG IS
Grid / Site Interface
VO / Grid Interface
Sit
eG
rid
VO
Static Info (LDIF)Info Providers Config…
Configuration
Info FormattingInfo Publishing Site Info Publisher (CEMon)
Generic Info Providers (GIP) Info Gathering
Info CollectionJob / Res. Match ReSS
Info CollectionBDII
Info Coll.WLCG BDII
Job Queue CondorSchedd
Job/Res. Match CondorMatchmaker
Job QueueJob/Res. Match
EGEE ResourceBroker (RB)
Info CollectionInfo DisplayVORS
Info GatheringVORSProbes
Instantiate…
LDIF
Classad
Classad
LDIF
LDIF
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 11/21
OSG Information Services
ReSS Motivations
• Implement a light-weight cluster selector for push-based job handling services
• Enable users to express requirements on the resources in the job description
• Enable users to refer to abstract characteristics of the resources in the job description
• Provide soft-registration for clusters• Use the standard characterizations of the
resources via the Glue Schema
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 12/21
OSG Information Services
ReSS Technology
• ReSS basis its central services on the Condor Match-making service– Users of Condor-G naturally integrate their scheduler servers
with ReSS– Condor information collector manages resource soft registration
• Resource characteristics is handled at sites by the EGEE gLite CE Monitor Service (CEMon)– CEmon registers with the central ReSS services at startup– Info is gathered by CEMon at sites running Generic Information
Prividers (GIP)– GIP expresses resource information via the Glue Schema model– CEMon converts the information from GIP into old classad
format. Other supported formats: XML, LDIF, new classad– CEMon publishes information using web services interfaces
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 13/21
OSG Information Services
A case study:VO Schedd to interact with ReSS
Grid / Site Interface
VO / Grid Interface
Sit
eG
rid
VO
Static Info (LDIF)Info Providers Config…
Configuration
Info FormattingInfo Publishing Site Info Publisher (CEMon)
Generic Info Providers (GIP) Info Gathering
Info CollectionJob / Res. Match ReSS
Info CollectionBDII
Info Coll.WLCG BDII
Job Queue CondorSchedd
Job/Res. Match CondorMatchmaker
Job QueueJob/Res. Match
EGEE ResourceBroker (RB)
Info CollectionInfo DisplayVORS
Info GatheringVORSProbes
Instantiate…
LDIF
Classad
Classad
LDIF
LDIF
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 14/21
OSG Information Services
VO Condor-Schedd interacts with ReSS
CondorMatch Maker
InfoGatherer
classads
classads classads classads
CondorScheduler
jobWhat Gate?
Gate 3
job
CEMon
CE
Gate1
job-managersjob-managersjob-managers
jobs info
CLUSTER
GIP
CEMon
CE
Gate2
job-managersjob-managersjob-managers
jobs info
CLUSTER
GIP
CEMon
CE
Gate3
job-managersjob-managersjob-managers
jobs info
CLUSTER
GIP
ReSS
• Info Gatherer is the Interface Adapter between CEMon and Condor
Grid / Site Interface
VO
/ G
rid
Inte
rfa
ce
Gri
dS
ite
VO
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 15/21
OSG Information Services
User Interacts with Schedd and ReSSuniverse = globusglobusscheduler = $$(GlueCEInfoContactString) requirements = TARGET.GlueCEAccessControlBaseRule == "VO:DZero"executable = /bin/hostnamearguments = -fqueue
MyType = "Machine"Name = "antaeus.hpcc.ttu.edu:2119/jobmanager-lsf-dzero.-1194963282"Requirements = (CurMatches < 10)ReSSVersion = "1.0.6"TargetType = "Job"GlueSiteName = "TTU-ANTAEUS"GlueSiteUniqueID = "antaeus.hpcc.ttu.edu"GlueCEName = "dzero"GlueCEUniqueID = "antaeus.hpcc.ttu.edu:2119/jobmanager-lsf-dzero"GlueCEInfoContactString = "antaeus.hpcc.ttu.edu:2119/jobmanager-lsf"GlueCEAccessControlBaseRule = "VO:dzero"GlueCEHostingCluster = "antaeus.hpcc.ttu.edu"GlueCEInfoApplicationDir = "/mnt/lustre/antaeus/appsGlueCEInfoDataDir = "/mnt/hep/osg"GlueCEInfoDefaultSE = "sigmorgh.hpcc.ttu.edu"GlueCEInfoLRMSType = "lsf"GlueCEPolicyMaxCPUTime = 6000GlueCEStateStatus = "Production"GlueCEStateFreeCPUs = 0GlueCEStateRunningJobs = 0GlueCEStateTotalJobs = 0GlueCEStateWaitingJobs = 0GlueClusterName = "antaeus.hpcc.ttu.edu"GlueSubClusterWNTmpDir = "/tmp"GlueHostApplicationSoftwareRunTimeEnvironment = "MountPoints,VO-cms-CMSSW_1_2_3"GlueHostMainMemoryRAMSize = 512GlueHostNetworkAdapterInboundIP = FALSEGlueHostNetworkAdapterOutboundIP = TRUEGlueHostOperatingSystemName = "CentOS"GlueHostProcessorClockSpeed = 1000GlueSchemaVersionMajor = 1…
Resource DescriptionResource Description
Job DescriptionJob Description
Abstract ResourceCharacteristic
ResourceRequirements
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 16/21
OSG Information Services
• Does this sound like something you need to do ? (Users)
• Does this sound reasonable to you? (Site Admins)
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 17/21
OSG Information Services
ReSS Deployment on OSG
Click here for live URL
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 18/21
OSG Information Services
Status of ReSS
• ReSS is a lightweight Resource Selection Service for push-based job handling systems
• ReSS is deployed on OSG 0.6.0 as a general service: talk to us if you are interested!
• DZero and Engagement VO use ReSS on OSG• ReSS is used by FermiGrid for campus-wide
resource selection• More info at
https://twiki.grid.iu.edu/twiki/bin/view/ResourceSelection/
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 19/21
OSG Information Services
What Sites Need to Do
• Configure GIPs correctly so show Green on GIP monitor http://gip-validate.grid.iu.edu/production/index.html
• Make sure VORS reports correct info for your site http://vors.grid.iu.edu/cgi-bin/index.cgi
• Make sure CEMon reports info from your site http://home.fnal.gov/~garzogli/ReSS/ReSS-prd-History.html
• Ask for help from [email protected] if you have any questions or problems!
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 20/21
OSG Information Services
What VOs and Users need to do
• Understand parameters needed to select resource where your applications can run
• Interface the Information services to your application
• AND/OR
• use one of the OSG provided resource selectors (details in hidden slides).
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 21/21
OSG Information Services
Conclusions
• OSG Information Services exist and are used in patches but the information provided is not yet complete nor uniform.
• We need the Sites to pay attention to the information content and configurations.
• We support Users who want to use any or all of the tools.
• OSG has a focus on Usability and Robustness over the next 12 months
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 22/21
OSG Information Services
Additional Slides for More Detailed Information
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 23/21
OSG Information Services
User Interaction with ReSS• The ReSS exposes information via condor collector interfaces
– Programmatically:• via a Web Service interface
– Command line, via condor_status• Examples:
https://twiki.grid.iu.edu/twiki/bin/view/ResourceSelection/ReSSUserInterfaceTools
• The Engagement VO gets OSG info from ReSS and does match making via a VO Match Making Service: https://twiki.grid.iu.edu/twiki/bin/view/ResourceSelection/ReSSForEngagementVO
– Condor scheduler interaction with ReSS• See how to connect a scheduler directly to the OSG ReSS (à la DZero):
https://twiki.grid.iu.edu/twiki/bin/view/ResourceSelection/SystemDeployment• See how FermiGrid uses ReSS for campus-wide resource selection:
http://fermigrid.fnal.gov/matchmaking.html– Glue Schema Attributes definition: http://fermigrid.fnal.gov/attributes.html– FermiGrid classads: http://fermigrid.fnal.gov/classads.html
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 24/21
OSG Information Services
Glue Schema to old classad Mapping
Site Cluster
CE1
SubCluster1
SubCluster2
CE2
VO1
VO2
VO2
VO3
…
Mapping the Glue Schema “tree” intoa set of “flat” classads:all possible combination of(Cluster, Subcluster, CE, VO)
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 25/21
OSG Information Services
Glue Schema to old classad Mapping
Site Cluster
CE1
SubCluster1
SubCluster2
CE2
VO1
VO2
VO2
VO3
Site
Cluster
SubCluster1
CE1VO1
classad
…
Mapping the Glue Schema “tree” intoa set of “flat” classads:all possible combination of(Cluster, Subcluster, CE, VO)
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 26/21
OSG Information Services
Glue Schema to old classad Mapping
Site Cluster
CE1
SubCluster1
SubCluster2
CE2
VO1
VO2
VO2
VO3
Site
Cluster
SubCluster1
CE1VO1
classad
Site
Cluster
SubCluster2CE1
VO1
classad
…
Mapping the Glue Schema “tree” intoa set of “flat” classads:All possible combination of(Cluster, Subcluster, CE, VO)
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 27/21
OSG Information Services
Glue Schema to old classad Mapping
Site Cluster
CE1
SubCluster1
SubCluster2
CE2
VO1
VO2
VO2
VO3
Site
Cluster
SubCluster1
CE1VO1
classad
Site
Cluster
SubCluster2CE1
VO1
classad
Site
Cluster
SubCluster1CE1
VO2
classad
…
Mapping the Glue Schema “tree” intoa set of “flat” classads:All possible combination of(Cluster, Subcluster, CE, VO)
Gabriele Garzoglio, Rob Quick, Chris Green
July 25, 2007 28/21
OSG Information Services
Glue Schema to old classad Mapping
Site Cluster
CE1
SubCluster1
SubCluster2
CE2
VO1
VO2
VO2
VO3
Site
Cluster
SubCluster1
CE1VO1
classad
Site
Cluster
SubCluster2CE1
VO1
classad
Site
Cluster
SubCluster1CE1
VO2
classad
Site
Cluster
SubCluster2CE1
VO2
classad
Site
Cluster
SubCluster1CE2
VO1
classad
Site
Cluster
SubCluster2CE2
VO1
classad
…
Mapping the Glue Schema “tree” intoa set of “flat” classads:All possible combination of(Cluster, Subcluster, CE, VO)