csf4 meta-scheduler name: zhaohui ding, xiaohui wei email : [email protected],...
TRANSCRIPT
CSF4 Meta-SchedulerCSF4 Meta-Scheduler
Name: Zhaohui Ding, Xiaohui WeiName: Zhaohui Ding, Xiaohui WeiEmail : Email : [email protected]@email.jlu.edu.cn, , [email protected]@jlu.edu.cn
College of Computer Science & Technology College of Computer Science & Technology Jilin UniversityJilin University
22
AgendaAgenda
Meta-Scheduler & CSF4 IntroductionMeta-Scheduler & CSF4 Introduction
CSF4 ArchitectureCSF4 Architecture
CSF4 FunctionalitiesCSF4 Functionalities
Future WorkFuture Work
Demo and PracticeDemo and Practice
33
What is Meta-SchedulerWhat is Meta-Scheduler
Resource Allocation & ManagementResource Allocation & Management HeterogeneousHeterogeneous DistributedDistributed DynamicDynamic
Local Scheduler VS Meta-schedulerLocal Scheduler VS Meta-scheduler
44
Local Scheduler VS Meta-SchedulerLocal Scheduler VS Meta-SchedulerLocal SchedulerLocal Scheduler Meta SchedulerMeta Scheduler
Administrative Administrative scopescope
Cluster, Single DomainCluster, Single Domain Grid, Multiple Domains,Grid, Multiple Domains,
Virtual OrganizationsVirtual Organizations
Hardware &Hardware &
Software (OS)Software (OS)
HomogeneousHomogeneous Heterogeneous Heterogeneous
OS-independentOS-independent
Data Data managementmanagement
LAN file system (NFS, FLAN file system (NFS, FTP, scp)TP, scp)
Global file systemGlobal file system
(Gridftp, Gfarm)(Gridftp, Gfarm)
Certificates Certificates managementmanagement
Not supportedNot supported SupportedSupported
Resource Resource Management Management
ProtocolProtocol
Specified, Private Specified, Private Protocols for different Protocols for different
local schedulerlocal scheduler
Standard, Open, General-Standard, Open, General-Purpose Protocols Purpose Protocols
(GRAM)(GRAM)
Scheduling Scheduling modemode
CentralizedCentralized Centralized / DistributedCentralized / Distributed
55
Meta-Scheduler VS Local SchedulerMeta-Scheduler VS Local Scheduler
Local SchedulerLocal Scheduler LSF (Load Sharing Facility)LSF (Load Sharing Facility) PBS (Portable Batch System) PBS (Portable Batch System) SGE (Sun Grid Engine)SGE (Sun Grid Engine) CondorCondor IBM LoadlevelerIBM Loadleveler
Meta-Scheduler Meta-Scheduler CSFCSF Maui (Silver)Maui (Silver) GridwayGridway Nimrod-GNimrod-G Condor-GCondor-G
66
What is CSF4What is CSF4
What is CSF Meta-SchedulerWhat is CSF Meta-Scheduler Full Name: Full Name: CCommunity ommunity SScheduler cheduler FFrameworkramework CSF4 contains a group of grid services host in GT4CSF4 contains a group of grid services host in GT4 CSF4 is a full WSRF compliant meta-scheduler.CSF4 is a full WSRF compliant meta-scheduler. Open Source project and can be accessed at Open Source project and can be accessed at
http://sourceforge.net/projects/gcsfhttp://sourceforge.net/projects/gcsf Developed by Jilin University, ChinaDeveloped by Jilin University, China CSF4 has been added to Globus Toolkit 4 as an ExecCSF4 has been added to Globus Toolkit 4 as an Exec
ution Componentution Component
77
CSF4 in Globus Toolkit 4CSF4 in Globus Toolkit 4
88
Meta-Scheduler VS Local SchedulerMeta-Scheduler VS Local Scheduler
CSF4 Meta-Scheduler
Grid Site GT2
LSF
Grid Site GT2
PBS
Grid Site GT4
SGE
Grid Site GT2
Condor……
A typical deployment for Meta-Scheduler and Local SchedulersA typical deployment for Meta-Scheduler and Local Schedulers
99
What CSF4 Can Do?What CSF4 Can Do?
Basic FunctionalitiesBasic Functionalities Submit jobs to Grid without Specifying ClusterSubmit jobs to Grid without Specifying Cluster Monitor and Control JobsMonitor and Control Jobs Provide Queuing ServiceProvide Queuing Service Schedule jobs and resource by custom-built pSchedule jobs and resource by custom-built p
olicesolices CSF4 Portlet (A Web browser based User InteCSF4 Portlet (A Web browser based User Inte
rface)rface)
1010
What CSF4 Can Do? (cont.)What CSF4 Can Do? (cont.)
Advanced FunctionalitiesAdvanced Functionalities Multiple Domains Resource Information Multiple Domains Resource Information
SharingSharing Automatic user credentials delegationAutomatic user credentials delegation Automatic data-staging Automatic data-staging Extensible scheduling policiesExtensible scheduling policies Supporting grid parallel jobs (MPICH-G2)Supporting grid parallel jobs (MPICH-G2)
1111
CSF4 – ArchitectureCSF4 – Architecture
Local Machine
PBS SGE CondorLSFLocal
MachinePBS SGE Condor
: Adapter : Local Scheduler
CSF4 Services
Queuing Service
Resource Manager LSF Service
GramPBS GramCondorGramFork GramSGE
WS-GRAM
gabd
Resource Manager Factory Service
Job Service
Reservation Srevice
GT2 Environment
GateKeeper
GramPBS GramSGE GramCondorGramFork
Resource Manager Gram Service
WS-MDSMeta Information
Grid Environment
GramLSF
1212
CSF4 – Architecture CSF4 – Architecture User view User view
1313
CSF4 – Architecture (CSF4 – Architecture (cont.cont.) ) Resource ViewResource View
PBS Gram SGE Gram LSF Gram
GRAM client
CSF4
Job service
Rsv service
PBS Cluster SGE Cluster LSF Cluster
Globus env
NOTE: One RM Gram is for one cluster onlyNOTE: One RM Gram is for one cluster only
Local Local
clustersclusters
1414
CSF4 – FunctionalitiesCSF4 – Functionalities
Local Scheduler SupportedLocal Scheduler Supported LSFLSF PBSPBS SGESGE CondorCondor
Infrastructure SupportedInfrastructure Supported Globus Toolkit 4Globus Toolkit 4 Globus Toolkit 2Globus Toolkit 2
1515
CSF4 – Functionalities CSF4 – Functionalities Scheduling Plug-in FrameworkScheduling Plug-in Framework
Designed For Queuing ServiceDesigned For Queuing Service
Provide A set of policiesProvide A set of policies
CustomizableCustomizable
ExtensibleExtensible
1616
Existent Scheduling PoliciesExistent Scheduling Policies
FCFS (First Come First Serve) round-robinFCFS (First Come First Serve) round-robin Default policyDefault policy
ThrottleThrottle Restrict the number of jobs in a scheduling cycleRestrict the number of jobs in a scheduling cycle
Array Job Plug-inArray Job Plug-in Design for life science applications (such as AutoDock, BLAST)Design for life science applications (such as AutoDock, BLAST)
MPICH-G2 Plug-inMPICH-G2 Plug-in By using VJM, the plug-in guarantee the synchronized resource By using VJM, the plug-in guarantee the synchronized resource
allocation can be successfulallocation can be successful
Data intensive applications plug-in (under-developing)Data intensive applications plug-in (under-developing)
1717
Schedule plug-in & scheduling policiesSchedule plug-in & scheduling policies
Each policy is implemented inside a scheduling plugiEach policy is implemented inside a scheduling plugin modulen module
A queue can load multiple plugin modulesA queue can load multiple plugin modules
1818
Array JobArray Job
AutoDock and Blast-like applicationsAutoDock and Blast-like applications A large number of sub-jobs.A large number of sub-jobs. Execute same binaryExecute same binary Different input/output filesDifferent input/output files
1919
Array Job (cont.)Array Job (cont.)
AdvantagesAdvantages Submit job only onceSubmit job only once Save submission time and memory storageSave submission time and memory storage
Executable: autodock4Input: hsg.gpfOutput:hsg.glgArray Size: 100
Array Job
CSF4 Meta-scheduler
Executable: autodock4Input: hsg.gpf.1Output:hsg.glg.1
Executable: autodock4Input: hsg.gpf.100Output:hsg.glg.100
Executable: autodock4Input: hsg.gpf.2Output:hsg.glg.2
Submit
......
Split
2020
Resource Information SharingResource Information Sharing
A MDS information provider for CSF4 A MDS information provider for CSF4
Multiple CSF4 can share the resource Multiple CSF4 can share the resource informationinformation
2121
CSF4 – Functionalities (CSF4 – Functionalities (cont.cont.))
Deploy Multiple CSF4 in a Grid CommunityDeploy Multiple CSF4 in a Grid Community
2222
Automatic user credential delegationAutomatic user credential delegation GT4 Delegation ServiceGT4 Delegation Service Full/Limited DelegationFull/Limited Delegation
User Credential DelegationUser Credential Delegation
2323
Automatic user credential Automatic user credential delegationdelegation
2424
Data StagingData Staging
Manual Data StagingManual Data Staging Which clusters I can use?Which clusters I can use? Which clusters my jobs will running on?Which clusters my jobs will running on? Where is the output data?Where is the output data? When will the job finish, so that I can stage-When will the job finish, so that I can stage-
out the output data?out the output data?
2525
Automatic Data StagingAutomatic Data Staging
Stage-in and Stage-out without Meta-SchedulerStage-in and Stage-out without Meta-Scheduler
User
Cluster
Cluster
Cluster
Input DataOutput
Data
Manual Stage In
Submit Job
Manual Stage Out
2626
Automatic Data StagingAutomatic Data Staging
With CSF4 Automatic Data StagingWith CSF4 Automatic Data Staging
User Cluster
Cluster
Cluster
Submit Job
Input Data
Output Data
Submit Job
Gridftp
CSF4 Meta-Scheduler
2727
CSF4 – User interface (CSF4 – User interface (contcont.).)
Command line interfaceCommand line interface
- Services available for grid users:- Services available for grid users: JobServiceJobService
csf-job-create, csf-job-start, csf-job-submit, csf-job-create, csf-job-start, csf-job-submit,
csf-job-status, csf-job-stop, csf-job-resume,csf-job-status, csf-job-stop, csf-job-resume,
csf-job-list, csf-resource-list, csf-mpi-runcsf-job-list, csf-resource-list, csf-mpi-run ReservationServiceReservationService
csf-rsv-create, csf-rsv-status, csf-rsv-cancelcsf-rsv-create, csf-rsv-status, csf-rsv-cancel
csf-job-submit, …csf-job-submit, … QueuingServiceQueuingService
csf-queue-create, csf-queue-conf, csf-queue-datacsf-queue-create, csf-queue-conf, csf-queue-data
2828
CSF4 PortletCSF4 Portlet
Web Browser based InterfaceWeb Browser based Interface
Friendly interfaceFriendly interface
2929
View Available ClustersView Available Clusters
3030
Submit a jobSubmit a job
3131
View Jobs historyView Jobs history
3232
View Job outputView Job output
3333
How to get CSF4?How to get CSF4?
Homepage:Homepage: http://sourceforge.net/projects/gcsfhttp://sourceforge.net/projects/gcsf
Release Version (Recommended) :Release Version (Recommended) : http://sourceforge.net/project/showfiles.php?group_idhttp://sourceforge.net/project/showfiles.php?group_id
=103105=103105
Developing Version (CVS, the newest but not Developing Version (CVS, the newest but not stable)stable) http://sourceforge.net/cvs/?group_id=103105http://sourceforge.net/cvs/?group_id=103105
Globus Toolkit 4 Package :Globus Toolkit 4 Package : In “contrib” directoryIn “contrib” directory
3434
Future WorkFuture Work
Plug-in Framework ImprovementPlug-in Framework Improvement
More scheduling policiesMore scheduling policies
Optimize for Parallel JobOptimize for Parallel Job
Optimize for Data intensive jobOptimize for Data intensive job Workflow supportedWorkflow supported
3535
PracticesPractices
View Credential InfoView Credential Info
Query Available ResourceQuery Available Resource
Submit A Simple JobSubmit A Simple Job
Submit An Array JobSubmit An Array Job
Query Jobs HistoryQuery Jobs History
View Job OutputView Job Output
3636
Thank youThank you