csf4 meta-scheduler name: zhaohui ding, xiaohui wei email : zhaohui.ding@email.jlu.edu.cn,...

Post on 02-Jan-2016

215 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

CSF4 Meta-SchedulerCSF4 Meta-Scheduler

Name: Zhaohui Ding, Xiaohui WeiName: Zhaohui Ding, Xiaohui WeiEmail : Email : Zhaohui.ding@email.jlu.edu.cnZhaohui.ding@email.jlu.edu.cn, , weixh@jlu.edu.cnweixh@jlu.edu.cn

College of Computer Science & Technology College of Computer Science & Technology Jilin UniversityJilin University

22

AgendaAgenda

Meta-Scheduler & CSF4 IntroductionMeta-Scheduler & CSF4 Introduction

CSF4 ArchitectureCSF4 Architecture

CSF4 FunctionalitiesCSF4 Functionalities

Future WorkFuture Work

Demo and PracticeDemo and Practice

33

What is Meta-SchedulerWhat is Meta-Scheduler

Resource Allocation & ManagementResource Allocation & Management HeterogeneousHeterogeneous DistributedDistributed DynamicDynamic

Local Scheduler VS Meta-schedulerLocal Scheduler VS Meta-scheduler

44

Local Scheduler VS Meta-SchedulerLocal Scheduler VS Meta-SchedulerLocal SchedulerLocal Scheduler Meta SchedulerMeta Scheduler

Administrative Administrative scopescope

Cluster, Single DomainCluster, Single Domain Grid, Multiple Domains,Grid, Multiple Domains,

Virtual OrganizationsVirtual Organizations

Hardware &Hardware &

Software (OS)Software (OS)

HomogeneousHomogeneous Heterogeneous Heterogeneous

OS-independentOS-independent

Data Data managementmanagement

LAN file system (NFS, FLAN file system (NFS, FTP, scp)TP, scp)

Global file systemGlobal file system

(Gridftp, Gfarm)(Gridftp, Gfarm)

Certificates Certificates managementmanagement

Not supportedNot supported SupportedSupported

Resource Resource Management Management

ProtocolProtocol

Specified, Private Specified, Private Protocols for different Protocols for different

local schedulerlocal scheduler

Standard, Open, General-Standard, Open, General-Purpose Protocols Purpose Protocols

(GRAM)(GRAM)

Scheduling Scheduling modemode

CentralizedCentralized Centralized / DistributedCentralized / Distributed

55

Meta-Scheduler VS Local SchedulerMeta-Scheduler VS Local Scheduler

Local SchedulerLocal Scheduler LSF (Load Sharing Facility)LSF (Load Sharing Facility) PBS (Portable Batch System) PBS (Portable Batch System) SGE (Sun Grid Engine)SGE (Sun Grid Engine) CondorCondor IBM LoadlevelerIBM Loadleveler

Meta-Scheduler Meta-Scheduler CSFCSF Maui (Silver)Maui (Silver) GridwayGridway Nimrod-GNimrod-G Condor-GCondor-G

66

What is CSF4What is CSF4

What is CSF Meta-SchedulerWhat is CSF Meta-Scheduler Full Name: Full Name: CCommunity ommunity SScheduler cheduler FFrameworkramework CSF4 contains a group of grid services host in GT4CSF4 contains a group of grid services host in GT4 CSF4 is a full WSRF compliant meta-scheduler.CSF4 is a full WSRF compliant meta-scheduler. Open Source project and can be accessed at Open Source project and can be accessed at

http://sourceforge.net/projects/gcsfhttp://sourceforge.net/projects/gcsf Developed by Jilin University, ChinaDeveloped by Jilin University, China CSF4 has been added to Globus Toolkit 4 as an ExecCSF4 has been added to Globus Toolkit 4 as an Exec

ution Componentution Component

77

CSF4 in Globus Toolkit 4CSF4 in Globus Toolkit 4

88

Meta-Scheduler VS Local SchedulerMeta-Scheduler VS Local Scheduler

CSF4 Meta-Scheduler

Grid Site GT2

LSF

Grid Site GT2

PBS

Grid Site GT4

SGE

Grid Site GT2

Condor……

A typical deployment for Meta-Scheduler and Local SchedulersA typical deployment for Meta-Scheduler and Local Schedulers

99

What CSF4 Can Do?What CSF4 Can Do?

Basic FunctionalitiesBasic Functionalities Submit jobs to Grid without Specifying ClusterSubmit jobs to Grid without Specifying Cluster Monitor and Control JobsMonitor and Control Jobs Provide Queuing ServiceProvide Queuing Service Schedule jobs and resource by custom-built pSchedule jobs and resource by custom-built p

olicesolices CSF4 Portlet (A Web browser based User InteCSF4 Portlet (A Web browser based User Inte

rface)rface)

1010

What CSF4 Can Do? (cont.)What CSF4 Can Do? (cont.)

Advanced FunctionalitiesAdvanced Functionalities Multiple Domains Resource Information Multiple Domains Resource Information

SharingSharing Automatic user credentials delegationAutomatic user credentials delegation Automatic data-staging Automatic data-staging Extensible scheduling policiesExtensible scheduling policies Supporting grid parallel jobs (MPICH-G2)Supporting grid parallel jobs (MPICH-G2)

1111

CSF4 – ArchitectureCSF4 – Architecture

Local Machine

PBS SGE CondorLSFLocal

MachinePBS SGE Condor

: Adapter : Local Scheduler

CSF4 Services

Queuing Service

Resource Manager LSF Service

GramPBS GramCondorGramFork GramSGE

WS-GRAM

gabd

Resource Manager Factory Service

Job Service

Reservation Srevice

GT2 Environment

GateKeeper

GramPBS GramSGE GramCondorGramFork

Resource Manager Gram Service

WS-MDSMeta Information

Grid Environment

GramLSF

1212

CSF4 – Architecture CSF4 – Architecture User view User view

1313

CSF4 – Architecture (CSF4 – Architecture (cont.cont.) ) Resource ViewResource View

PBS Gram SGE Gram LSF Gram

GRAM client

CSF4

Job service

Rsv service

PBS Cluster SGE Cluster LSF Cluster

Globus env

NOTE: One RM Gram is for one cluster onlyNOTE: One RM Gram is for one cluster only

Local Local

clustersclusters

1414

CSF4 – FunctionalitiesCSF4 – Functionalities

Local Scheduler SupportedLocal Scheduler Supported LSFLSF PBSPBS SGESGE CondorCondor

Infrastructure SupportedInfrastructure Supported Globus Toolkit 4Globus Toolkit 4 Globus Toolkit 2Globus Toolkit 2

1515

CSF4 – Functionalities CSF4 – Functionalities Scheduling Plug-in FrameworkScheduling Plug-in Framework

Designed For Queuing ServiceDesigned For Queuing Service

Provide A set of policiesProvide A set of policies

CustomizableCustomizable

ExtensibleExtensible

1616

Existent Scheduling PoliciesExistent Scheduling Policies

FCFS (First Come First Serve) round-robinFCFS (First Come First Serve) round-robin Default policyDefault policy

ThrottleThrottle Restrict the number of jobs in a scheduling cycleRestrict the number of jobs in a scheduling cycle

Array Job Plug-inArray Job Plug-in Design for life science applications (such as AutoDock, BLAST)Design for life science applications (such as AutoDock, BLAST)

MPICH-G2 Plug-inMPICH-G2 Plug-in By using VJM, the plug-in guarantee the synchronized resource By using VJM, the plug-in guarantee the synchronized resource

allocation can be successfulallocation can be successful

Data intensive applications plug-in (under-developing)Data intensive applications plug-in (under-developing)

1717

Schedule plug-in & scheduling policiesSchedule plug-in & scheduling policies

Each policy is implemented inside a scheduling plugiEach policy is implemented inside a scheduling plugin modulen module

A queue can load multiple plugin modulesA queue can load multiple plugin modules

1818

Array JobArray Job

AutoDock and Blast-like applicationsAutoDock and Blast-like applications A large number of sub-jobs.A large number of sub-jobs. Execute same binaryExecute same binary Different input/output filesDifferent input/output files

1919

Array Job (cont.)Array Job (cont.)

AdvantagesAdvantages Submit job only onceSubmit job only once Save submission time and memory storageSave submission time and memory storage

Executable: autodock4Input: hsg.gpfOutput:hsg.glgArray Size: 100

Array Job

CSF4 Meta-scheduler

Executable: autodock4Input: hsg.gpf.1Output:hsg.glg.1

Executable: autodock4Input: hsg.gpf.100Output:hsg.glg.100

Executable: autodock4Input: hsg.gpf.2Output:hsg.glg.2

Submit

......

Split

2020

Resource Information SharingResource Information Sharing

A MDS information provider for CSF4 A MDS information provider for CSF4

Multiple CSF4 can share the resource Multiple CSF4 can share the resource informationinformation

2121

CSF4 – Functionalities (CSF4 – Functionalities (cont.cont.))

Deploy Multiple CSF4 in a Grid CommunityDeploy Multiple CSF4 in a Grid Community

2222

Automatic user credential delegationAutomatic user credential delegation GT4 Delegation ServiceGT4 Delegation Service Full/Limited DelegationFull/Limited Delegation

User Credential DelegationUser Credential Delegation

2323

Automatic user credential Automatic user credential delegationdelegation

2424

Data StagingData Staging

Manual Data StagingManual Data Staging Which clusters I can use?Which clusters I can use? Which clusters my jobs will running on?Which clusters my jobs will running on? Where is the output data?Where is the output data? When will the job finish, so that I can stage-When will the job finish, so that I can stage-

out the output data?out the output data?

2525

Automatic Data StagingAutomatic Data Staging

Stage-in and Stage-out without Meta-SchedulerStage-in and Stage-out without Meta-Scheduler

User

Cluster

Cluster

Cluster

Input DataOutput

Data

Manual Stage In

Submit Job

Manual Stage Out

2626

Automatic Data StagingAutomatic Data Staging

With CSF4 Automatic Data StagingWith CSF4 Automatic Data Staging

User Cluster

Cluster

Cluster

Submit Job

Input Data

Output Data

Submit Job

Gridftp

CSF4 Meta-Scheduler

2727

CSF4 – User interface (CSF4 – User interface (contcont.).)

Command line interfaceCommand line interface

- Services available for grid users:- Services available for grid users: JobServiceJobService

csf-job-create, csf-job-start, csf-job-submit, csf-job-create, csf-job-start, csf-job-submit,

csf-job-status, csf-job-stop, csf-job-resume,csf-job-status, csf-job-stop, csf-job-resume,

csf-job-list, csf-resource-list, csf-mpi-runcsf-job-list, csf-resource-list, csf-mpi-run ReservationServiceReservationService

csf-rsv-create, csf-rsv-status, csf-rsv-cancelcsf-rsv-create, csf-rsv-status, csf-rsv-cancel

csf-job-submit, …csf-job-submit, … QueuingServiceQueuingService

csf-queue-create, csf-queue-conf, csf-queue-datacsf-queue-create, csf-queue-conf, csf-queue-data

2828

CSF4 PortletCSF4 Portlet

Web Browser based InterfaceWeb Browser based Interface

Friendly interfaceFriendly interface

2929

View Available ClustersView Available Clusters

3030

Submit a jobSubmit a job

3131

View Jobs historyView Jobs history

3232

View Job outputView Job output

3333

How to get CSF4?How to get CSF4?

Homepage:Homepage: http://sourceforge.net/projects/gcsfhttp://sourceforge.net/projects/gcsf

Release Version (Recommended) :Release Version (Recommended) : http://sourceforge.net/project/showfiles.php?group_idhttp://sourceforge.net/project/showfiles.php?group_id

=103105=103105

Developing Version (CVS, the newest but not Developing Version (CVS, the newest but not stable)stable) http://sourceforge.net/cvs/?group_id=103105http://sourceforge.net/cvs/?group_id=103105

Globus Toolkit 4 Package :Globus Toolkit 4 Package : In “contrib” directoryIn “contrib” directory

3434

Future WorkFuture Work

Plug-in Framework ImprovementPlug-in Framework Improvement

More scheduling policiesMore scheduling policies

Optimize for Parallel JobOptimize for Parallel Job

Optimize for Data intensive jobOptimize for Data intensive job Workflow supportedWorkflow supported

3535

PracticesPractices

View Credential InfoView Credential Info

Query Available ResourceQuery Available Resource

Submit A Simple JobSubmit A Simple Job

Submit An Array JobSubmit An Array Job

Query Jobs HistoryQuery Jobs History

View Job OutputView Job Output

3636

Thank youThank you

top related