www.see-grid-sci.eu see-grid-sci antun balaz sa1 leader institute of physics belgrade...

24
www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade [email protected] National, Regional and World- wide Grid eInfrastructures Regional Grid Training University of Belgrade, 24-25 June 2008 The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures contract no. 211338

Upload: margery-maxwell

Post on 17-Dec-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

www.see-grid-sci.eu

SEE-GRID-SCI

Antun BalazSA1 Leader

Institute of Physics [email protected]

National, Regional and World-wideGrid eInfrastructures

Regional Grid TrainingUniversity of Belgrade, 24-25 June 2008

The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures contract no. 211338

Page 2: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

Overview

AEGIS infrastructureSEE-GRID infrastructureSEE-GRID operationsSEE-GRID Operational and monitoring toolsSEE-GRID Service Level AgreementVO managementEGEE infrastructureOther World-wide Grid infrastructures

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 3: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

AEGIS infrastructure (1)

AEGIS01-PHY-SCL, Institute of Physics Belgrade:704 CPUs, 25 TBAEGIS02-RCUB: 12 CPUsAEGIS03-ELEF-LEDA, Faculty of Electronic Engineering, Nis: 4 CPUsAEGIS04-KG, CSANU & University of Kragujevac:42 CPUs, 0.85 TBAEGIS05-ETFBG: 28 CPUsAEGIS06-AOB, Astronomical Observatory Belgrade (retired)AEGIS07-PHY-ATLAS, Institute of Physics Belgrade: 128 CPUsAll core services sufficient for the operation of our national VO AEGIS

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 4: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

AEGIS infrastructure (2)

Core services: AEGIS CA (UOB-RCUB) AEGIS VOMS server (IPB) BDII (IPB) WMS (IPB) LFC (IPB, UOB-RCUB)

Redundant core services are the next stepUser support through SEE-GRID Helpdesk or through national mailing listsSite support through SEE-GRID Helpdesk, through national mailing lists, and in personExtensive guides on operations

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 5: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

AEGIS01-PHY-SCL at IPB (1)

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 6: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

AEGIS01-PHY-SCL at IPB (2)

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 7: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Infrastructure (1)

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 8: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Infrastructure (2)

SEE-GRID infrastructure contains currently the following resources: 35 sites in SEE-GRID production (31 at the end of Y1) CPUs: 2200 total (1150 at the end of Y1) Storage: 57.35 TB (23.94 TB at the end of Y1) Typical machine configuration: dual or quad-core CPUs, with 1GB of RAM

per CPU coreAll sites on gLite-3, with 20 sites already on gLite-3.1 and the rest on gLite-3.0; Scientific Linux used as a base OS (SL4 for gLite-3.1, SL3 for gLite-3.0 services), but others also present (CentOS, Debian)New gLite services deployed: glite-WMS/LB actively used, together with lcg-RB gLite-3.1 lcg-CE tested and deployed on several sites gLite-3.1 SE_dpm largely replaces the old SE_classic

Experience and detailed guides on deploying natively compiled 64-bit architecture gLite services: worker nodes and disk storage servers

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 9: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Infrastructure (3)

SEE-GRID total CPUs, May 2006

– April 2008 (from GStat)

Page 10: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Infrastructure (4)

SEE-GRID Core services Catch-all Certification Authority

enables regional sites to obtain user and host certificates Virtual Organisation Management Service (VOMS),

authorization system for the SEE-GRID Virtual Organisation (VO),

supporting groups and roles deployed two instances (master and slave) for failover

Workload management service (lcg-RB and glite-WMSLB) deployed several instances for failover Information Services (BDII)

deployed several instances for failover MyProxy is operational

supports certificate renewal FTS deployed

used in production

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 11: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Infrastructure (5)

As sites mature, they migrate to EGEE Croatia, Turkey, Serbia, Romania

However, this depends on agencies providing funding for the hardware Each participating institute has its own strategy

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 12: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Operations (1)

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 13: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Operations (2)

Distributed Operations – currently one ROC EGI: SEE ROC probably integrated with the SEE-GRID ROC

Pilot SLA establishedMonitoring and Accounting ToolsHelpdesk tickets procedures Generic support group for users

TPM-like (monitoring open tickets created by users, trying to solve the simple ones, route the tickets, etc.).

Country level user support groups Step towards stand-alone operations

Grid-Operator-On-Duty shifts introduced to improve site availabilities and resolve all operational issues

SEEGRID Wiki with detailed information for site admins: http://wiki.egee-see.org/index.php/SEE-GRID_Wiki

VOMS Role=ops used for SAM jobs submissionRegional Grid Training, University of Belgrade, 24-25 June 2008

Page 14: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Operational & monitoring tools (1)

HGSMHGSM

HELP-DESKHELP-DESK

BDIIBDII

R-GMAR-GMA

SAMSAM

GSTAT(Taiwan)GSTAT

(Taiwan)

VOMSVOMSRTM(UK)

RTM(UK)

Googlemaps

Googlemaps

BBmSAMBBmSAM

GridICEGridICE

MonALISAMonALISA

NAGIOSNAGIOS

WiatGWiatG

AccountingAccounting

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 15: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Operational & monitoring tools (2)

Operational & monitoring tools deployment status Hierarchical Grid Site Management (HGSM) – Turkey Service Availability Monitoring (SAM) (+ porting to MySQL) – Bosnia

and Herzegovina with CERN support Helpdesk - Romania BBmSAM - Bosnia and Herzegovina GridICE – FYR of Macedonia SEE-GRID GoogleEarth – Turkey + ic.ac.uk SEE-GRID GoogleMaps - Turkey Global Grid Information Monitoring System (GStat) – ASGC, Taiwan R-GMA and Accounting Portal – Bulgaria Nagios - Bulgaria Real Time Monitor (RTM) – ic.ac.uk and Turkey (HGSM) MONitoring Agents using a Large Integrated Services Architecture

(MonALISA) – Romania What is at the Grid (WiatG) – CERN with support from Serbia

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 16: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID Operational & monitoring tools (3)

Integration status HGSM+SAM, HGSM+BBmSAM

Automatic creation of list of sites to be tested

HGSM+BDII Automatic creation of list of sites in

the infrastructure HGSM+GStat

Automatic creation of list of sites to be monitored

HGSM+RTM, HGSM+R-GMA Automatic creation of list of sites

monitoring and for accounting VOMS+Helpdesk

Automatically create new user accounts when accessing helpdesk

Certificate based access for Helpdesk

HGSMHGSM

HELP-DESKHELP-DESK

BDIIBDII R-GMAR-GMASAMSAM

GSTATGSTAT

VOMSVOMS

RTMRTMGooglemaps

Googlemaps

BBmSAMBBmSAM

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 17: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

HGSM database

SEE-GRID GOCDB Introduced as a lightweight version of GOCDB Allows us to easily change its format when necessary and to

adapt it to regional needs Allows us to provide custom exports on demand, depending

on operational tools/application developers

Contains statical information about all sitesDeveloped and maintained by TUBITAK-ULAKBIM, Turkey https://hgsm.grid.org.tr/

Used by EUMedGRID, other regional projects expressed interest

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 18: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

<Event name>, <Event location>, <event date> 18/x

BBmSAM portal Created for SLA monitoring

Generating site availability statistics according to several criteria

Overview (HTML) and full dump (CSV) of data possible

Extended into full SAM portal Availability for last 24h period for all sites/services Latest results per service History for nodes/services

BBmobileSAM Optimized for small-screen devices and low bandwidth Possible filtering of sites Possible three levels of details

BBmSAM & BBmobileSAM

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 19: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

SEE-GRID SLA

Hardware and connectivity criteria Min. amount of resources for sites to participate in the

infrastructure Network to fulfill operations test requirements

Level of support Site and security administrators availability and response time

Level of expertise Site and security administrators declaration of expertise

VO support Site to provide support to SEEGRID VO and its OPS role

Conformance to Operational Metrics Site availability Downtimes

SEE-GRID-2 SLA communicated to EGEE

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 20: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

Conformance to SEE-GRID SLA (1)

Availabilities of SEE-GRID CEs

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 21: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

Conformance to SEE-GRID SLA (2)

Weighted availabilities of SEE-GRID CEs

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 22: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

VO Management

Regional catch-all SEEGRID VO Members from all participating institutes Distributed VO management: all countries have VOMS admin

representativesNational VOs Serbia (AEGIS VO) Romania Turkey

Regional VO is supported on all sitesOther regional discipline-oriented VOs will be created soon (SEE-GRID-SCI) Seismology Meteorology Environmental sciences etc.

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 23: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

EGEE infrastructure

250 sites in 50 countries11 federationsMore than 55k CPUsAmount of available storage hard to specify, several thousands of PBAll core services redundantly availableSerbia is part of EGEE-SEE ROC Provides 800+ CPUs, out of region’s 2900 CPUs More details: http://goc.grid.sinica.edu.tw/gstat/

Accounting for the last three months: Serbia provides around 4% of all accounting in EGEEE EGEE-SEE provides around 9.5% of all accounting in EGEE Serbia provides around 52% of all EGEE-SEE accounting

(mostly to AEGIS, SEEGRID, SEE and ATLAS VO)

Regional Grid Training, University of Belgrade, 24-25 June 2008

Page 24: Www.see-grid-sci.eu SEE-GRID-SCI Antun Balaz SA1 Leader Institute of Physics Belgrade antun@phy.bg.ac.yu National, Regional and World-wide Grid eInfrastructures

Other World-wide Grid infrastructures

WLCG – World-wide LHC Computing Grid (EGEE subset, with firm commitments to LHC VOs)D-GridTeraGridOSGDEISA

Regional Grid Training, University of Belgrade, 24-25 June 2008