staged roll-out: status report
DESCRIPTION
Staged Roll-out: status report. Antonio Retico SA1 Coordination Meeting Barcelona, 24 Sep 2009. Contents. Good Afternoon. Transition to staged roll-out General progress report Detail on changes since end of July Points for discussion References. Recall. - PowerPoint PPT PresentationTRANSCRIPT
EGEE-III INFSO-RI-222667
Enabling Grids for E-sciencE
www.eu-egee.org
Antonio Retico
SA1 Coordination Meeting Barcelona, 24 Sep 2009
Staged Roll-out: status report
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
Contents
• Transition to staged roll-out– General progress report– Detail on changes since end of July
• Points for discussion
• References
Antonio Retico - SA1 coordination meeting - 24th Sep 2009 2
Good Afternoon
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
Recall
• Presentation on staged roll-out process at EGEE09– http://indico.cern.ch/contributionDisplay.py?contribId=373&sessi
onId=84&confId=55893
• Now following the transition plan for SA1 – https://twiki.cern.ch/twiki/bin/view/EGEE/StagedRolloutSA1
• Progresses on plan presented the 28th of July– http://indico.cern.ch/materialDisplay.py?sessionId=2&materialId=
2&confId=64396
Antonio Retico - SA1 coordination meeting - 24th Sep 2009 3
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
consolidation
EGEE-SA1 Coordination Meeting – 28th Jul 2009 4
Rough timeline [4]
30 Jun 31 Jul 31 Aug 30 Sep 31 Oct 30 Nov 31 Dec
preparation
transition 1
transition 2
workplan ready
All sites meeting
repos ready
EGEE09
GOCDB4?
Topology DB?
LHC start?
Transition plan Coordination
with SA3 Requirements
for GOCDB4 Prepare release
documentation Adapt PPS tools
task-based reporting Populate PPS registry Documentation
Management procedures
Test reports pages Start the operations Discontinue PPS
deployment test Sam and GridMap
displays
Commitments into GOCDB
Modify PPS tools
Transfer resource mgmt to ROCs/NGI
Add more PROD sites
Interface with regional MW re-distributions
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
Preparation tasks (Jun-Aug)
Agree with SA3 on the general lines (5th Jun) Prepare transition plan (2nd Jul) Requirements for GOCDB4 (2nd July) Agree with SA3 on timelines for repositories/release
pages (13th July)• Prepare release documentation
– Service-oriented release pages (MODIFIED)– Repositories– on SA3, due by the end of August
Configure PPS tools ( 20th August) Registry, task manager, templates, documentation on Antonio, due by mid August
EGEE-SA1 Coordination Meeting – 28th Jul 2009 5
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
Release Pages
• UMD release structure still under discussion• Several assumptions fell. E.g.:
– Service-oriented independent repositories– Update number
• Situation under control– Full convergence with SA3 on immediate technicalities– Release tools ready (in theory)– But never tried yet
Antonio Retico - SA1 coordination meeting - 24th Sep 2009 6
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
Transition tasks 1 (Aug-Oct)
Start task-based reporting for deployment testing (11th Aug) Change of habit for sites All-sites meeting to discuss it (CANCELED)
Re-configuration of sites in the PPS registry (20th Aug) Update documentation pages
Management procedures (8th Sep) Test reports pages (17th Sep)
• As soon as repos ready (31 Aug): start the operations– Exercising with all upcoming releases– Refine the procedures– Discuss at EGEE conference– Ideally PPS deployment test completely abandoned by mid September
• Set-up Sam and GridMap displays– Config changes done in BDII – No changes required in SAM and Gridmap– Instance of SAM Portal to be modified (NEW)
Antonio Retico - SA1 coordination meeting - 24th Sep 2009 7
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
Deployment test tasks
• 4 PPS Updates 74 tasks issued– 48 Done– 15 rejected (invalid)– 11 Pending (ghosts)
• Rejections mostly due to wrong assignments– E.g. wrong version of OS/architecture– Fixed in PPS registry – What about GOCDB ?
Antonio Retico - SA1 coordination meeting - 24th Sep 2009 8
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667 Antonio Retico - SA1 coordination meeting - 24th Sep 2009 9
New documents
• ASTAS (Automatic Savannah TAsk Submission) – Wrapper of the savannah “API” from SA3– Mini tutorial
https://twiki.cern.ch/twiki/bin/view/Main/AstasMiniTutorial
– A simple user guide for the “release” use case On AFS:
/afs/cern.ch/project/gd/egee/www/preproduction/ActivityManagement/astas/usage_v2.txt
• Automatically generated Test Report pages– Available for the latest PPS releases– www.cern.ch/pps/index.php?dir=./ActivityManagement/astas/
REPORTS/
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667 Antonio Retico - SA1 coordination meeting - 24th Sep 2009 10
Starting Release Operations
• In theory everything is ready. We could:– Suspend PPS deployment test– Replace it on the fly with staged roll-out
• A lot of conflicts with other releases during August– Urgent fixes to be managed with the “stable” process– Many others coming in the next month.– Difficult for us and SA3 to find a “quiet” slot to test the procedure
• A “staged” release is in the pipeline– https://gus.fzk.de/ws/ticket_info.php?ticket=51579 – Partially overlapped with deployment test– PPS sites may expect some duplication of tasks (sorry!)
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
SAM and Gridmap displays
• All involved sites tested with SAM
• Single SAM display still to be configured– Not a big deal (I hope)
Antonio Retico - SA1 coordination meeting - 24th Sep 2009 11
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
Transition tasks 2 (Oct-Nov)
• Transfer commitments from PPS registry to GOCDB– On the sites
• Topology db (Canceled)– Export of sites GOCDB Topology DB– Adaptation of displays (SAM, Gridmap, lists on websites …) (?)
• Adaptation of the PPS tools (ASTAS) (NEW)– Registration info taken from GOCDB– On Antonio
EGEE-SA1 Coordination Meeting – 28th Jul 2009 12
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
Consolidation tasks (Nov-Apr)
• Recruit more production sites– On the ROCs and SA1 coord
• Interface to regional MW distributions ? (NEW)• Training for ROCs/NGIs
– Use of tools for local programs
Antonio Retico - SA1 coordination meeting - 24th Sep 2009 13
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
Discussion points
• Support for OS/arch in commitment registry– Implemented in PPS registry– Feasible in GOCDB ?– If not, be prepared to receive invalid tasks. E.g.
Site ITWM supports SL5 WN Site ITWM receives deployment tasks for SL4 WNs
• PPS registry supports ACLs– Anyone willing to try and maintain their own commitments ?– Or better to wait for GOCDB?
• For example– Release managers of Local Distributions start getting involved
Antonio Retico - SA1 coordination meeting - 24th Sep 2009 14
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667
References
EGEE-SA1 Coordination Meeting – 28th Jul 2009 16
• [1] EGI: Managing the Software Process
http://indico.cern.ch/getFile.py/access?sessionId=2&resId=0&materialId=1&confId=57092
• [2] SA1: proposal and requirements for staged-roll-out of middleware updates
https://edms.cern.ch/document/997514/
• [3] SA1/SA3: Staged roll-out of grid middleware: general lines
https://twiki.cern.ch/twiki/bin/view/EGEE/StagedRolloutOverview
• [4] SA1: Implementation details and roadmap
https://twiki.cern.ch/twiki/bin/view/EGEE/StagedRolloutSA1
• All of them available on the PPS web site
http://www.cern.ch/pps/index.php?dir=./rollout/
Enabling Grids for E-sciencE
EGEE-III INFSO-RI-222667 Antonio Retico - SA1 coordination meeting - 24th Sep 2009 17
Questions?