key activities. mnd sections
DESCRIPTION
Key Activities. MND sections. Group meeting 13.08.08. Overview. Yesterday section meeting showed substantial progress in many directions: Central repository of monitoring metrics for LHC experiments (Elisa and Dashboard team, intersection development ) - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/1.jpg)
Key Activities. MND sections
Group meeting 13.08.08
![Page 2: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/2.jpg)
Overview• Yesterday section meeting showed substantial progress in many
directions:
- Central repository of monitoring metrics for LHC experiments (Elisa and Dashboard team, intersection development )
- GridMap view for site administrators (Elisa, Max, Dashboard team)- Testing of LB notification system (Olga Kodolova and LB Team)- Adapting CMS SAM availability applications for other LHC VOs (William)- Development of the Grid Messaging System and support of pilot users and applications (Daniel)- New features of ATLAS DDM and ProdSys monitoring. (Ricardo, Benjamin, Lu, Markus)- Creating of ATLAS Grid Information System (AGIS) and integration of Dashboard applications
with it (Raquel, Ricardo, Benjamin)- A lot of improvements to the CMS Site Status Board (Pablo)- Improvements of the CMS Interactive Job Monitoring application, following the requests of the
CMS community (Anastasia, Julia)- Improvements to the CMS schema and queries, fixing problems causing high DB load during
CCRC08 (Irina, Julia)- Use of GMS for propagating of job status reports from the worker node to the Dashboard DB
(Bernd)- Improvements for Data Transfer Reliability for ALICE experiments (Spyridon)
![Page 3: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/3.jpg)
Summer students
• We have four summer students in the MND section this year: Lu feng, Bernd Schodel, Markus Huber, Spyridon Koulozis
• Very good contribution to the sections work!
• See details about student presentation at yesterday section meeting:
http://indico.cern.ch/conferenceDisplay.py?confId=39049
![Page 4: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/4.jpg)
Central repository for common metrics and site view
• Objective of this activity• ● Provide a comprehensive monitoring tool which, from one unique
console,• gives an overview of the overall status of the services in their site. This• should be a tool easy to use, also for persons external to the VO, and
which• does not require a particular knowledge of each experiment.• ● This tool will extract information from the experiment specific tools• (Dashboard, MonALISA, Dirac, Phedex) and will display it in a consistent• way.• ● It will display the information using the gridmap technology and will
provide links to the source of the information• ● An additional requirement from sites is to have a clear definition of
the• targets from the experiments: this information should be displayed by
this• tool (but it has to be clearly defined and provided by the experiments)
![Page 5: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/5.jpg)
![Page 6: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/6.jpg)
Testing of LB notification
• We are first pilot users
• In case of success LB notification should replace RGMA and IC XML sources for job status changes information for jobs submitted via LCG WMS.
![Page 7: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/7.jpg)
Adapting of CMS SAM availability application for other VOs
• UI to follow the results of SAM tests is already in production for ATLAS
• Is coming for LHCb and ALICE
• Creating the collectors for topology info for LHC experiments to be introduced to SAM DB (William for ATLAS, Julia for CMS)
![Page 8: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/8.jpg)
ATLAS Data Management• Integration with AGIS• Collection of OSG scheduled downtimes• Notification messages for cloud activity / performance
– Delivery using the dashboard messaging API• Part of maintenance now given to ATLAS shifters and experts• Activities for dataset subscriptions (in development)
– Delivery by the end of August– New mandatory ‘activity’ field when issuing subscriptions– Transition to a single instance of the DDM dashboard – new
filters by activity instead of multiple instances• New statistics in the main display (Lu Feng)
![Page 9: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/9.jpg)
ATLAS Production System
• Integration with AGIS• RSS feed exposing problematic sites /
tasks• Cloud / site admin display (Lu Feng)• Grouping error messages by similarity
– Better than proddb error codes which are not always useful
• Improved shifter’s display• Reporting of pilot job details via python
API (issuing HTTP requests)
![Page 10: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/10.jpg)
AGIS• ATLAS Grid Information System• Built using the dashboard framework• Collection of site information
– From GOCDB for EGEE and OIM for OSG• Collection of service information
– From the LCG and OSG BDIIs• Collection of ATLAS cloud and tier information
– From the TiersOfATLAS and Panda database– Stopped after a transition period when update of info directly in
AGIS is fully tested• Querying available via a python API, command line tools,
direct HTTP with multiple output formats (including ToA)• Updating via secure HTTP calls• Prototype: http://agis.cern.ch/dashboard/request.py/agis
![Page 11: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/11.jpg)
ATLAS Tier0 Dashboard
• Developed in collaboration with the ATLAS T0 team (Guido Negri)
• T0 data exposed via a dashboard application
• Prototype display built using HTML canvas elements
![Page 12: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/12.jpg)
Grid Messaging System
![Page 13: Key Activities. MND sections](https://reader035.vdocuments.site/reader035/viewer/2022081512/5681458c550346895db27569/html5/thumbnails/13.jpg)
CMS Site Status Board