h. kornmayer magic-grid status report egaap meeting, athens, 21th april 2005 - 1 enablinggrids for...

19
H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 1 EnablingGrids for E-siencE Generic Application: Monte Carlo Production for the MAGIC telescope MAGIC Grid Status report Harald Kornmayer, Alberto Forti, Toni Coarasa Mirco Mazzucato, Manuel Delfino Forschungszentrum Karlsruhe; University of Udine; Max-Planck-Institut für Physik, München; CNAF/INFN Bologna; PIC, Bologna on behalf of the MAGIC collaboration

Upload: alfred-harrell

Post on 01-Jan-2016

216 views

Category:

Documents


1 download

TRANSCRIPT

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 1

EnablingGridsfor E-siencEGeneric Application:

Monte Carlo Production

for the MAGIC telescope

MAGIC Grid Status report

Harald Kornmayer, Alberto Forti, Toni Coarasa

Mirco Mazzucato, Manuel DelfinoForschungszentrum Karlsruhe; University of Udine;

Max-Planck-Institut für Physik, München; CNAF/INFN Bologna; PIC, Bologna

on behalf of the MAGIC collaboration

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 2

EnablingGridsfor E-siencE

Outline• Introduction

• What kind of MAGIC?

• DevelopmentUse Cases RequirementsWorkflows MetaData

• StatusHardware SoftwareDeployment

• ExperienceFirst Month on production is overGrid-added value

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 3

EnablingGridsfor E-siencE

Introduction: The MAGIC Telescope• Ground based

Air Cerenkov Telescope• Gamma ray: 30 GeV - TeV• LaPalma, Canary Islands

(28° North, 18° West)• 17 m diameter• operation since autumn 2003

(still in commissioning)• Collaborators: IFAE Barcelona, UAB Barcelona,

Humboldt U. Berlin, UC Davis, U. Lodz, UC Madrid, MPI München, INFN / U. Padova, U. Potchefstrom, INFN / U. Siena, Tuorla Observatory, INFN / U. Udine, U. Würzburg, Yerevan Physics Inst., ETH Zürich

Physics Goals: Origin of VHE Gamma raysActive Galactic NucleiSupernova RemnantsUnidentified EGRET sourcesGamma Ray Burst

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 4

EnablingGridsfor E-siencE

~ 10 kmParticleshower

Ground based γ-ray astronomy

~ 1o

Ch

eren

kov

ligh

t

~ 120 m

Gammaray

GLAST (~ 1 m2)

Cherenkov light Image of particle shower in telescope camera

reconstruct: arrival direction, energyreject hadron background

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 5

EnablingGridsfor E-siencE

MAGIC – Hadron rejection

• Based on extensive Monte Carlo Simulation

• air shower simulation program CORSIKA

• Simulation of hadronic background is very CPU consuming

• to simulate the background of one night, 70 CPUs (P4 2GHz) needs to

run 19200 days

• to simulate the gamma events of one night for a Crab like source takes

288 days.

• At higher energies (> 70 GeV) observations are possible already by

On-Off method (This reduces the On-time by a factor of two)

• Lowering the threshold of the MAGIC telescope requires new

methods based on Monte Carlo Simulations

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 6

EnablingGridsfor E-siencE

Developments - Requirements

• MAGIC needs a lot of CPU to simulate the hadronic background to

explore the energy range 10 GeV – 100 GeV

• MAGIC needs a coordinated effort for the MonteCarlo production

• MAGIC needs an easy accessible system

(Where are the data from run_1002 and run_1003?)

• MAGIC needs an scalable system (as MAGIC II will come 2007)

• MAGIC needs the possiblity to access data from other experiments

(HESS, Vertias, GLAST, PLANCK(?)) for multiwavelength campaigns

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 7

EnablingGridsfor E-siencE

Development - Team• The proposal was written in

March 2004• Support for the proposal from

– CNAF (M. Mazzucato)– PIC (M. Delfino)– FZK – MAGIC

• Organisation– Contact Person

• Harald Kornmayer– VO Manager

• Harald Kornmayer– Gridifying developers

• Alberto Forti• Harald Kornmayer• Toni Coarasa

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 8

EnablingGridsfor E-siencE

Development – MC Workflow

I need 1.5 million hadronicshowers with Energy E, Direction (theta, phi), ... As background sample for observation of „Crab nebula“

Run Magic MonteCarlo Simulation and register output data

Run Magic MonteCarlo Simulation and register output data

Run Magic MonteCarlo Simulation and register output data

Run Magic MonteCarlo Simulation and register output data

Run Magic Monte Carlo Simulation (MMCS) and register output data

Simulate the Telescope Geometry with the reflector program for all interesting MMCS files and register output data

Simulate the Starlight Background for a given position in the sky and register output data

Simulate the response of the MAGIC camera for all interesting reflector files and register output data

Merge the shower simulation and the StarLight simulation and produce a MonteCarlo data sample

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 9

EnablingGridsfor E-siencE

• Huge number of runs with different parametersWe want to record what we did!

MetaData needed!

MMCS – Meta data

Model• for MMCS

• Store run parameters• Monitor Grid Jobs• Manage MMCS data

• for reflector• prototype designed

• for camera• need to be done!

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 10

EnablingGridsfor E-siencE

Status - Hardware

• History:– The Hague, Nov. 2004:

• The MMCS prototype was developed on the CrossGrid testbed

• First test on GILDA

• Migration to real EGEE infrastructure needed

• Now: – Running on real EGEE infrastructure (april 2005)

• VO server at Nihkef/SARA

• BDII and Resource Broker at CNAF

• Replica services at PIC

• Resources (whereever available! come to that later!)

bash-2.05a$ lcg-infosites --vo magic allLRC endpoint for magic: http://rlsmagic.pic.es:8080/magic/edg-local-replica-catalog/services/edg-local-replica-catalogRMC endpoint for magic: http://rlsmagic.pic.es:8080/magic/edg-replica-metadata-catalog/services/edg-replica-metadata-catalogLRC endpoint for magic: http://rlsmagic.pic.es:8080/magic/edg-local-replica-catalog/services/edg-local-replica-catalogRMC endpoint for magic: http://rlsmagic.pic.es:8080/magic/edg-replica-metadata-catalog/services/edg-replica-metadata-catalog

****************************************************************These are the related data for magic: (in terms of CPUs)****************************************************************

#CPU Free Total Jobs Running Waiting ComputingElement---------------------------------------------------------- 20 20 0 0 0 ce01.pic.es:2119/jobmanager-torque-magic 168 4 1 1 0 lcgce02.ifae.es:2119/jobmanager-lcgpbs-magic 7 6 0 0 0 ce001.grid.bas.bg:2119/jobmanager-lcgpbs-magic 82 63 6 6 0 gridba2.ba.infn.it:2119/jobmanager-lcgpbs-long 82 63 3 3 0 gridba2.ba.infn.it:2119/jobmanager-lcgpbs-short1280 5 0 0 0 lcg-gridka-ce.fzk.de:2119/jobmanager-pbspro-long1280 5 0 0 0 lcg-gridka-ce.fzk.de:2119/jobmanager-pbspro-short 50 29 1 1 0 grid012.ct.infn.it:2119/jobmanager-lcgpbs-long 50 29 0 0 0 grid012.ct.infn.it:2119/jobmanager-lcgpbs-short 82 63 8 8 0 gridba2.ba.infn.it:2119/jobmanager-lcgpbs-infinite1280 5 0 0 0 lcg-gridka-ce.fzk.de:2119/jobmanager-pbspro-default 582 413 0 0 0 ce01-lcg.cr.cnaf.infn.it:2119/jobmanager-lcglsf-magic1280 5 0 0 0 lcg-gridka-ce.fzk.de:2119/jobmanager-pbspro-extralong 50 29 24 20 4 grid012.ct.infn.it:2119/jobmanager-lcgpbs-infinite 8 6 1 1 0 gridit-ce-001.cnaf.infn.it:2119/jobmanager-lcgpbs-long 8 6 0 0 0 gridit-ce-001.cnaf.infn.it:2119/jobmanager-lcgpbs-short 8 6 1 1 0 gridit-ce-001.cnaf.infn.it:2119/jobmanager-lcgpbs-infinite----------------------------------------------------------

The total values are:----------------------------------------------------------4889 648 45 41 4

**************************************************************These are the related data for magic: (in terms of SE)**************************************************************

Access point Avail Space(Kb) Used Space(Kb) Files SEs---------------------------------------------------------------------------------------------------------------gsiftp://castorgrid.ifae.es/castor/ifae.es/lcg/magic/ 1000000000000 0 0 castorgrid.ifae.esgsiftp://castorgrid.pic.es/castor/ifae.es/lcg/magic/ 1000000000000 0 0 castorgrid.pic.esgsiftp://castorgrid.ifae.es/castor/ifae.es/lcg/magic/ 53020932 0 0 castorgrid.ifae.esgsiftp://se001.grid.bas.bg/storage/magic/ 354813480 30087 4 se001.grid.bas.bggsiftp://gridba6.ba.infn.it/flatfiles/SE00/magic/ 1605498664 0 0 gridba6.ba.infn.itgsiftp://grid007g.cnaf.infn.it/flatfiles/SE00/magic/ 1497769448 0 0 grid007g.cnaf.infn.itgsiftp://lcg-gridka-se.fzk.de/grid/fzk.de/..../SE00/magic/ 1605498664 0 0 lcg-gridka-se.fzk.de

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 11

EnablingGridsfor E-siencE

Status – Hardware

Nikhef/Sara FZK

BA.INFN.IT

PICCNAF

BAS.BG

BDIIMetaDataBase

UI

VO-Server

UI

RB

CE

CE

CE

UI CE

RLS

SECE

MPI.DE

UI CE

CT.INFN.IT

CE CE

This system is running and used for the first Monte Carlo Production challenge on the Grid (12 MEvts)

First experience

EGEE-1

CE

EGEE-2

CE

EGEE-2

CE

(apr/may 2005)

SESE

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 12

EnablingGridsfor E-siencE

Status – Software

• MMCS system version 1.0

• is up and running!

• Reflector prototype

• tested

• Camera

• needs to be done!

• Currently we submit all the software with the jdl

no software installation required yet

methods like the GAIT are tested (see Crakow Grid Workshop 2004)

• The submission needs approx. 40 sec

• With a more dedicate application installation procedure, this can be improved!!

• MMCS system version 1.0

• Java based prototype

• Hides the EDG/lcg command from the user

• Access the Meta-database

• Runs on User Interface

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 13

EnablingGridsfor E-siencE

Experience – Execution

Data challenge Grid-1• 12M hadron events

• 12000 jobs needed• started march 2005• up to now ~ 4000 jobs

First tests:• with manual GUI submission• Reasons for failure

• Network problems• RB problems• Queue problems

Job successful:Output file

registered at PIC

Diagnostic:• no tools found• complex and time consuming use metadata base, log the failure, resubmit and don‘t care

170/3780 Jobs failed 4.5 % failure

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 14

EnablingGridsfor E-siencE

Experience – InputProcOutput

• How to submit a job?

JDL

• JDL should specify

• Get the input form

• InputSandBox

• InputData

• Run the program

• Executable

• Store the output at

• OutputSandBox

• OutputData

Inpu

tDat

a

Out

putD

ataFile on UI

InputSandBox

File on GridInputData

File to UIOutputSandBox

File to GridOutputData

OK!

No file transfer!

OK!

OK!

Answer from experts: write a script that copy the file

a SE to the WN BUT: I don‘t want to implement a WORKAROUND

for basic grid functionality

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 15

EnablingGridsfor E-siencE

• The VO is managed with the VO server

• Why do the MAGIC grid users needs to be registered at CERN?

• Clarification of the non-LCG-registration procedure needed!!

• SA1 centers must know about this procedure!!

• VO manager has no means to rejected offered resources

• Only the local sites decide, if they want to support the VO

• There is no certification/test procedure before

• this can produce instability of a running production system!

• The VO manager can not remove them!

• Who is managing the VO?

Experience – VO Management

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 16

EnablingGridsfor E-siencE

• Qualitative

• Enable „(e-)scientific“ collaboration between partners

• Enable the cooperation between different experiments

• „Towards a virtual observatory for VHE gamma rays“

(presentation next week)

• Enable the participation on Virtual Observatories

Grid added value

• Quantitative

• Number of temporarily available CPU unreachable

no money, no manpower

• Aim: 1.5 Mio Events in 24 hours

• Manpower for storage maintenance reduced

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 17

EnablingGridsfor E-siencE

Technical:

• „missing“ functionality in

middleware

• IPO definition

• Job status information to

computing centric

• VO management

• Set up

• Local mapping

• Central registration

bottlenecks

Technical:

• Tools for error diagnostics not

available

• VO supporting tools

• More Batch systems need

support

• Sun Grid Engine

• Firewall

• The range of open firewall

ports is a problem for new

computing centers

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 18

EnablingGridsfor E-siencE

Political:

• …

bottlenecks

Money:

• Near-future availablility

• EGEE-2 resources (?)

• Beyond 2007 (?)

• Far-future Grid operation

models?

• Sustainablility ?

Manpower:

• EGEE Account Manager

Administrative:

• Single point of contact

VO account manager

• Many boards

• EGAAP vs. OAG

• Many documents read

• Where to Send Middleware

Requirements?

• Communication flow inside

EGEE not obvious for new

VOs

H. Kornmayer MAGIC-GRID Status report EGAAP meeting, Athens, 21th April 2005 - 19

EnablingGridsfor E-siencE

EGEE – MAGIC Grid

• From TheHague to Athens• MAGIC Grid on EGEE became real

• First Monte Carlo Production challenge • 12 Mio Events • started mid of March• ongoing

• Evaluation of a Grid based analysis system for MAGIC

• New application domain on the Grid•„Towards a virtual VHE gamma ray observatory“

wait for the future