1 the alice offline environment f.carminati 13 june 2001

29
1 The ALICE Offline Environment F.Carminati 13 June 2001

Upload: hector-wilson

Post on 04-Jan-2016

219 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: 1 The ALICE Offline Environment F.Carminati 13 June 2001

1

The ALICE Offline Environment

F.Carminati 13 June 2001

Page 2: 1 The ALICE Offline Environment F.Carminati 13 June 2001

2ROOT WorkShop 2001June 13, 2001

ALICE Offline Project

ALICE is unique in the LHC context Major decisions already taken (DAQ, Off-line) Move to C++ completed Adoption of the ROOT framework Tightly knit Off-line team No ongoing “turf war” Physics performance and computing in a single

team One single development line Aggressive Data Challenge program Impressive achievement record And of course Minimal support from CERN (apart for ADC’s)

Page 3: 1 The ALICE Offline Environment F.Carminati 13 June 2001

3ROOT WorkShop 2001June 13, 2001

LHC Computing ReviewRecognition of

Size and cost of LHC computing (240MCHF by 07 + 80/y after)

Shortage of manpower in LHC core software teams Importance of Data Challenges

Recommendations for Common prototype (2002-2004) & projects (IT+LHC) CERN support of FLUKA / ROOT Establishment of SC2 to oversee LHC computing Drafting of a computing MoU

See: http://lhc-computing-review.web.cern.ch/lhc-computing-review-public

The battled is not ended, it just started!

Page 4: 1 The ALICE Offline Environment F.Carminati 13 June 2001

4ROOT WorkShop 2001June 13, 2001

ALICE Software Process highlights

Freely inspired by the most recent Software Engineering trends

Extreme Programming Explained, Addison Wesley Martin Fowler, The New Methodology, Linux, gnu, ROOT, KDE, GSL…

Exhaustive code documentation made nightly and on the web

UML diagrams, code listing, coding rule violations

Continuous code testing & verification, nightly build and tests Collective ownership of the code one single repositoryFlexible release cycle: release often, release earlySimple packaging and installation

Code is composed only of 2 main packages (ROOT and AliRoot)

We could not have done it without ROOT

Page 5: 1 The ALICE Offline Environment F.Carminati 13 June 2001

5ROOT WorkShop 2001June 13, 2001

ALICE Software Process synoptic

Requirements are necessary to develop the code ALICE users express their requirements continuously

Design is good for evolving the code Users participate in the system design continuously

Testing is important for quality and robustness Testing is done automatically every night

Integration is needed to ensure coherence AliRoot is completely integrated every night

Discussions are valuable for users and developers Discussion is continuous in the ALICE offline list

Page 6: 1 The ALICE Offline Environment F.Carminati 13 June 2001

6ROOT WorkShop 2001June 13, 2001

Status of AliRootVersion v3-05 just released Foreseen stable for 6 months Folder structure Major refactoring of the code AliStack introduced Reconstruction for all detectors from digits TRD tracking Inverse Kalman filter Major step toward global tracking for ITS – TPC –

TRD Abstract interface between VMC, G3 and G4 FLUKA

(in preparation) AliGenerator interface improved / extended

(cocktail & afterburners)

Page 7: 1 The ALICE Offline Environment F.Carminati 13 June 2001

7ROOT WorkShop 2001June 13, 2001

Folders

Long (stormy) discussions with René on the role of folders We had to clarify what is the relative role of

object design, folders structure and relation with containers

Now this is clear, and AliRoot is posting shared data structures to folders

It came at the right moment for us, as we are designing now global reconstruction

We should think about the relation between folders, GRID LFNs and datasets

Forward evolutive concept

Page 8: 1 The ALICE Offline Environment F.Carminati 13 June 2001

8ROOT WorkShop 2001June 13, 2001

Tasks

Less clear (at least for me) the use of tasks Elegant way to generic programming Going from data incapsulation to method

incapsulation For the moment we are afraid by the lack

of “expressivity” of the TTask->Execute() Also the necessary refactoring of the

code is probably rather large

More prototyping is needed before a decision is taken

Page 9: 1 The ALICE Offline Environment F.Carminati 13 June 2001

9ROOT WorkShop 2001June 13, 2001

Whiteboard Data Communication

Class 1 Class 2

Class 3

Class 4

Class 5Class 6

Class 7

Class 8

Page 10: 1 The ALICE Offline Environment F.Carminati 13 June 2001

10ROOT WorkShop 2001June 13, 2001

Update on VMC

The G3 interface is in production and evolving Possibility to translate geometry is TGeometry will

be in v3-06 pre-release

The G4 interface is technically working but G4 is a monolithic and non-modular program Conversion from GEANT3 geometry is heavy The question of MANY is still unsolved The physics of G4 seems out-of-control

Work on the FLUKA interface is ongoing, but the tracking / stacking is different from G3 / G4 and this may be a major challenge

Page 11: 1 The ALICE Offline Environment F.Carminati 13 June 2001

11ROOT WorkShop 2001June 13, 2001

The Virtual MC

User Code

VMC

G3

G3 transport

G3 geometry

G4 transport

G4 geometry

G4

FLUKA transport

FLUKA

Page 12: 1 The ALICE Offline Environment F.Carminati 13 June 2001

12ROOT WorkShop 2001June 13, 2001

The framework question

1990 2000

PAW

GEANT3

Recons?

ROOT

OK

GEANT4

LOST

?Discussio

n on Thu?

TParticle

VMC

TGenerator

Page 13: 1 The ALICE Offline Environment F.Carminati 13 June 2001

13ROOT WorkShop 2001June 13, 2001

The Framework questionThe problem of the MC is clearly not solvable in this context However a very important piece on which we can work is the Geometrical Modeller G3 modeller had an incredible longevity, but it has

several shortcomings This has been one of the areas of IT most active in

these past 20 years

A possible way ahead Evaluate few candidates (I have at least 2/3) Define a ROOT abstract interface for CSG modelling Interface with ROOT 3D graphics Define a representation on file

Page 14: 1 The ALICE Offline Environment F.Carminati 13 June 2001

14ROOT WorkShop 2001June 13, 2001

The Virtual MC II

User Code

VMC

Virtual Geometrical Modeller

G3 G3 transport

G4 transportG4

FLUKA transportFLUKA

Geometrical Modeller

Reconstruction

Visualisation

Page 15: 1 The ALICE Offline Environment F.Carminati 13 June 2001

15ROOT WorkShop 2001June 13, 2001

ALICE Data ChallengesALICE plans to acquire data at 1.25GB/sTo be confident we will be able to do that we run yearly data challenges of increasing complexity and sizeThe goals ADC III (1H2001) were Stable bandwidth of 100 MB/s over the

complete chain during a week Aggregate bandwidth of 300 MB/s in the DAQ Total 80 TB of data stored in MSS Testing of SMP servers Online monitoring tools Regional centres (not done!)

Page 16: 1 The ALICE Offline Environment F.Carminati 13 June 2001

16ROOT WorkShop 2001June 13, 2001

ADC III hardwareLHC testbed farm (ADC using part of the testbed) Low-cost PCs only: 2 Intel CPUs running Linux 35 CPU servers (22 on Gigabit Ethernet, 10 on Fast

Ethernet) ? disk servers 12 tape servers, 12 tape drives, 1000 tape cartridges Fast/Gigabit Ethernet network

6 switches from 3 manufacturers on 2 media

ALICE servers 3 PC servers: 6 Intel CPUs running Linux used as disk

servers (joint project ALICE/HP)

In total: ~ 10 % of the ALICE DAQ

Page 17: 1 The ALICE Offline Environment F.Carminati 13 June 2001

17ROOT WorkShop 2001June 13, 2001

Alice ADC III

GDC

LDC: Local Data ConcentratorGDC: Global Data Collector

Gigabit Ethernet switch(Summit 5i)12x1000 Base-T

4x1000 GBIC

CONTROL

SWITCH

Gbit Eth1 Gbit/s

ComputingCenterBld 513

Gbit Eth1 Gbit/s

GDCGDC GDC GDC GDC GDCGDC GDC GDC GDC GDC GDC

PC 2xP700Linux

SWITCH

LDCLDC

LDCLDC

CERNNetworkBackbone

Gigabit Ethernet switches(Cabletron Smart Switch Router 8600)

ComputingCenterBld 513

Fast Ethernet switch(3COM3900)

24x100 Base-T1x1000 Base-Fx uplink

Gigabit Ethernet switch(3COM 9100)12x1000 Base-T Base-Fx1x1000 Base-Fx uplink

TapeServer

TapeServer

TapeServer

8 IBM LTO Ultrium300 cartridges

SWITCH

PC 2x700 Intel/Linux Tbed

28/01/2001

SWITCH

Gigabit Ethernet switch(Summit 7i)28x1000 Base-T

4x1000 GBIC

DiskServer

LDCLDC

LDCLDC

SWITCH

LDCLDC

TapeServer

TapeServer

TapeServer

7 STK 9940260 cartridges

SWITCH

ALICE DAQBld 53

Fast Ethernet switch(3COM3900)

24x100 Base-T1x1000 Base-Fx uplink

SWITCH

GDC GDC GDC GDCGDC GDC GDC GDC

GDCGDCGDCGDC GDC GDC

Fast Ethernet switch(3COM3900)

24x100 Base-T1x1000 Base-Fx uplink

SWITCH

GDC GDC GDCGDC GDC GDC GDC

GDCGDCGDCGDCGDC GDC GDC

PC 2xP700Linux

LDC LDC LDC LDCLDC LDC LDC LDCLDC

LDC LDC LDC LDCLDC

LDC

GDCGDCGDC

DiskServer

DiskServer

DiskServer

DiskServer

DiskServer

Event building (DATE)

Recording (ROOT+RFIO) Mass Storage (CASTOR)

Page 18: 1 The ALICE Offline Environment F.Carminati 13 June 2001

18ROOT WorkShop 2001June 13, 2001

DATE + ROOT I/O

Page 19: 1 The ALICE Offline Environment F.Carminati 13 June 2001

19ROOT WorkShop 2001June 13, 2001

DATE + ROOT I/O + CASTOR

Writing to local disk

Migration to tape

Page 20: 1 The ALICE Offline Environment F.Carminati 13 June 2001

20ROOT WorkShop 2001June 13, 2001

DATE + ROOT I/O + CASTOR

Page 21: 1 The ALICE Offline Environment F.Carminati 13 June 2001

21ROOT WorkShop 2001June 13, 2001

DATE + ROOT I/O + CASTOR

Page 22: 1 The ALICE Offline Environment F.Carminati 13 June 2001

22ROOT WorkShop 2001June 13, 2001

HighlightsExcellent system stability during 3 months

Max DATE throughput: 550 MB/s (max) 350 MB/s (ALICE-like) Max DATE+ROOT throughput: 240 MB/s Max DATE+ROOT+CASTOR throughput: 120 MB/s

average during several days: 85 MB/s (> 50 TB/week) 2200 runs, 2* 107 events, longest DATE run (86 hours, 54 TB) 500 TB in DAQ, 200 TB in DAQ+ROOT I/O CASTOR > 100.000 files of 1 Gbyte => 110 Tbytes Database for metadata with 105 entries

HP SMP’s: cost-effective alternative to unexpensive disk serversOnline monitoring tools developedConcerns

LHC Computing Prototype should receive adequate resources for software and hardware

Page 23: 1 The ALICE Offline Environment F.Carminati 13 June 2001

23ROOT WorkShop 2001June 13, 2001

Performances and plans

0

50

100

150

200

250

300

350

1999 2000 2001

DAQ

Mass Storage (sust)

Mass Storage (peak)

0

500

1000

1500

2000

2500

3000

1999 2000 2001 2002 2003 2004 2005 2006

DAQMass Storage

Page 24: 1 The ALICE Offline Environment F.Carminati 13 June 2001

24ROOT WorkShop 2001June 13, 2001

ADC IVPlanning and resources under discussionTest of individual components before next ADCGoals: Increase performances (200MB/s to tape, 1GB/s

through the switch) Study key issues such as computers and fabric

architecture Include some L3 trigger functionality Involve 1 or 2 regional centres

Global ADC second half of 2002 (new tape generation and 10 Gbit Eth.)

Page 25: 1 The ALICE Offline Environment F.Carminati 13 June 2001

25ROOT WorkShop 2001June 13, 2001

DataGRID project statusHuge project 21 partners, 10MEuro, 3 years, 12 WorkPackages

Global design still evolving An ATF has been mandated to design the

architecture of the system to be delivered at PM6 (=now!)

To progress they need continuous feed-back from the users

Users are in three workpackages HEP in WP8 Earth Observation in WP9 Biology in WP10

ALICE long term project is distributed interactive data analysis

Page 26: 1 The ALICE Offline Environment F.Carminati 13 June 2001

26ROOT WorkShop 2001June 13, 2001

DataGrid & ROOT

Local

Remote

Selection

Parameters

Procedure

Proc.C

Proc.C

Proc.C

Proc.C

Proc.C

PROOF

CPU

CPU

CPU

CPU

CPU

CPU

TagDB

RDB

DB1

DB4DB5DB6

DB3

DB2

Page 27: 1 The ALICE Offline Environment F.Carminati 13 June 2001

27ROOT WorkShop 2001June 13, 2001

DataGrid & ROOTRoot RDB

Selection parameter

sTAG DB

selected

events

Grid RBLFN #hit

s

Grid MDS

Grid perf log

Grid perf mon

Grid replica catalog

Grid cost evaluator

best place

s

Spawn PROOF tasks

Grid net ws

Grid replica

manager

Grid autenticat

e

PROOF loop

Grid replica catalog

Update Root RDB

output LFNs

Send results back

Grid log & monitor

Page 28: 1 The ALICE Offline Environment F.Carminati 13 June 2001

28ROOT WorkShop 2001June 13, 2001

Comments to ROOT team

Really no hard comment to the root teamThe release policy on CVS could include a branch for every version This would allow to make “patches” to

production versions instead of being forced to take the HEAD to have a fix

HEADv4-01

v4-02

v5-02

v5-01

v6-02

v6-01

tag branch

Page 29: 1 The ALICE Offline Environment F.Carminati 13 June 2001

29ROOT WorkShop 2001June 13, 2001

Conclusion The development of AliRoot continues happilyThe organisation of the Off-line project is more and more oriented toward an OS distributed development model ROOT fits perfectly in this scheme

We are confident that ROOT will evolve to meet our requirements in 2007 and beyond We will try hard to have the recommendations of the

LHC Computing Review implemented at CERN

We are grateful to the ROOT team for their continued and excellent support