the impact of policy on uptake and usage of national and international computational grids

23
The Impact of Policy on Uptake and Usage of National and International Computational Grids S. J. Zasada, S. Manos, P. V. Coveney Centre for Computational Science, Department of Chemistry, University College London, Christopher Ingold Laboratories, 20 Gordon Street, London, WC1H 0AJ

Upload: tatyana-gaines

Post on 31-Dec-2015

25 views

Category:

Documents


4 download

DESCRIPTION

The Impact of Policy on Uptake and Usage of National and International Computational Grids. S. J. Zasada, S. Manos, P. V. Coveney Centre for Computational Science, Department of Chemistry, University College London, Christopher Ingold Laboratories, 20 Gordon Street, London, WC1H 0AJ. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

The Impact of Policy on Uptake and Usage of National and International

Computational Grids

S. J. Zasada, S. Manos, P. V. Coveney

Centre for Computational Science, Department of Chemistry, University College London, Christopher

Ingold Laboratories,

20 Gordon Street, London, WC1H 0AJ

Page 2: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Contents

• Case studies: Motivating applications– TeraGyroid– SPICE– GENIUS

• General requirements of these project• Impact on the policies of grid resource providers• Looking to the future

Page 3: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

The TeraGyroid project coupled cutting-edge grid technologies, high-performance computing, visualisation and computational steering capabilities to produce a major leap forward in soft condensed matter simulation.

Lattice sizes up to 10243

G. Giupponi, J. Harting, P.V. Coveney, Europhysics Letters, 73, 533-539 (2006).

J. Chin and P. V. Coveney,

Proc. R. Soc. London A, 462, 3575-3600 (2006).

LB3D/TeraGyroid Project

Won the award for "Most Innovative Data-Intensive Application" in the HPC Challenge competition at SC'03.

Page 4: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Computational Biomedicine: Simulated Pore Computing

Environment (SPICE)

Interactive HPC ANALYTICS CHALLENGE WINNER @ SC05

Translocation of biomolecules

through protein pores. Size,

complexity & timescale make

computations expensive. Millions of

CPU hours using “simple” MD. Need

to do better...

Novel Algorithm: Steered Molecular

Dynamics to “pull DNA through the

pore”. Jarzynksi's Equation to

compute equilibrium free energy

profile from non-equilibrium pulling.

Reduce comp. cost by approx. 100.

Page 5: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Federated Grid: UK-NGS & US-TeraGrid

High-end systems provide real-time interactivity. Advanced networks provide schedulable capacity and high QoS

Significant performance using optical switched light-paths -- UKLight/GLIF

SPICE: Grid Infrastructure

RealityGrid Steering Infrastructure (http://www.realitygrid.org)

Underlying grid middleware and complexity hidden from end-user

RealityGrid Steering API Application uses client side API

Page 6: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Grid Enabled Neurosurgical Imaging Using Simulation

The GENIUS project aims to model large scale patient specific cerebral blood flow in clinically relevant time frames

Objectives:• To study cerebral blood flow using patient-specific image-based models.

• To provide insights into the cerebral blood flow & anomalies.

• To develop tools and policies by means of which users can better exploit

the ability to reserve and co-reserve HPC resources.

• To develop interfaces which permit users to easily deploy and monitor

simulations across multiple computational resources.

• To visualize and steer the results of distributed simulations in real time

Page 7: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Book computing resources in advance or have a system by which simulations can be run urgently.

Shift imaging data around quickly over high-bandwidth low-latency dedicated links.

Interactive simulations and real-time visualisation for immediate feedback.

15-20 minuteturnaround

The clinical work flow

Page 8: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Efficient fluid solver for modelling brain bloodflow called HemeLB:

• Uses the lattice-Boltzmann method

• Efficient fluid solver for sparse geometries, like a vascular tree

• Machine-topology aware graph growing partitioning technique,

to help hide cross-site latencies

• Optimized inter- and intra-machine

communications

• Full checkpoint capabilities

Modelling blood flow using HemeLB

Page 9: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Haemodynamic simulation and visualisationFirst step is the conversion of patient-specific MRA or 3DRA data (DICOM format) to a 3D model,

vasculature is of high contrast, 300 - 400 m resolution, 5003 - 7003 voxels3DRA - 3-dimensional rotational angiography, vasculature is obtained using digital subtraction

imaging with a high-contrast x-ray absorbing fluid.

Page 10: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

• Typical computing scenario involves jobs submitted into a queue– Submit -> Run -> Post-process

• This won’t work in a clinical scenario since correctness and timeliness are important in clinical computing - late results are useless

Advance reservationsEmergency computing Grid middleware - the Application Hosting environmentBlood flow modelling, computational steering and

real-time in-situ visualisationDistributed ComputingLightpaths

• THE REQUIREMENT: To incorporate these methodologies into a clinician’s day to day activities, rather than just providing such facilities on an ad hoc basis.

GENIUS Toolkit

Beyond the batch job

Page 11: The Impact of Policy on Uptake and Usage of National and International  Computational Grids
Page 12: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Cross-site Runs with MPI-g

• GENIUS has been designed to run across multiple machines using MPI-g

• Some problems won’t fit on a single machine, and require the RAM/processors of multiple machines on the grid.

• MPI-g allows for jobs to be turned around faster by using small numbers of processors on several machines - essential for clinician

• HemeLB performs well on cross site runs, and makes use of overlapping communication in MPI-g

Page 13: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

HemeLB/MPI-g Requires Co-Allocation

• We can reserve multiple resources for specified time periods

• Co-allocation is useful for meta-computing jobs like HemeLB, viz and for workflow applications.

• We use HARC - Highly Available Robust Co-scheduler (developed by Jon Maclaren at LSU).

Slide courtesy Jon Maclaren

Page 14: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

HARC• HARC provides a secure co-allocation service

– Multiple Acceptors are used

– Works well provided a majority of Acceptors stay alive

– Paxos Commit keeps everything in sync

– Gives the (distributed) service high availability

– Deployment of 7 acceptors --> Mean Time To Failure ~ years

– Transport-level security using X.509 certificates

• HARC is a good platform on which to build portals/other services– XML over HTTPS - simplerthan SOAP services

– Easy to interoperate with

– Very easy to use with the Java Client API

Page 15: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

SPRUCESpecial PRiority and Urgent Computing Environment

• Applications with dynamic data and result deadlines are being deployed

• Late results are useless– Wildfire path prediction– Storm/Flood prediction– Patient specific medical treatment

• Some jobs need priority access “Right-of-Way Token”

Page 16: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Real Time Visualisation and Steering

• A way to let HemeLB know the parameters to be steered --> we use the RealityGrid steering system to steer the input data on the fly.

• One aim is to do all this for distributed (cross-site) simulations

• For medical applications, need may be urgent

Page 17: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Application Hosting Environment

• Need to utilize resources from globally distributed grids– Administratively distinct– Running different middleware stacks

• Wrestling with middleware can't be a limiting step for scientists

• Need tools to hide complexity of underlying grids

Page 18: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

General requirements of these projects

• Ability to co-reserve resources• Launch emergency simulations• Consistent interfaces for federated access• Access to back end nodes: steering, visualisation• Lightpath network connections• Cross site simulations (MPIg)• Support for software (ReG steering toolkit etc)

Page 19: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Impact on resource provider policies

• TeraGrid, NGS & HPCx starting to support advanced reservation with HARC

• DEISA are evaluating HARC deployment on their systems

• Some TeraGrid sites support emergency jobs with SPRUCE

• Lightpath connection in place between Manchester and Oxford NGS nodes

• MPIg and RealityGrid steering deployed on NGS and TeraGrid resources

Page 20: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Virtual Physiological Human

• Funded under EU FP 7• 15 projects: 1 NoE, 3 IPs, 9 STREPs, 2 CAs.

“a methodological and technological framework that, once established, will enable collaborative investigation of the human body as a single complex system ... It is a way to share observations, to derive predictive hypotheses from them, and to integrate them into a constantly improving understanding of human physiology/pathology, by regarding it as a single system.”

Page 21: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

• Computational experiments integrated seamlessly into current clinical practice

• Clinical decisions influenced by patient specific computations: turnaround time for data acquisition, simulation, post-processing, visualisation, final results and reporting.

• Fitting the computational time scale to the clinical time scale:– Capture the clinical workflow– Get results which will influence clinical decisions: 1 day? 1 week?– This project - 15 to 30 minutes

• Development of procedures and software in consultation with clinicians• Security/Access is a concern

• On-demand availability of storage, networking and computational resources

VPH requires clinical (grid) computing?

Page 22: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Conclusions

• The projects presented have all put pressure on resource providers to offer new services and new ways of working

• For interactive work the batch processing model does not work

• If HPC is to be exploited by clinicians it needs to be used in a way that fits in with the clinical workflow

• VPH initiative: Likely to increase pressure for non-standard services from resource providers

Page 23: The Impact of Policy on Uptake and Usage of National and International  Computational Grids

Rob HainesRobin PinningJohn BrookeStephen PicklesMark Mc KeownNGS staffTeraGrid StaffLONI StaffJANET/David SalmonSimon CliffordFrank SmithNick OvendenBrian ToonenNicholas KaronisDavid HawkesJon MaclarenShantenu JhaDaniel KatzShawn BrownKen YoshimotoDoru Marcusiu

Acknowledgements