the hartree centre - hpc advisory council · 2020. 1. 16. · “the hartree centre’s expertise...

52
The Hartree Centre: Experience in Addressing Industrial, Societal and Scientific Challenges Prof Vassil Alexandrov Chief Science Officer

Upload: others

Post on 09-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

The Hartree Centre:Experience in Addressing Industrial,

Societal and Scientific Challenges

Prof Vassil Alexandrov

Chief Science Officer

Page 2: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Transforming UK industry by accelerating the

adoption of high performance computing,

big data and AI.

Our mission

Page 3: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Lenovo / Arm development system

Mission: To transform UK industry by accelerating the

adoption of high performance computing, data-centric

computing and AI.

Better products & servicesdelivered faster & cheaper

DeploymentDevelopment &prototyping

Research & innovationthrough collaboration

Hartree Centre

Page 4: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

What we do

− Collaborative R&DAddress industrial, societal and scientific challenges.

− Platform as a serviceGive your own experts pay-as-you-go access to our compute power

− Creating digital assetsLicense the new industry-led software applications we create with IBM Research

− Training and skillsDrop in on our comprehensive programme of specialist training courses and events

or design a bespoke course for your team

Page 5: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product
Page 6: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Government &

public sectorLocal business

networksTechnology

partners

Network of expertise

Academia

Universities

International

research

communities

Page 7: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Data

Science HPC

AI Emerging Technological

Paradigms

Hartree Research Themes

Page 8: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Our track record

Page 9: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Collaborative R&D

Hartree Centre experts optimised codes for

modelling component design, speeding up

run times by approx 20%

“Working with the Hartree Centre, we have

quickly made significant improvements to our

code, delivering faster turnaround and more

capability to our engineers.”

− Matthew Street, Rolls-Royce

Case study | Code optimisation for aero engines

Page 10: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Collaborative R&DCase study | Computer aided formulation

Faster development process for products

like shampoo, reducing physical testing

“The Hartree Centre’s high performance

computing capabilities help us achieve better

design solutions for our consumers, delivered

by more efficient, cost-effective and

sustainable processes.”

− Paul Howells, Unilever

Page 11: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Collaborative R&D

Transforming the patient experience using

cognitive technology and data analytics

“Helping our patients and their families

prepare properly for coming into hospital will

really reduce their anxiety and could mean

they spend more meaningful time with doctors

so we are able to get them better faster.”

− Iain Hennessey, Alder Hey Children’s Hospital

Case study | Building the cognitive hospital

Page 12: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Creating digital assets

Building pest risk prediction models with

the potential to:

• Enable the farming industry to more

accurately plan preventative measures

• Reduce crop losses

• Drive down insurance rates through lower

probability of crop damage

Case study | Smart crop protection

Page 13: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Intro

Our Platforms

Page 14: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Our platforms

Intel platforms

Bull Sequana X1000 (840 Skylake + 840 KNL processors) - One of the largest supercomputers in

Europe focusing primarily on industrial-led challenges.

IBM big data analytics cluster | 288TB

IBM data centric platforms

IBM Power8 + NVLink + Tesla P100

IBM Power8 + Nvidia K80

Accelerated & emerging tech

Maxeler FPGA system

ARM 64-bit platform

Clustervision novel cooling demonstrator

Page 15: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Detailed Case Studies

HPSE

Page 16: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Case Study:

• Performance: Needs to get the results in time for

forecast, ever-increasing accuracy goals for

climate simulations.

• Productivity: hundreds of people contributing

with different areas of expertise, 2 million lines of

code (UM)

• Portability: Very risky to chose just one platform:

may not be future-proofed, hardware changes

more often than software, procurement

negotiation disadvantage if you can only run on

one architecture, ...Difficult to compromise on

one

Page 17: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Algorithm

Kernel

Parallel SystemComputational

Science

Natural

Science

Operates on full fields

Operates on local

elements or columns

Given domain-specific knowledge and information about the Algorithm and

Kernels, PSyclone can generate the Parallel System layer.

Domain Specific Languages:

Embedded Fortran-to-Fortran code generation system used by the UK

MetOffice next-generation weather and climate simulation model (LFRic)

Page 18: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

• Serial performance optimizations.

• Distributed-memory parallelism with MPI.

• Shared-memory parallelism.

• GPGPU Programming.

Example: Direct Simulation Monte Carlo (for rarified gas flows)

Software Parallelization and Optimization

Page 19: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Currently undertaking an industrial collaboration Briggs

Automotive Company:

• Involves analysing and improving the design of the

BAC Mono single-seat sports car using large-scale

CFD computations.

PRACE (Partnership for Advanced Computing

in Europe)

Page 20: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

European project that targets to provide the template for an

upcoming Exascale system by co-designing and implementing a

petascale-level prototype with ground-breaking characteristics.

Builds on top of cost-efficient architecture enabled by novel inter-

die links and FPGA acceleration.

Work package 2: Applications, Co-design, Porting and Evaluation

Work package 3: System software and programming environment

Page 21: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Porting DL_MESO (DPD) on

Nvidia GPUs

Jony Castagna

Page 22: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

What is DL_MESO (DPD)

• DL_MESO is a general purpose mesoscale simulation package

developed by Michael Seaton for CCP5 and UKCOMES under a grant

provided by EPSRC.

• It is written in Fortran90 and C++ and supports both Lattice Boltzmann

Equation (LBE) and Dissipative Particle Dynamics (DPD) methods.

• https://www.scd.stfc.ac.uk/Pages/DL_MESO.aspx

Page 23: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

- Free spherical particles which interact over a rangethat is of the same order as their diameters.

- The particles can be thought of as assemblies or aggregates ofmolecules, such as solvent molecules or polymers, or more simply as carriers of momentum.

Fi is the sum of conservative, drag and random (or stochastic) pair forces:

...similar to MD

i j

cut off for short

range forces

+

long range forces

Page 24: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Examples of DL_MESO_DPD applications

Phase separation

Polyelectrolyte

Vesicle Formation

Lipid Bilayer

DL_MESO: highly scalable mesoscale simulations

Molecular Simulation 39 (10) pp. 796-821, 2013

Page 25: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

DL_MESO_DPD on GPU

initialisation, IO, etc.

(Fortran)

pass arrays to C

(Fortran)

copy to device

(CUDA C)

start main loop

time = final time ?pass arrays back to

Fortran and End

no

yes

100 % compatible IO with master version

Porting DL_MESO on NVidia GPU

all done by the GPU

with 1 thread per

particle

host = CPU

device = GPU

first step VV

construct neighbour list

generate random values

find short range forces

find long range forces

(FFT)

second step VV

gather statistics

Page 26: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Multi GPU version

Compute internal

cell forces

between particles

step 1

step 2

step3

Compute

boundary cell

forces between

particles

overlap computation

with communications!

Page 27: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Multi GPU version

GPU 0 GPU 1

GPU 2 GPU 3

halo regions

Page 28: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Multi GPU version

Use of communication between GPUs:1) to exchange particles positions using ghost cells (ideally

while computing the internal cells!)

3) gathering statistics

4) transfer info for FFTW (not implemented yet!)

2) for particles

leaving the domain

Page 29: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Multi GPU version

exchange particles positions using ghost cells

x

y

z

step 1:

exchange x-y planes

(2 communications)

Page 30: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Multi GPU version

exchange particles positions using ghost cells

x

y

z

step 2:

exchange x-z planes

(with halo data!)

Page 31: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Multi GPU version

exchange particles positions using ghost cells

step 3:

exchange y-z planes

(with halo data!)

x

y

zneed only 6 communications

instead of 26!

Page 32: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Weak scaling

1.2 billion particles for a mixture phase separation

Piz Daint

(CSCS)

*GPUs

GPUs

0

0.2

0.4

0.6

0.8

1

0 128 256 384 512

weak scaling

ideal

DL_MESO (GPU version)

Page 33: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Strong scaling

1.8 billion particles for a mixture phase separation

- no imbalance, if not within the same GPU due the particles

clustering!

GPUs

Page 34: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Xiaohu Case studies

Page 35: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

IMPORTANCE: Hartree Centre key technologies, align with STFCglobal challenge schemes.

Finite Element Method Smoothed Particle Hydrodynamics

Software Development for Energy and Environment

Nuclear Schlumberger oil reservoir

NERC ocean roadmap EPSRC MAGIC

Wave impact on oil rig Wave energy converter

Tsunami CCP-WSI

Page 36: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Large scale application software development, advancedcomputational methods development.

Sparse Linear Solver

FEM SPH/ISPH

Unstructured Mesh

Pre/Post Processing

SPH Pre/Post

Processing

Mesh topology

Management

Mesh

Adaptivity

Basic FEM Math

operators

FEM Matrix

Assembly

Basic SPH Math

operators

Nearest Neighbour

List Search

Mesh/Particles Reordering

Scalable Algorithm Development

SPH Particle

Refinement

MPI OpenMP CUDA OpenCL OpenACC

C/C++ Fortran Python

DDM/DLB

Page 37: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Unstructured Application Framework

Exascale AlgorithmsCommon Particle

Methods Kernels

Common Unstructured

Application Framework

Performance PortabilityUnstructured Mesh

Techs

EPSRC eCSE

EPSRC, CCP, SLA

Innovate UK, GCRF

EPSRC, Innovate UK, HRBEU

Page 38: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Advanced Monte Carlo

Methods for Linear Algebra

Vassil Alexandrov

Page 39: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Advanced Monte Carlo Methods for Linear Algebra on Advanced Accelerator Architectures Anton Lebedev (Institute for Theoretical Physics, University of Tuebingen, Germany), [email protected] Vassil Alexandrov (Hartree Centre STFC, UK and ICREA, Spain) [email protected]

Hybrid Monte Carlo Speed-ups V100 Speed-ups Intel Xeon

vs Hybrid MSPAI vs Intel Xeon 8160 vs V100 and K80

Page 40: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

OpenMP vs GPU implementation

Page 41: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

OpenMP vs GPU implementation

Page 42: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

OpenMP vs GPU implementation

Page 43: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Data Science

Simon Goodchild

Page 44: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

0

50

100

Accuracy % 13

features

Accuracy % 13 features

Page 45: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

“The Hartree Centre’s expertise

made our testing process cheaper

and more reliable, enabling us to

get our product to market faster.”

- William Wilson, GLOBAL-365

“Rolls-Royce has worked with the

Hartree Centre for many years due to their

combination of highly skilled computational

scientists and state-of-the-art HPC.”

- Leigh Lapworth, Rolls-Royce

Hartree Centre Impact

Page 46: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Evaluation participants

Page 47: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Training and Education at Hartree Centre

Page 48: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

HPC RD&I:

supercomputing,

Big data Analytics,

AI, RSE, Visual

Computing

UKRI / STFC

programs &

initiatives

Strategic

partnerships &

international

collaboration

Commercial &

Research

projects

University

liaison

Training

courses &

Knowledge

Transfer

Outreach

activities

Hartree research and training engagement

Page 49: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product
Page 50: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Knowledge Transfer, Events and Activities

Over 3000 person-days of training delivered to industrial and academic audiences

• Workshops: Aimed at decision-makers they shed light on High-performance

Computing(HPC), Data Analytics, and Research Software Engineering or Artificial

Intelligence applications(AI) incorporating individual consultations with our Business

Development team (BD).

• Training courses: The usual format is 1 to 3 days with sessions alternating between

presenting novel concept following by a technical hands-on session to facilitate the

trainees in applying the learned technique/skill on-the-job. The courses usually end

up with a “bring your own problem” session in order to build the confidence of the

participants in tackling the discussed technology/methodology.

• Upskilling and collaboration for specific project: A practical workshop with

experienced researchers and engineers from the Hartree Centre can kick-start a

collaboration by exploring some of the required steps. For example how to collect

and prepare your data for AI applications, how to evaluate the performance of a

software and to optimise applications for HPC systems or in the cloud.

Page 51: The Hartree Centre - HPC Advisory Council · 2020. 1. 16. · “The Hartree Centre’s expertise made our testing process cheaper and more reliable, enabling us to get our product

Impact of Hartree engagement with Universities

• Hartree Annual Doctoral Symposium, open to all of the CDTs collaborating with the Centre. The aim of the Symposium is to provide a forum for the students to share the results of their research work through talks, poster sessions and discussions.

• Internships & Placements, under Individual Research Training plan. Hartree centre guarantee a free access to all Hartree training courses and Seminars for the students it is supervising, and address the requirements for their technological skills.

• Functional Skills - as presentation of research results in front of academic and non-academic audiences is as equally important as the ability to produce technical and business reports and successfully publish research papers, opportunities to work in collaboration with Hartree researchers on written and verbal communication targeting diverse audiences are part of the PDP.