the eurolab4hpc project has received funding from the ... · fusion reactors core; cnea, argentina...
TRANSCRIPT
![Page 1: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/1.jpg)
The Eurolab4HPC project has received funding from the European Union Horizon 2020 Framework Programme (H2020-EU.1.2.2. - FET Proactive) under grant agreement number 800962.1
![Page 2: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/2.jpg)
Eurolab4HPC• Eurolab4HPC: 2015–2017 (CSA)• Eurolab4HPC2: 2018–2020 (CSA)
2
![Page 3: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/3.jpg)
Structure the HPC Community (1/2)
3
Thematic workshops• Dedicated and focussed workshops where
the different stakeholders for a particular theme
• can meet, brainstorm, work on a vision, and make plans for collaborations
Oct 2018 (Heraklion): Eurolab4HPC Thematic workshop on Accelerators
Apr 2019 (Edinburgh): Eurolab4HPC Thematic workshop on multicore programming
Oct 2019 (Bilbao): Eurolab4HPC Thematic Session onprogramming models for upcoming supercomputers
Eurolab school• Colocated with HiPEAC ACACES summer school• Three courses on HPC• Eurolab offered travel grants for students
![Page 4: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/4.jpg)
Short collaborations
4
• Travel costs for short stays covered up to €5,000
• Duration approx. 3 months• Foster tight integration among
research teams at multiple levels• Hardware, architectures,
programming and applications• PhD students, post-doctoral
researchers and academics
Structure the HPC Community (2/2)
Name Title To From
Alejandro Soba
HPC for Neutron Transport in a Fusion Reactors Core
CNEA, Argentina BSC
Albert Kahira Scaling Large Machine Learning Models for HPC
BSC Tokyo Institute of Technology
Sandra Catalan
Math Libraries Migration and Optimization on ARM-based processors
BSC Fujitsu
IvanFernandezVega
Processing in Memory for Data Intensive Applications
University of Malaga
ETHZ
Christopher Eldred
Themis Univ. GrenobleAlpes/Inria
Imperial College
![Page 5: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/5.jpg)
5
EuroLab-4-HPC 2017 Vision• Long-term vision for excellence in European HPC research• Written by expert working group
• Luca Benini, Babak Falsafi, Paul Carpenter, Avi Mendelson, Theo Ungerer• Sandro Bartolini, François Bodin, Dietmar Fey, Marc Duranton,
Igor Zacharov
• Complements HiPEAC Vision• Broad scope of computing and society: HPC, AI, embedded• 2017–2023 research horizon
• Complements ETP4HPC SRA – Strategic Research Agenda• Industry-driven roadmap to build competitive European HPC technology value chain.• 6-year horizon
• Released August 2017• 2020 update under preparation• BoF: Challenges and Opportunities in Academic HPC Systems
Research in 203017:15 to 18:45 on Wednesday 20 November (today!)
![Page 6: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/6.jpg)
6
FETHPCEuroHPC
R&D projects
EuroLab4HPCvision
EuroLab4HPC
EXDCI
HPC applications
ETP4HPCSRA
HPC infrastructure
PRACE
HPC industryETP4HPC
HPC systems academia
Codesign
CoEs FOCUSCOE
![Page 7: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/7.jpg)
7
Promote entrepreneurship (1/2)
Idea-to-business training• First held in Gothenburg, November 2018• Very early stages, i.e. when founders only have a piece
of technology and an idea of the potential market. • Business Model Canvas• Customer Development Process
Open source innovation camp⇒ Open source natural forum for:• Testing ideas and innovate,• Promoting entrepreneurship,• Fostering industry take-up `• Training more experts in HW and SW
• Supported RISC-V Workshop, June 2019, Zurich
• Eurolab4HPC extends it with additional events together with the FOSSI foundation
• Eurolab also presented prizes to winners of call for contributions for open source HPC projects
![Page 8: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/8.jpg)
Business prototyping projects (BPPs)
8
• Successful projects are mentored using the business model canvas.
• Business model hypotheses iteratively verified or adjusted to come up with business plan
• Eurolab4HPC grant of € 15,000• Typically lasts 3 months
Promote entrepreneurship (2/2)
Name Title Start date
Josep de La PuenteJose María CelaAlejandro Martí
USER - Urgent Simulation of Earthquakes for Resilience
February 2019
Monica de MierStephan Mohr Gerhard Goldbeck
ATOMSIM - Atomistic simulations for materials design
December 2018
Petar RadojkovicPaul CarpenterMario Nemirovsky
PREDICT: Prediction of HPC failures in the field
June 2019
![Page 9: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/9.jpg)
Industry days
9
• First Eurolab4HPC Industry Day, November 2018 (Multicore Day, Sweden)
• Second Eurolab4HPC Industry Day, February 2019 (University of Edinburgh)
Stimulate technology transfer (1/2)
Technology transfer training• First Eurolab4HPC Technology Transfer Training
Course, HLRS, Stuttgart, March 2019• Two-day course intended to equip academic
researchers• Basic understanding of the entrepreneurial process• Generation of business models• Managerial challenges
![Page 10: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/10.jpg)
Technology transfer projects (TTPs)
10
• Bilateral technology transfer between• one academic partner as technology provider and• one industry partner as technology receiver
• Funded TTP typically lasts 6 months • One TTP project was granted• Total budget of the TTP is €90,000, of which Eurolab4HPC
pays up to 50% (€45,000)
Stimulate technology transfer (2/2)
![Page 11: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/11.jpg)
BoF: Challenges and Opportunities in Academic HPC Systems Research in 2030
17:15 to 18:45 on Wednesday 20 November (today!)Room 708 on ground floor
Public workshop in process of preparing 2020 vision update
John Davis (BSC)Martin Schulz (LRZ + TUM)
https://sc19.supercomputing.org/presentation/?id=bof197&sess=sess337
11
![Page 13: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/13.jpg)
Backup
13
![Page 14: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/14.jpg)
14
Comparison of HPC RoadmapsGoal Timespan SWOT /
political Scope Num.pages
HiPEAC vision Steer European academic research (driven by industry)
“short”: 3 years,“mid”: 6 years,long: >2020
Y HPC + embedded 72
ETP4HPC SRA / EXDCI
Strengthening European [industrial] HPC ecosystem
6 years(2014 to 2020) Y HPC except
applications 92
PRACE Scientific Case
[Academic] need for European HPC infrastructure
8 years(2012 to 2020) Y HPC applications 159
EESI (European Exascale Software Initiative)
Development of efficient exascale applications 5 to 10 years N Exascale applications 34
BDVA (Big Data Value Association) Big Data technologies roadmap 2020 - Big data 45
Rethink BigRoadmap for European Technologies in Hardware and Networking for Big Data
- Big data
ECSEL MASRIAEuropean leadership in enabling and industrial technologies. Competitive EU ECS industry.
2015 roadmap to about 2025 Y Electronic components
and systems (ECS) 91
Eurolab4HPC Academic excellence in HPC 8 to 10 years N Whole HPC stack 20—25
![Page 15: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/15.jpg)
15
![Page 16: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/16.jpg)
New application and software technologies pull
• Strong scaling evolutionary applications• Importance of legacy codes• Need to adapt for scalability and heterogeneity
• Potential new HPC applications• HPDA: High-performance Data Analytics• Deep learning/neuromorphic• Industrial Internet of Things; Industry 4.0• New expert programming (DSLs)• Approximate computing (concerns SW and HW)• ...
• Need for co-design• Interactivity, real-time and bursting to cloud
16
![Page 17: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/17.jpg)
System software and programming environment
• Sustained increase in system complexity, specialization and heterogeneity => intelligent programming environments
• Complex application performance analysis and debugging• Too many threads to analyse• Big gap between anomalous behaviour and source code changes
• Disruptive models of computation• Will Fortran, C, sockets, POSIX, MPI, OpenMP be replaced? (probably not)
• Convergence between storage and memory• Neuromorphic, resistive and quantum computing
• Task-based, or other programming abstraction?
17
![Page 18: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/18.jpg)
Vertical Challenges: Green ICT, Energy and Resiliency• Green ICT
• Computer systems already consume a large part of our natural resources • Improving PUE (power usage effectiveness) and DCIE (data centre
infrastructure efficiency)• Heat reuse• Environmental impact: reducing CO2, life cycle assessment
• Energy• Energy efficiency: efficient software stacks, libraries• NVM, new materials
• Resiliency• Shrinking increases failure rates: 10s atoms per transistor• Enormous number of components• Compiler level resilience, reliability in runtime and programming models, ABFT
(Application/Algorithm Based Fault Tolerance)
18
![Page 19: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/19.jpg)
Data centre and cloud computing
• Convergence of HPC and cloud computing• Data analytics complement simulation• Science entering 4th paradigm
• 1: theory, 2: empirical, 3: simulation• instrument data (LHC/SKA), simulation data, sensor data, human data
• Warehouse-scale computers• Commodity systems and very low prices• Increasingly shared concerns: enterprise – HPC
• Scale out, dynamic resource management• High utilization, parallelization (multicore, GPU,FPGA)• Low latency interconnect, application resiliency• Infrastructure costs, economies of scale
19
![Page 20: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute](https://reader034.vdocuments.site/reader034/viewer/2022042613/5f8a48d7e7e7375bf62a96ff/html5/thumbnails/20.jpg)
Disruptive Technologies will Push Future HPC
• Sustaining Technology (improving HW in ways generally expected) • Continuous CMOS Scaling • Die Stacking - 3D-Chip
• Disruptive Technology in Hardware/VLSI (innovation that creates a new line of HPC HW superseding existing HPC techniques):
• NVM Technologies (Memristors, STT-RAM)• Photonics
• Disruptive Technology (alternative ways of computing)• Resistive Computing• Neuromorphic Computing • Quantum Computing
• Beyond CMOS• Nanotubes• Graphene• Diamond
20