building the world’s largest linux supercomputers · 2003. 7. 8. · building the system ndesign...

23
Building the World’s Largest Linux Supercomputers Kim Clark V.P. Engineering

Upload: others

Post on 08-Oct-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Building the World’s Largest Linux Supercomputers

Kim ClarkV.P. Engineering

Page 2: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Leading the Cluster Revolution

Lawrence Livermore11.2 TFLOPSLinux Networx E22,304 Intel Processors

Los Alamos10 TFLOPSLinux Networx E22,048 Intel Processors

Argonne1.68 TFLOPSLinux Networx E2408 Intel Processors

The Linux Networx TeraFLOPS Club

Page 3: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements
Page 4: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

World’s Most PowerfulLinux Supercomputer

nRpeak=11.2 TF, Rmax=7.634 TF (68% efficiency)

n # 3 Top 500 Supercomputer Listn # 9 Capability Class Systems

(IDC Balanced Ratings)

Page 5: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

LLNL System Facts

n 1152 Evolocity II (.8u) Nodes

n 2.4 GHz Intel® Xeon™ ProcessorsnQuadrics ELAN3 QsNetn 4.6 TBytes Memory

n 138 TBytes Local Storage

n 115 TBytes Global Storage

Page 6: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Building The Systemn Design Prep

n Rack Layoutn Cabling Layoutn Test Plan

n Facility Prepn Power Measurementsn HVAC Estimations

n Built At Factory Firstn Tear Down 3 Daysn Rebuild at LLNL 3 days

Page 7: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Testing The System

n Test PlannBurn In Full RacknBasic Network TestnMPI Stress TestnLinpacknPre-Ship Test SuitenPost-Ship Test SuitenFinal Acceptance

Page 8: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Cooling The Systemn Node

n Redundant Fansn CPU temp is 24°C Under Heavy Load

n Rackn Patented Cooling Chassisn 18° C In , 22°C Out

n Systemn Hot air flow mixn CRAC 80 Tons

Page 9: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Powering The System

n350 Watts per sq. footnTotal: 280 kWnTwo 50 amp feeds/rackn2 PDUs per rackn ICE Box Management

nPower ManagementnTemperature SensingnSerial Console AccessnNode Beaconing

Page 10: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Reliability

Failures Happen!n 1000 Components with a 150,000 hour individual MTBF

have an aggregate MTBF of 150 hours if there is no redundancy!

n To combat low reliability, applications must checkpoint frequently which degrades performance!

Higher Reliability = Higher Performance

Page 11: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

MCR (Multi-programmatic Capability Cluster) Architecturen Scalable Unitsn1 FSU (First Scalable Unit)n11 CNSU (Compute Node Scalable Units)

n NetworksnMPI - QuadricsnManagement - Ethernet 10/100nDebug - SerialnOST - GigEnLogin - GigE

Page 12: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements
Page 13: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Scalable Units

n FSU n60 Compute Nodesn32 Gateway Nodes (QsNet -> GigE)n 2 Login Nodesn 2 Management Nodesn 2 MDS Nodes (Kimberlite for HA)

nCNSU (Compute Node Scalable Units)n96 Compute Nodes Each

Page 14: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

MPI Network

nSingle-rail Quadrics Elan3n5 usec latency for short messagesnGot 325 out of 340 MB/sec

n 3:1 Oversubscribed Fat-tree Networkn12 first tier, 4 second tier switchesnLess than 50% degradation (66%

expected)

Page 15: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Software Stack

LinuxBIOS

LinuxXFS

Lustre

Mpich (Quadrics)

RMS/SLURM

Maui/DPCS

ClusterWorX

Page 16: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

LinuxBIOS

n LinuxBIOS on all nodesn < 2 Seconds from Power-on to Boot Loadern < 1 Minute from Boot to Login CompletenRemote Manageability

nEdit CMOS ParmsnFlash/store ROM image

nSupports Multi-cast Boot

Page 17: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Lustre

Client

MDS

MDS

OST

Page 18: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Linux NetworX Accomplishments With MCR

n Fastest Intel or Linux-based systemnShortest Delivery Time for Top 5 Systemn Full System Pre-Stage At FactorynRemote Imagingn1152 nodes imaged in 15 minutes.

n LinuxBIOSnLess than 1 minute Boot

nHighest CPU density - Evolocity II (.8u)

Page 19: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Dr. Still:“MCR is a great machine.

... Please buy more machines like it.”

Science Runsn 768 CPU Pf3d Laser

simulationn 6 Billion cells!n Able to dump restart

files at 464 MB/sec

Page 20: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

High Productivity Computing Systems

Reliabilityn Superior patented cooling mechanism

n Redundant bearingless fans

n No component less than 150K hour MTBF

(Real World = 3 weeks)

n Onsite Buildup

Scalabilityn MCR added 192 nodes

Page 21: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

High Productivity Computing Systems

Manageability

n HW Management – ICE Box

n SW Management – Clusterworx

n Linux BIOS

n High Density

Performance

n High Performance Linpack / IDC Ratings

Page 22: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

Application for Smaller Systems

n Same Componentsn Same Cooling Technologyn Same QA/Processn Full System Pre-Stage At Factoryn Quicker to implementn Can run smaller systems in ambient

temperatures.n Can add as needs grow.

Page 23: Building the World’s Largest Linux Supercomputers · 2003. 7. 8. · Building The System nDesign Prep nRack Layout nCabling Layout nTest Plan nFacility Prep nPower Measurements

World’s Fastest Linux Supercomputers