myths and reality bordeaux sept. 2, 2013 · pdf file · 2013-09-02o power...

Xavier VIGOUROUX HOSCAR Workshop Bordeaux Sept. 2, 2013 Current trends and impacts


Page 1: Myths and reality Bordeaux Sept. 2, 2013 · PDF file · 2013-09-02o Power consumption is key for mobile device (100mW) o Very versatile ... • Write code taking into account “execution

Myths and reality

Xavier VIGOUROUX

HOSCAR Workshop

Bordeaux

Sept. 2, 2013

Current trends and impacts


WHO AM I


A VERY QUICK SUMMARY

Bef. ‘95

• Many architectures: RISC, MIPS, SIMD, MIMD …

• Many topologies: torus, hypercubes…

• Examples: Cray T3E, Thinking Machines CM-5, Intel Paragon, MasPar MP-1, Parsytec GCel

Hard time for software developers: no software compatibility (PVM was just appearing)

‘95 – ‘05

• Industrialization

• Productivity

• Performance gains were seamless

Good time for software developers: better performance by just waiting

Aft. ‘05

• Power constraint

• No more ILP gain

• Complex memory hierarchy

Hard time for software developers: getting performance requires work

Article suggestion: http://herbsutter.com/welcome-to-the-jungle/


2004 – END OF ROAD FOR MONOCORE

“The shift is also an indication that the Intel Pentium 4 line had reached its performance limit for the amount of power it consumed.”

N. Brookwood, analyst


2004 – EARLY ADOPTERS FOR GPGPU

2001 – programmable shaders (NVIDIA)

2002 – the name “GPGPU” is coined by Mark Harris

2003 – cloud simulation on GPGPU by Mark Harris

Source: http://gpgpu.org/s2004, http://gpgpu.org/oldsite/data/history.shtml, http://gpgpu.org/about


2005 – TANSTAAFL* … BUT IT’S OVER

* “There ain't no such thing as a free lunch”

Ref: “The Free Lunch Is Over - A Fundamental Turn Toward Concurrency in Software” by Herb Sutter, in Dr. Dobb's Journal


Source: http://www.ncdc.noaa.gov/sotc/global/


Cooling

CPU


COOLING

30°C – 80kW/Rack

6°C – 30kW

12kW/rack


DIRECT LIQUID COOLING

Water is brought as close as possible to the heat source (CPU at 70 °C)

Water can be hotter (as ΔT is key)

Room can be hotter (remove CRAC)

And without any change in the maintenance process:

• CPUs can be changed

• DIMMs can be changed

• Blades can be removed

30°C – 80kW/Rack

PUE < 1.08


DIRECT LIQUID COOLING

Most efficient pure-x86 supercomputer in the Green500 list of June 2013 (Météo-France)

New version with GPGPU is coming


ADIABATIC COOLING

Cooling capacity: 108 kW

Power consumption: 2.8 kW

Water used (estimated hours/year): 472

Water used (estimated m3): 53

50%
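As a rough reading of the adiabatic cooling figures (108 kW of cooling capacity for 2.8 kW of power consumption), the cooling overhead can be estimated with a few lines of arithmetic. This is only a sketch: it assumes the cooler runs at full capacity and that the heat it removes equals the IT load, and the function names are illustrative.

```python
def cooling_overhead(cooling_capacity_kw: float, cooler_power_kw: float) -> float:
    """Power drawn by the cooler per kW of heat removed."""
    if cooling_capacity_kw <= 0:
        raise ValueError("cooling capacity must be positive")
    return cooler_power_kw / cooling_capacity_kw


def cooling_only_pue(it_power_kw: float, cooling_power_kw: float) -> float:
    """PUE restricted to cooling: (IT power + cooling power) / IT power.

    Assumption: every other facility loss (UPS, lighting, ...) is ignored,
    so a real PUE would be higher than this value.
    """
    if it_power_kw <= 0:
        raise ValueError("IT power must be positive")
    return (it_power_kw + cooling_power_kw) / it_power_kw


overhead = cooling_overhead(108.0, 2.8)
pue = cooling_only_pue(108.0, 2.8)
print(f"cooling overhead: {overhead:.1%}")
print(f"cooling-only PUE: {pue:.3f}")
```

Under these assumptions the cooler costs under 3% of the heat it removes, which is the kind of margin that makes a very low PUE reachable.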


Cooling

CPU


FREQUENCY AND POWER CONSUMPTION

Sources: http://blog.stuffedcow.net/2012/10/intel32nm-22nm-core-i5-comparison/

IVB Voltage-Frequency Shmoo plot

Power vs. Frequency, 1.26 V, variable temperature

IVB Power vs. Voltage, 1.6 GHz and 2.4 GHz, 90°C


CPU FREQUENCY

[Chart: CPU frequency by year, 1984 to 2013; vertical axis from 0 to 3500]


[freq] x [#flop/cycle] x [#cores]
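The product above is the usual expression for theoretical peak performance. A minimal sketch of the arithmetic follows; the processor figures are hypothetical and only serve to show that, once frequency stops growing, the other two factors have to carry any increase.

```python
def peak_flops(freq_hz: float, flop_per_cycle: int, cores: int) -> float:
    """Theoretical peak: [freq] x [#flop/cycle] x [#cores]."""
    return freq_hz * flop_per_cycle * cores


# Hypothetical processor: 2.5 GHz, 8 flop/cycle per core, 8 cores.
baseline = peak_flops(2.5e9, 8, 8)

# Same frequency, twice the vector width and twice the cores.
wider = peak_flops(2.5e9, 16, 16)

print(f"baseline: {baseline / 1e9:.0f} Gflop/s")
print(f"wider:    {wider / 1e9:.0f} Gflop/s")
```

The second configuration reaches four times the peak at an unchanged clock, but only code that vectorizes and scales across cores will see it.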


ARCHITECTURE EVOLUTION SUMMARY

Simple processors

HUGE multicore processors

Coprocessors: many very simple units

Simple specific cores


o ARM sells IP, not a physical core

o ARM is present in
  o Mobile (phone, tablet, laptop)
  o Embedded (electronics)
  o Enterprise (switch, disk, printers, servers…)
  o Home (TV, camera, games consoles)

o Shipped chips
  o 2012 - 27 billion
  o 2017 - 41 billion (forecast)

Why ARM cores?

o Price ($1 - $25)
o Power consumption is key for mobile devices (100 mW)
o Very versatile

Source: http://ir.arm.com/phoenix.zhtml?c=197211&p=irol-presentations


HOW TO SIZE A MACHINE

[Money] => [CAPEX] + [OPEX]

[CAPEX] $/flops is decreasing => cheaper => more flops => more Watt

[OPEX] More Watt (from CAPEX and PUE), and $/Watt is increasing => much more expensive

cpu.h

W.h
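The CAPEX + OPEX split can be made concrete with a small calculation. This is a sketch under strong assumptions: OPEX is reduced to electricity only, the load is constant, and every number in the example (purchase price, IT load, electricity price, the second PUE value) is hypothetical.

```python
HOURS_PER_YEAR = 24 * 365


def electricity_opex(it_power_kw: float, pue: float,
                     price_per_kwh: float, years: float) -> float:
    """Electricity bill for a constant IT load, including facility overhead (PUE)."""
    facility_kw = it_power_kw * pue
    return facility_kw * HOURS_PER_YEAR * years * price_per_kwh


def total_cost(capex: float, it_power_kw: float, pue: float,
               price_per_kwh: float, years: float) -> float:
    """[Money] = [CAPEX] + [OPEX], with OPEX limited to electricity."""
    return capex + electricity_opex(it_power_kw, pue, price_per_kwh, years)


# Hypothetical machine: 1,000,000 to buy, 500 kW IT load, 5 years, 0.10 per kWh.
efficient = total_cost(1_000_000, 500, 1.08, 0.10, 5)
inefficient = total_cost(1_000_000, 500, 1.8, 0.10, 5)

print(f"PUE 1.08: {efficient:,.0f}")
print(f"PUE 1.80: {inefficient:,.0f}")
```

With these made-up figures the electricity bill already exceeds the purchase price, and the PUE alone moves the total by more than the price of the machine.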


ELECTRICITY, A SIGNIFICANT PART OF HPC BUDGET


IMPACT OF ELECTRICITY COST INCREASE ON YOU

Soft dev & Users

• Write code taking into account “execution policy”

• Execute your job with a choice of policy (time to result, Joule to result, price to result)

• Pay less by choosing the policy (time to result, Joule to result)

Middleware providers

• You will have to bridge the gap between hardware and users

• Write code taking into account “execution policy” (upward and downward)

• You will face heterogeneity

Cloud and Data Center Director

• You will charge in Watt.hour

• You can adjust your batch scheduler according to policy (green energy, power capping, …)

• Efficient hardware will be valuable (DLC, …)

• You will adjust your electricity contract


WHAT IS TO BE IMPROVED


WHAT DO YOUR APPLICATIONS CONSUME?

o Plugins: RAPL, IPMI (OS) and RRD

o Per job (global value & time slice)

o Per node

o Per user

o New srun parameter to allow CPU frequency scaling for job execution
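Per-job energy accounting from a counter such as RAPL comes down to differencing two readings of a cumulative energy counter that can wrap around. The sketch below illustrates that arithmetic only; it is not the code of any accounting plugin. The microjoule unit and the idea of a maximum counter range follow the Linux powercap interface (`energy_uj`, `max_energy_range_uj`), and the readings used here are hypothetical.

```python
def energy_delta_uj(start_uj: int, end_uj: int, max_range_uj: int) -> int:
    """Energy consumed between two readings of a cumulative counter.

    Assumption: the counter wraps at most once between the two readings.
    """
    if end_uj >= start_uj:
        return end_uj - start_uj
    # The counter wrapped: add what remained before the wrap to the new value.
    return end_uj + (max_range_uj - start_uj)


def average_power_w(delta_uj: int, elapsed_s: float) -> float:
    """Average power in watts over the interval."""
    if elapsed_s <= 0:
        raise ValueError("elapsed time must be positive")
    return delta_uj / 1e6 / elapsed_s


# Hypothetical readings taken at job start and job end, 10 s apart.
job_energy_uj = energy_delta_uj(900_000_000, 150_000_000, 1_000_000_000)
print(f"energy: {job_energy_uj / 1e6} J")
print(f"average power: {average_power_w(job_energy_uj, 10.0)} W")
```

Sampling more often than the counter can wrap is what makes the single-wrap assumption safe on long jobs.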


High Definition Energy Efficiency Monitoring


TOOL CHAIN

Tools

API

Accurate, swift, non intrusive Probes

Hardware (cpu, chipset, …)

User

Metrics

Appli. libs System f

Policy


EXECUTION POLICY

Time to result: as fast as possible (s.)

Watt to result: minimal power consumption (W.h)

Price to result: cheapest execution ($)

Ecoimpact to result: green energy (CO2)
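One way to read the four policies is as four objective functions applied to the same set of candidate executions. The sketch below is an illustration only: the `RunOption` type, the policy names used as keys and all the figures are hypothetical, and a real scheduler would estimate these values rather than receive them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunOption:
    name: str
    seconds: float      # time to result
    watt_hours: float   # Watt to result
    price: float        # price to result
    co2_grams: float    # eco-impact to result


# Each policy is the metric that the chosen execution must minimise.
POLICY_METRIC = {
    "time": lambda run: run.seconds,
    "watt": lambda run: run.watt_hours,
    "price": lambda run: run.price,
    "ecoimpact": lambda run: run.co2_grams,
}


def choose(options: list[RunOption], policy: str) -> RunOption:
    """Return the option that minimises the metric of the given policy."""
    return min(options, key=POLICY_METRIC[policy])


# Hypothetical candidates for the same job.
options = [
    RunOption("max frequency, now", 3600, 500, 0.08, 40),
    RunOption("reduced frequency, now", 4500, 420, 0.07, 34),
    RunOption("max frequency, off-peak", 3600, 500, 0.05, 25),
]

for policy in POLICY_METRIC:
    print(f"{policy:>9}: {choose(options, policy).name}")
```

The same job ends up on three different executions depending on the policy, which is why code and middleware both need to know which policy applies.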


POLICIES COMPARISON

Fastest

time

Watt

EcoImpact

Cheapest

Watt to Result

Renewable Source

Off-peak hours


SESAMES FRAMEWORK: A MULTI-CRITERIA GREEN SCHEDULER FOR CONSUMING LESS AND BETTER

[Chart: price of a kWh (in euros, axis from 0 to 0.6) and amount of CO2 emitted per kWh (in grams, axis from 0 to 120) for each day of the agenda. Series: Price, Pollution]

M.E.M. Diouri, O. Glück and L. Lefèvre, SESAMES: A Smart Grid based framework for consuming less and better in extreme scale infrastructures, GreenCom 2013, 20-23 August 2013, Beijing.


TOOLS

POWER CONSUMPTION (W.h)

• Is a growing share of the budget

• Will be charged to end users

• Is a strict constraint for large systems

• flops/Watt is becoming a metric (see Green500)

TOOLS DO NOT EXIST HERE

COMPUTE PERFORMANCE (flops/s)

• Applications are written for compute performance

• Tools look for performance issues (hot spots, idle time)

• Cost of a run is expressed in core.hour (because it’s easy to measure)

TOOLS EXIST HERE

HDEEM MERGES BOTH WORLDS: precise metrics in performance AND power consumption


TU DRESDEN IN A NUTSHELL

The Technische Universität Dresden is the largest institute of higher education in the city of Dresden, the largest university in Saxony and one of the 10 largest universities in Germany.

#students: 34,993

#foreign students: 3,442 (9.8%)

#employees: 6,123

#faculties: 14

Total budget: € 500M (circa)


INTERESTED IN HDEEM

New Hardware

Developing tools

Developing tools

Using tools

Developing tools


SOLUTION OVERVIEW

[Diagram: cluster overview. Labels: IB Fabric / GigE Fabric; Cluster; SATA / SAS; Login (2); Admin (2); Export (2); SMP (2); FASS – Phase 1; GPU Cluster, 32 nodes, 48x Kepler; Instrumented blades]

[Diagram: power monitor for node global power. Components: NODE with VR CPU (CPU1, CPU2) and two sets of VR DIMM1 to VR DIMM4; Power Measurement FPGA; BMC (Baseboard Management Controller); Operating System (OS); Cluster Manager. Links: i2C - 400Kb / VR; i2C - 400Kb; XBUS 50Mb; Ethernet 100Mb]

Freq: 500Hz - Error: 2%

FPGA API
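With power sampled at a fixed rate (500 Hz on this slide), the energy of a run is the integral of the samples over time. The sketch below uses trapezoidal integration and propagates the stated 2% error as a simple bound; both choices are assumptions made for illustration, not a description of how the FPGA or its API computes energy.

```python
def energy_joules(samples_w: list[float], sample_rate_hz: float) -> float:
    """Trapezoidal integration of evenly spaced power samples (in watts)."""
    if sample_rate_hz <= 0:
        raise ValueError("sample rate must be positive")
    if len(samples_w) < 2:
        return 0.0
    dt = 1.0 / sample_rate_hz
    # End points count for half an interval, inner points for a full one.
    ends = 0.5 * (samples_w[0] + samples_w[-1])
    return dt * (ends + sum(samples_w[1:-1]))


def energy_bounds(energy_j: float, relative_error: float = 0.02) -> tuple[float, float]:
    """Lower and upper bounds, assuming every sample is off by at most relative_error."""
    return energy_j * (1 - relative_error), energy_j * (1 + relative_error)


# Hypothetical trace: 1 second at a constant 200 W, sampled at 500 Hz (501 points).
trace = [200.0] * 501
energy = energy_joules(trace, 500.0)
low, high = energy_bounds(energy)
print(f"energy: {energy:.1f} J ({energy / 3600:.4f} W.h)")
print(f"bounds: {low:.1f} J to {high:.1f} J")
```

At 500 Hz a one-hour job yields 1.8 million samples per probe, so in practice the integration is better done incrementally than on a stored list.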


DON’T FORGET ABOUT THE JUNGLE

Tools

API

Accurate, swift, non intrusive Probes

HW 1

User

Metrics

Appli. libs System f

HW 2 … HW n Hardware (cpu, chipset, …)

Policy


1914

Production: 1908 – 1927

50% of the world market

Low cost ($850 – 4 months)

Easy maintenance

93 min to build one


IF ALL YOU HAVE IS A HAMMER…


Frequency Multicore

Simplification

Diversification

Optimisation