scientific computing: an end user’s guide · scientific computing: an end user’s guide. ......

36
Scientific Computing: An End Scientific Computing: An End User’s Guide User’s Guide Paul Reynolds, Weidlinger Associates Inc Paul Reynolds, Weidlinger Associates Inc [email protected] [email protected]

Upload: vunhan

Post on 05-Oct-2018

228 views

Category:

Documents


0 download

TRANSCRIPT

Sci

entif

ic C

ompu

ting:

An

End

S

cien

tific

Com

putin

g: A

n E

nd

Use

r’s G

uide

Use

r’s G

uide

Pau

l Rey

nold

s, W

eidl

inge

r Ass

ocia

tes

Inc

Pau

l Rey

nold

s, W

eidl

inge

r Ass

ocia

tes

Inc

reyn

olds

@ca

.wai

.com

reyn

olds

@ca

.wai

.com

Con

tent

Con

tent

Sci

entif

ic C

ompu

ting

App

licat

ions

Sci

entif

ic C

ompu

ting

App

licat

ions

Mar

ket S

ize

Mar

ket S

ize

Spe

ed o

f Com

puta

tion/

Lim

iting

Fac

tors

Spe

ed o

f Com

puta

tion/

Lim

iting

Fac

tors

Non

Non

-- tech

nica

l Con

side

ratio

nste

chni

cal C

onsi

dera

tions

Par

alle

l Com

putin

gP

aral

lel C

ompu

ting

Sup

erco

mpu

ting

and

Clu

ster

sS

uper

com

putin

g an

d C

lust

ers

Tren

ds in

HP

CTr

ends

in H

PC

App

licat

ions

App

licat

ions

HP

C

HP

C ––

Hig

h P

erfo

rman

ce C

ompu

ting

Hig

h P

erfo

rman

ce C

ompu

ting

Stru

ctur

ed

Stru

ctur

ed N

umer

ics

Num

eric

s––

Fini

te e

lem

ent m

odel

ling,

wea

ther

mod

ellin

gFi

nite

ele

men

t mod

ellin

g, w

eath

er m

odel

ling

Uns

truct

ured

U

nstru

ctur

ed N

umer

ics

Num

eric

s––

Mon

te C

arlo

sim

ulat

ions

Mon

te C

arlo

sim

ulat

ions

Rea

lR

eal -- t

ime

Com

puta

tion

time

Com

puta

tion

––S

peec

h an

d im

age

reco

gniti

onS

peec

h an

d im

age

reco

gniti

onLa

rge

mem

ory

and

Inpu

t/Out

put (

IO)

Larg

e m

emor

y an

d In

put/O

utpu

t (IO

)––

Dat

abas

es, t

rans

actio

n pr

oces

sing

Dat

abas

es, t

rans

actio

n pr

oces

sing

Gra

phic

s an

d D

esig

n G

raph

ics

and

Des

ign

––C

AD

CA

D

Def

initi

ons

Def

initi

ons

Bit

Bit

––a

0 or

a 1

a 0

or a

1––

Byt

e B

yte

––8

bits

8 bi

tsK

iloby

te 1

,000

byt

es (1

0K

iloby

te 1

,000

byt

es (1

033))

Meg

abyt

e 1,

000,

000

byte

s (1

0M

egab

yte

1,00

0,00

0 by

tes

(1066

))G

igab

yte

1,00

0,00

0,00

0 by

tes

(10

Gig

abyt

e 1,

000,

000,

000

byte

s (1

099))

Tera

byte

1,0

00,0

00,0

00,0

00 b

ytes

(10

Tera

byte

1,0

00,0

00,0

00,0

00 b

ytes

(101

212))

Wor

d W

ord

––a

num

ber o

f byt

es, m

achi

ne d

epen

dent

a nu

mbe

r of b

ytes

, mac

hine

dep

ende

nt––

Sin

gle

Pre

cisi

on, 4

byt

es (3

2 bi

ts)

Sin

gle

Pre

cisi

on, 4

byt

es (3

2 bi

ts)

––D

oubl

e P

reci

sion

, 8 b

ytes

(64

bits

)D

oubl

e P

reci

sion

, 8 b

ytes

(64

bits

)

CP

U

CP

U ––

Cen

tral P

roce

ssin

g U

nit

Cen

tral P

roce

ssin

g U

nit

Mas

s m

arke

t chi

ps, l

ow p

rice,

hig

h vo

lum

eM

ass

mar

ket c

hips

, low

pric

e, h

igh

volu

me

––In

tel P

entiu

m4,

AM

D

Inte

l Pen

tium

4, A

MD

Ath

lon

Ath

lon

32 b

it In

tege

r, 64

to 8

0 bi

t Flo

atin

g P

oint

32 b

it In

tege

r, 64

to 8

0 bi

t Flo

atin

g P

oint

Ser

ver/w

orks

tatio

n ch

ips,

typi

cally

low

vol

ume

Ser

ver/w

orks

tatio

n ch

ips,

typi

cally

low

vol

ume

(<10

0x),

usua

lly h

igh

pric

e (>

10x)

, old

er p

roce

ss(<

100x

), us

ually

hig

h pr

ice

(>10

x), o

lder

pro

cess

––IB

M P

ower

4+IB

M P

ower

4+C

ompa

q A

lpha

Com

paq

Alp

ha––

Inte

l Ita

nium

Inte

l Ita

nium

SG

I MIP

SS

GI M

IPS

––A

MD

A

MD

Opt

eron

Opt

eron

HP

PA

HP

PA

-- RIS

CR

ISC

––S

UN

S

UN

Ultr

aSpa

rcU

ltraS

parc

64 b

it In

tege

r and

Flo

atin

g P

oint

64 b

it In

tege

r and

Flo

atin

g P

oint

Ser

ver S

hipm

ents

Ser

ver S

hipm

ents

Serv

er S

yste

m C

ost

Sour

ce: ‘

64 B

it M

arke

t Out

look

’, In

sigh

t64,

Nat

han

Bro

okw

ood,

pre

sent

ed a

t AM

D D

evel

oper

Sym

posi

um 2

002

Ser

ver S

hipm

ents

Ser

ver S

hipm

ents

4.3

Mill

ion

Uni

ts S

hipp

ed

in 2

001

$52.

5 B

illio

n Sa

les

in 2

001

Sour

ce: ‘

64 B

it M

arke

t Out

look

’, In

sigh

t64,

Nat

han

Bro

okw

ood,

pre

sent

ed a

t AM

D D

evel

oper

Sym

posi

um 2

002

Diff

eren

t Typ

es o

f Num

bers

Diff

eren

t Typ

es o

f Num

bers

Inte

ger

Inte

ger ––

who

le n

umbe

rs

who

le n

umbe

rs ––

0,1,

2,3,

……

.,n0,

1,2,

3,…

….,n

Floa

ting

Poi

nt

Floa

ting

Poi

nt ––

cont

ain

deci

mal

poi

ntco

ntai

n de

cim

al p

oint

––1.

5, 3

.141

5926

, 99.

9999

91.

5, 3

.141

5926

, 99.

9999

9D

iffer

ent c

ompu

tatio

nal p

arts

of C

PU

use

d fo

r eac

hD

iffer

ent c

ompu

tatio

nal p

arts

of C

PU

use

d fo

r eac

h

Sin

gle

Pre

cisi

on

Sin

gle

Pre

cisi

on ––

32 b

it 32

bit

––223

232

––In

tege

r 4,2

94,9

67,2

96 (4

x10

Inte

ger 4

,294

,967

,296

(4x1

099))

––Fl

oatin

g P

oint

Fl

oatin

g P

oint

±±1010

3838

Dou

ble

Pre

cisi

on

Dou

ble

Pre

cisi

on ––

64 b

it 64

bit

––226

464

––In

tege

r 18,

446,

744,

073,

709,

551,

616

(18x

10In

tege

r 18,

446,

744,

073,

709,

551,

616

(18x

101818

))––

Floa

ting

Poi

nt

Floa

ting

Poi

nt ±±

101030

830

8

––In

tege

r val

ues

used

for m

emor

y al

loca

tion

Inte

ger v

alue

s us

ed fo

r mem

ory

allo

catio

n––

32 b

it sy

stem

s lim

ited

to 4

GB

ytes

RA

M32

bit

syst

ems

limite

d to

4G

Byt

es R

AM

Cos

t of 4

C

ost o

f 4 G

Byt

esG

Byt

esR

AM

RA

M

Sour

ce: ‘

64 B

it M

arke

t Out

look

’, In

sigh

t64,

Nat

han

Bro

okw

ood,

pre

sent

ed a

t AM

D D

evel

oper

Sym

posi

um 2

002

Com

putin

g S

peed

Com

putin

g S

peed

Her

tz (H

z)

Her

tz (H

z) --

cycl

e pe

r sec

ond

cycl

e pe

r sec

ond

––G

igah

ertz

(GH

z) 1

,000

,000

,000

cyc

les/

sG

igah

ertz

(GH

z) 1

,000

,000

,000

cyc

les/

s––

1 cy

cle

per n

anos

econ

d (n

s)1

cycl

e pe

r nan

osec

ond

(ns)

––1

ns is

to 1

sec

ond

as 1

sec

ond

is to

31.

5 ye

ars

1 ns

is to

1 s

econ

d as

1 s

econ

d is

to 3

1.5

year

s––

Ligh

t tra

vels

30

cm in

1 n

sLi

ght t

rave

ls 3

0 cm

in 1

ns

Mod

ern

CP

Us

rang

e fro

m 1

to 3

GH

z (1

H ‘0

3)M

oder

n C

PU

s ra

nge

from

1 to

3 G

Hz

(1H

‘03)

––U

se 0

.13

to 0

.18

Use

0.1

3 to

0.1

8 µµ m

trac

ksm

trac

ksV

isib

le li

ght 0

.4 to

0.8

V

isib

le li

ght 0

.4 to

0.8

µµm

wav

elen

gth

m w

avel

engt

h––

50 to

200

Mill

ion

trans

isto

rs in

100

to 5

00 m

m50

to 2

00 M

illio

n tra

nsis

tors

in 1

00 to

500

mm

22

Com

putin

g S

peed

Com

putin

g S

peed

CP

U P

erfo

rman

ceC

PU

Per

form

ance

––(C

lock

Rat

e) x

(IP

C)

(Clo

ck R

ate)

x (I

PC

)––

IPC

IP

C --

Inst

ruct

ions

per

Cyc

leIn

stru

ctio

ns p

er C

ycle

Mac

hine

and

pro

blem

dep

ende

ntM

achi

ne a

nd p

robl

em d

epen

dent

FLO

PS

FL

OP

S ––

Floa

ting

Poi

nt O

pera

tion

per S

econ

dFl

oatin

g P

oint

Ope

ratio

n pe

r Sec

ond

––G

igaF

lop

Gig

aFlo

por

or

GFl

opG

Flop

––1

billi

on F

lops

1 bi

llion

Flo

ps––

Hig

h E

nd D

eskt

op P

C (P

4 or

H

igh

End

Des

ktop

PC

(P4

or A

thlo

nA

thlo

n ) 4

to 7

) 4

to 7

GFl

ops

GFl

ops

Pot

entia

l P

oten

tial ––

actu

al d

epen

ds u

pon

appl

icat

ion

and

actu

al d

epen

ds u

pon

appl

icat

ion

and

arch

itect

ure

arch

itect

ure

Per

form

ance

of H

PC

CP

Us

Per

form

ance

of H

PC

CP

Us

Sour

ce: ‘

The

Bat

tle in

64

bit L

and,

200

3 an

d B

eyon

d’, P

aul D

eMon

e, w

ww

.real

wor

ldte

ch.c

om

PZF

lex

and

Des

ktop

CP

Us

PZF

lex

and

Des

ktop

CP

Us

App

roxi

mat

e Pe

rfor

man

ce o

f Var

ious

Pro

cess

ors

for P

ZFle

x

0

500

1000

1500

2000

2500

3000

3500

1996 - SGI Origin

1996 - Ppro 200MHz

1997 - Pentium II 300MHz

1998 - Pentium II 450MHz

1999 - Pentium III 650MHz

1999 - Athlon 800MHz

2000 - Pentium III 1GHz

2000 - Pentium 4 1.4GHz

2000 - Athlon 1.2GHz

2001 - Pentium 4 2GHz

2001 - Athlon 1.6GHz

2002 - Pentium 4 3GHz

2002 - Athlon 2.13 GHZ

Year

and

Sys

tem

Approximate Performance

$500

k

$2k

to $

3k

$1k

to $

3k

$500

k

$2k

to $

3k

$1k

to $

3k

PZF

lex

PZF

lex

and

Des

ktop

CP

Us

and

Des

ktop

CP

Us

Exp

licit

elem

ent u

se o

f mem

ory

scal

es li

near

ly

Exp

licit

elem

ent u

se o

f mem

ory

scal

es li

near

ly

with

num

ber o

f ele

men

tsw

ith n

umbe

r of e

lem

ents

––1

mill

ion

2D e

lem

ents

use

~ 6

4 M

B1

mill

ion

2D e

lem

ents

use

~ 6

4 M

B––

1 m

illio

n 3D

ele

men

ts u

se ~

90

MB

1 m

illio

n 3D

ele

men

ts u

se ~

90

MB

––Ty

pica

l mod

els

solv

e in

min

utes

to a

few

hou

rsTy

pica

l mod

els

solv

e in

min

utes

to a

few

hou

rs––

Larg

e m

odel

s so

lve

in 8

to 2

4 ho

urs

Larg

e m

odel

s so

lve

in 8

to 2

4 ho

urs

––M

ulti

Mul

ti --m

illio

n el

emen

t sim

ulat

ions

on

stan

dard

m

illio

n el

emen

t sim

ulat

ions

on

stan

dard

de

skto

p P

Cde

skto

p P

C

Impr

ovin

g P

erfo

rman

ceIm

prov

ing

Per

form

ance

Impr

ove

codi

ngIm

prov

e co

ding

Impr

ove

com

pile

rIm

prov

e co

mpi

ler

Clo

ck s

peed

C

lock

spe

ed ––

appr

ox +

50%

per

yea

rap

prox

+50

% p

er y

ear

IPC

IPC ––C

PU

floa

ting

poin

t uni

tsC

PU

floa

ting

poin

t uni

tsIn

crea

se N

umbe

r, pi

pelin

ed (1

cal

c pe

r cyc

le)

Incr

ease

Num

ber,

pipe

lined

(1 c

alc

per c

ycle

)––

Vec

tor u

nits

Vec

tor u

nits

Sin

gle

Inst

ruct

ion

Mul

tiple

Dat

a, S

IMD

Sin

gle

Inst

ruct

ion

Mul

tiple

Dat

a, S

IMD

––S

SE

, SS

E2,

3D

Now

!, S

SE

, SS

E2,

3D

Now

!, A

ltive

cA

ltive

cN

ot a

pplic

able

to a

ll da

ta s

truct

ures

Not

app

licab

le to

all

data

stru

ctur

esE

xtra

pro

gram

min

g w

ork

requ

ired

Ext

ra p

rogr

amm

ing

wor

k re

quire

d

Impr

ovin

g P

erfo

rman

ceIm

prov

ing

Per

form

ance

Dat

a D

ata

ii nte

grnt

egr ii t

yty––

EC

C

EC

C ––

Err

or C

orre

ctE

rror

Cor

rect

ii ng

mem

ory

ng m

emor

y1

b1

b iit d

etec

tt d

etec

t ii on,

2 b

on, 2

bii t

dete

ctt d

etec

t ii on

and

corr

ect

on a

nd c

orre

ctii o

nonC

hC

h iipp k

ikillll

––de

tect

and

‘hid

e’ e

rror

dete

ct a

nd ‘h

ide’

err

or-- p

rone

cel

lspr

one

cells

Ens

ure

com

puta

tiona

l uni

ts ‘f

ed’ w

ith d

ata

Ens

ure

com

puta

tiona

l uni

ts ‘f

ed’ w

ith d

ata

––D

ata

read

in, c

ompu

tatio

n, re

sult

writ

ten

Dat

a re

ad in

, com

puta

tion,

resu

lt w

ritte

n––

Mem

ory

subs

yste

m im

porta

ntM

emor

y su

bsys

tem

impo

rtant

Ban

dwid

th (B

W),

rate

of d

ata

trans

fer t

o C

PU

Ban

dwid

th (B

W),

rate

of d

ata

trans

fer t

o C

PU

––2.

1GB

/s to

5.3

GB

/s2.

1GB

/s to

5.3

GB

/s––

Impr

ove

by ra

te (2

66M

Hz,

333

MH

z), p

ath

wid

th (6

4, 1

28bi

t)Im

prov

e by

rate

(266

MH

z, 3

33M

Hz)

, pat

h w

idth

(64,

128

bit)

Impr

ovin

g P

erfo

rman

ceIm

prov

ing

Per

form

ance

Ens

ure

com

puta

tiona

l uni

ts ‘f

ed’ w

ith d

ata

Ens

ure

com

puta

tiona

l uni

ts ‘f

ed’ w

ith d

ata

––La

tenc

y, ti

me

from

requ

est f

rom

dat

a to

arr

ival

Late

ncy,

tim

e fro

m re

ques

t fro

m d

ata

to a

rriv

al50

to 2

00 n

s 50

to 2

00 n

s ––

50 to

200

cyc

les

at 1

GH

z50

to 2

00 c

ycle

s at

1G

Hz

––P

roce

ssor

sits

idle

unt

il da

ta a

rriv

esP

roce

ssor

sits

idle

unt

il da

ta a

rriv

es

Dat

a D

ata

pref

etch

pref

etch

––lo

ad d

ata

ahea

d of

tim

e, tr

ade

BW

for

lo

ad d

ata

ahea

d of

tim

e, tr

ade

BW

for

la

tenc

yla

tenc

yB

ranc

h pr

edic

tion

Bra

nch

pred

ictio

n ––

gues

ses

wha

t to

calc

ulat

egu

esse

s w

hat t

o ca

lcul

ate

Cac

he

Cac

he ––

smal

l, fa

st m

emor

y cl

ose

to C

PU

smal

l, fa

st m

emor

y cl

ose

to C

PU

––L1

32

kBL1

32

kB-- 1

28kB

, L2

128k

B12

8kB

, L2

128k

B-- 2

MB

2MB

Impr

ovin

g P

erfo

rman

ceIm

prov

ing

Per

form

ance

Sym

met

ric M

ulti

Thre

adin

g (S

MT)

Sym

met

ric M

ulti

Thre

adin

g (S

MT)

––U

ses

Use

s un

deru

tilis

edun

deru

tilis

edco

mpu

tatio

nal u

nits

(Int

eger

or

com

puta

tiona

l uni

ts (I

nteg

er o

r Fl

oatin

g P

oint

)Fl

oatin

g P

oint

)––

E.g

. Tas

k1 a

nd 2

, 50%

Inte

ger,

50%

FP e

ach

E.g

. Tas

k1 a

nd 2

, 50%

Inte

ger,

50%

FP e

ach

Trad

ition

ally

run

sequ

entia

llyTr

aditi

onal

ly ru

n se

quen

tially

Run

bot

h si

mul

tane

ousl

y, u

se 1

00%

INT

and

FPR

un b

oth

sim

ulta

neou

sly,

use

100

% IN

T an

d FP

––E

ffect

ivel

y do

uble

per

form

ance

in th

eory

Effe

ctiv

ely

doub

le p

erfo

rman

ce in

theo

ry

Onl

y P

entiu

m4

curr

ently

impl

emen

ts S

MT

Onl

y P

entiu

m4

curr

ently

impl

emen

ts S

MT

––O

nly

~20%

impr

ovem

ent d

ue to

poo

r res

ourc

esO

nly

~20%

impr

ovem

ent d

ue to

poo

r res

ourc

es

No

adva

ntag

e fo

r FE

A (9

0%+

FP)

No

adva

ntag

e fo

r FE

A (9

0%+

FP)

Mor

e us

eful

for m

ixed

app

licat

ions

, e.g

. dat

abas

eM

ore

usef

ul fo

r mix

ed a

pplic

atio

ns, e

.g. d

atab

ase

Oth

er C

onsi

dera

tions

Oth

er C

onsi

dera

tions

Rel

iabi

lity

Rel

iabi

lity

––99

.9%

upt

ime,

9 h

ours

/yea

r dow

ntim

e99

.9%

upt

ime,

9 h

ours

/yea

r dow

ntim

eU

nacc

epta

ble

for c

ompa

ny tr

ansa

ctio

n se

rver

Una

ccep

tabl

e fo

r com

pany

tran

sact

ion

serv

erC

ompo

nent

cou

nt, 1

00 u

nits

, 5 w

eeks

/yea

rC

ompo

nent

cou

nt, 1

00 u

nits

, 5 w

eeks

/yea

r––

Hig

h en

d sy

stem

s qu

ote

99.9

99%

upt

ime

Hig

h en

d sy

stem

s qu

ote

99.9

99%

upt

ime

Pow

er a

nd C

oolin

gP

ower

and

Coo

ling

––1

CP

U, 5

0 to

150

Wat

ts, 2

001

CP

U, 5

0 to

150

Wat

ts, 2

00-- 2

50W

per

sys

tem

250W

per

sys

tem

––20

sys

tem

s, 5

kW+

20 s

yste

ms,

5kW

+ ––

adeq

uate

pow

er a

nd A

C?

adeq

uate

pow

er a

nd A

C?

––U

PS

U

PS

––ba

ckup

bat

tery

in c

ase

of p

ower

failu

reba

ckup

bat

tery

in c

ase

of p

ower

failu

reB

acku

pB

acku

p––

GB

ytes

GB

ytes

of d

ata

per s

imul

atio

n, s

aved

to d

isk

of d

ata

per s

imul

atio

n, s

aved

to d

isk

––TB

of d

ata

tota

l, di

sk b

acku

p R

AID

/Mirr

or, t

ape

back

upTB

of d

ata

tota

l, di

sk b

acku

p R

AID

/Mirr

or, t

ape

back

upA

dmin

istra

tion

Adm

inis

tratio

n ––

can

be s

igni

fican

tca

n be

sig

nific

ant

Oth

er C

onsi

dera

tions

Oth

er C

onsi

dera

tions

Ofte

n ra

pid

Ofte

n ra

pid

obsc

elec

ence

obsc

elec

ence

Per

cent

age

time

Per

cent

age

time

utilis

edut

ilised

––sh

are

reso

urce

?sh

are

reso

urce

?O

pera

ting

Sys

tem

Ope

ratin

g S

yste

m––

Win

dow

sW

indo

ws

Stro

ng c

ompi

ler a

nd a

pplic

atio

n su

ppor

t, co

st p

er u

nit

Stro

ng c

ompi

ler a

nd a

pplic

atio

n su

ppor

t, co

st p

er u

nit

(ver

y hi

gh fo

r >2C

PU

), do

es n

ot s

cale

, var

iabl

e (v

ery

high

for >

2CP

U),

does

not

sca

le, v

aria

ble

stab

ility

, usu

ally

sup

ports

late

st h

ardw

are,

mos

t use

rs

stab

ility

, usu

ally

sup

ports

late

st h

ardw

are,

mos

t use

rs

com

forta

ble

com

forta

ble

––U

NIX

UN

IX Com

pile

r sup

port

varie

s by

sys

tem

/C

ompi

ler s

uppo

rt va

ries

by s

yste

m/ fl

avou

rfla

vour

, ver

y , v

ery

stab

le, s

cale

s w

ell,

mor

e co

nser

vativ

est

able

, sca

les

wel

l, m

ore

cons

erva

tive

––Li

nux

Linu

x ––

free,

rapi

dly

impr

ovin

g, v

aria

ble

supp

ort

free,

rapi

dly

impr

ovin

g, v

aria

ble

supp

ort

Wor

ld’s

Top

5 S

uper

com

pute

rsW

orld

’s T

op 5

Sup

erco

mpu

ters

Inte

lIn

tel

5694

5694

Linu

x N

etw

ork

Linu

x N

etw

ork

LLN

LLL

NL

IBM

IBM

7226

7226

ASC

I Whi

teA

SCI W

hite

LLN

LLL

NL

Alph

aAl

pha

7727

7727

ASC

I/QA

SCI/Q

LAN

L (2

& 3

)LA

NL

(2 &

3)

NEC

NEC

3586

035

860

Eart

hSim

ulat

orEa

rthS

imul

ator

Japa

n (C

ost $

400m

illio

n)Ja

pan

(Cos

t $40

0mill

ion)

Chi

pC

hip

GFl

ops

GFl

ops

Syst

em N

ame/

Loca

tion

Syst

em N

ame/

Loca

tion

Mar

ket s

hare

of T

op 5

00, p

erce

ntag

e of

pro

cess

ing

pow

er, N

ov 2

002

IBM

–20

%Su

n –

16%

HP

–21

%SG

I –8%

Alp

ha –

22%

Inte

l –10

%

Par

alle

l Com

putin

gP

aral

lel C

ompu

ting

Use

mul

tiple

CP

Us/

syst

ems

per c

alcu

latio

nU

se m

ultip

le C

PU

s/sy

stem

s pe

r cal

cula

tion

––O

pen

MP

(SM

P)

Ope

n M

P (S

MP

)S

hare

d m

emor

y an

d O

pera

ting

Sys

tem

(OS

)S

hare

d m

emor

y an

d O

pera

ting

Sys

tem

(OS

)––

2 C

PU

sys

tem

s re

lativ

ely

inex

pens

ive,

4, 8

, 16+

2

CP

U s

yste

ms

rela

tivel

y in

expe

nsiv

e, 4

, 8, 1

6+

poss

ible

poss

ible

––M

emor

y ba

ndw

idth

ofte

n sh

ared

, low

ers

perfo

rman

ceM

emor

y ba

ndw

idth

ofte

n sh

ared

, low

ers

perfo

rman

ceH

igh

end

syst

ems

use

prop

rieta

ry c

onne

ctio

nsH

igh

end

syst

ems

use

prop

rieta

ry c

onne

ctio

nsE

ffect

ive

but e

xpen

sive

Effe

ctiv

e bu

t exp

ensi

veR

epea

ted

oper

atio

ns e

xecu

ted

in p

aral

lel

Rep

eate

d op

erat

ions

exe

cute

d in

par

alle

l––

Fine

gra

in

Fine

gra

in p

aral

lelis

atio

npa

ralle

lisat

ion

Alm

ost a

lway

s <1

00%

effi

cien

t A

lmos

t alw

ays

<100

% e

ffici

ent

––La

rger

pro

blem

s La

rger

pro

blem

s pa

ralle

lise

para

llelis

ebe

tter

bette

r

Par

alle

l Com

putin

gP

aral

lel C

ompu

ting

Mes

sage

Pas

sing

(MP

I)M

essa

ge P

assi

ng (M

PI)

––D

ata

sent

bet

wee

n m

ultip

le ru

nnin

g D

ata

sent

bet

wee

n m

ultip

le ru

nnin

g pr

ogra

mm

espr

ogra

mm

es––

Spl

it pr

oble

m in

to p

iece

sS

plit

prob

lem

into

pie

ces

Coa

rse

grai

n C

oars

e gr

ain

para

llelis

atio

npa

ralle

lisat

ion

Sim

ilar i

ssue

s to

OM

PS

imila

r iss

ues

to O

MP

––A

llow

s fo

r lar

ge ‘c

lust

ers’

of i

nexp

ensi

ve c

omm

odity

A

llow

s fo

r lar

ge ‘c

lust

ers’

of i

nexp

ensi

ve c

omm

odity

sy

stem

s sy

stem

s ––

‘Beo

wul

f’, L

inux

‘Beo

wul

f’, L

inux

Stro

ng c

ompi

ler s

uppo

rt es

sent

ial

Stro

ng c

ompi

ler s

uppo

rt es

sent

ial

Ben

efits

of C

lust

ers

Ben

efits

of C

lust

ers

Sca

ling

of R

esou

rces

Sca

ling

of R

esou

rces

Sha

ring

of R

esou

rces

Sha

ring

of R

esou

rces

Bes

t Pric

e/P

erfo

rman

ce R

atio

(PP

R)

Bes

t Pric

e/P

erfo

rman

ce R

atio

(PP

R)

––P

PR

is C

onst

ant w

ith G

row

ing

Sys

tem

Siz

eP

PR

is C

onst

ant w

ith G

row

ing

Sys

tem

Siz

eFl

exib

ility

Flex

ibili

tyH

igh

Avai

labi

lity

Hig

h Av

aila

bilit

yFa

ult R

esili

ence

Faul

t Res

ilien

ce

Tren

ds in

HP

CTr

ends

in H

PC

Com

mod

ity h

ardw

are

incr

easi

ngly

C

omm

odity

har

dwar

e in

crea

sing

ly fa

vour

edfa

vour

ed––

Inte

l/AM

D d

eskt

op C

PU

s ve

ry p

ower

ful

Inte

l/AM

D d

eskt

op C

PU

s ve

ry p

ower

ful

Ade

quat

e fo

r man

y H

PC

app

licat

ions

, ove

rkill

for e

mai

lA

dequ

ate

for m

any

HP

C a

pplic

atio

ns, o

verk

ill fo

r em

ail

4GB

lim

it m

ajor

pro

blem

4GB

lim

it m

ajor

pro

blem

––‘B

eow

ulf’

clus

ters

mor

e co

mm

on‘B

eow

ulf’

clus

ters

mor

e co

mm

onR

ecen

t clu

ster

pur

chas

e at

WA

IR

ecen

t clu

ster

pur

chas

e at

WA

I––

10 d

ual A

MD

10

dua

l AM

D A

thlo

nA

thlo

n , 4

GB

ea,

~10

0GFl

ops

peak

tota

l, 4

GB

ea,

~10

0GFl

ops

peak

tota

l––

Mas

ter N

ode,

Bac

kup

Mas

ter,

1TB

RA

ID d

isk,

M

aste

r Nod

e, B

acku

p M

aste

r, 1T

B R

AID

dis

k,

Cab

inet

, UP

S, S

oftw

are,

Sup

port

Cab

inet

, UP

S, S

oftw

are,

Sup

port

––~$

40,0

00 (

~$40

,000

( ₤₤25

,000

)25

,000

)

Tren

ds in

HP

CTr

ends

in H

PC

New

N

ew fa

bsfa

bs$1

billi

on+,

500

+ de

sign

team

$1bi

llion

+, 5

00+

desi

gn te

amH

PC

sal

es a

lone

can

not s

uppo

rt R

&D

HP

C s

ales

alo

ne c

anno

t sup

port

R&

D––

Com

pani

es c

ombi

ne d

eskt

op/H

PC

R&

DC

ompa

nies

com

bine

des

ktop

/HP

C R

&D

Des

ktop

chi

ps ‘c

rippl

ed’ H

PC

chi

psD

eskt

op c

hips

‘crip

pled

’ HP

C c

hips

––E

.gE

.gIB

M P

ower

4+ a

nd P

ower

PC

970

(App

le)

IBM

Pow

er4+

and

Pow

erP

C97

0 (A

pple

)

––C

ompa

nies

dro

p ow

n R

&D

alto

geth

erC

ompa

nies

dro

p ow

n R

&D

alto

geth

erE

.g. S

GI d

rop

MIP

S, H

P d

rop

PA

E.g

. SG

I dro

p M

IPS

, HP

dro

p P

A-- R

ISC

, use

Itan

ium

RIS

C, u

se It

aniu

mC

ompa

q dr

ops

Alp

haC

ompa

q dr

ops

Alp

ha––

Com

pani

es p

ool R

&D

reso

urce

sC

ompa

nies

poo

l R&

D re

sour

ces

E.g

. Int

el a

nd H

P o

n Ita

nium

, AM

D a

nd IB

M o

n K

9E

.g. I

ntel

and

HP

on

Itani

um, A

MD

and

IBM

on

K9

Tren

ds in

HP

CTr

ends

in H

PC

Inte

l IA

64, I

tani

umIn

tel I

A64

, Ita

nium

––E

PIC

, com

plet

ely

new

arc

hite

ctur

eE

PIC

, com

plet

ely

new

arc

hite

ctur

eP

oten

tially

ver

y po

wer

ful F

P p

erfo

rman

ceP

oten

tially

ver

y po

wer

ful F

P p

erfo

rman

ceR

equi

res

exce

llent

com

pile

rs, s

till w

aitin

gR

equi

res

exce

llent

com

pile

rs, s

till w

aitin

g––

Rew

orke

d co

deR

ewor

ked

code

Larg

e, e

xpen

sive

, hot

, low

clo

ck s

peed

Larg

e, e

xpen

sive

, hot

, low

clo

ck s

peed

Inco

mpa

tible

with

exi

stin

g O

S/S

oftw

are

Inco

mpa

tible

with

exi

stin

g O

S/S

oftw

are

Abo

ut 4

yea

rs la

teA

bout

4 y

ears

late

Sol

d ~6

000

in 2

yea

rsS

old

~600

0 in

2 y

ears

Req

uire

d en

orm

ous

inve

stm

ent i

n bo

th R

&D

and

R

equi

red

enor

mou

s in

vest

men

t in

both

R&

D a

nd

mar

ketin

gm

arke

ting

Tren

ds in

HP

CTr

ends

in H

PC

Inte

lIn

tel

––C

ontin

ue s

peed

ram

p of

Pen

tium

4C

ontin

ue s

peed

ram

p of

Pen

tium

4P

ush

prem

ium

P4

‘Xeo

ns’ i

nto

low

end

HP

C m

arke

tP

ush

prem

ium

P4

‘Xeo

ns’ i

nto

low

end

HP

C m

arke

t

IBM

IBM ––P

ower

4+ d

ual c

ore

Pow

er4+

dua

l cor

e ––

2CP

U o

n di

e2C

PU

on

die

Pow

erP

C97

0 de

skto

p ve

rsio

nP

ower

PC

970

desk

top

vers

ion

––O

nly

1 C

PU

, add

SIM

D

Onl

y 1

CP

U, a

dd S

IMD

––ch

eap,

use

d by

App

le in

G5

chea

p, u

sed

by A

pple

in G

5

––A

lso

sell

syst

ems

with

Inte

l and

AM

D c

hips

Als

o se

ll sy

stem

s w

ith In

tel a

nd A

MD

chi

ps

IBM

Pow

er S

erie

sIB

M P

ower

Ser

ies

Sour

ce: ‘

The

Bat

tle in

64

bit L

and,

200

3 an

d B

eyon

d’, P

aul D

eMon

e, w

ww

.real

wor

ldte

ch.c

om

Tren

ds in

HP

CTr

ends

in H

PC

SG

I, H

PS

GI,

HP

––S

topp

ed o

wn

deve

lopm

ent t

o us

e Ita

nium

Sto

pped

ow

n de

velo

pmen

t to

use

Itani

umS

ever

al y

ears

late

cau

ses

loss

of p

erfo

rman

ceS

ever

al y

ears

late

cau

ses

loss

of p

erfo

rman

ce––

Die

shr

ink

exis

ting

chip

s D

ie s

hrin

k ex

istin

g ch

ips

––ch

eap

alte

rnat

ive

chea

p al

tern

ativ

e

SU

NS

UN

––D

evel

opm

ent o

f lat

est c

hips

slo

wD

evel

opm

ent o

f lat

est c

hips

slo

wS

igni

fican

t lag

in p

erfo

rman

ceS

igni

fican

t lag

in p

erfo

rman

ce‘C

oast

ing’

on

IT in

ertia

‘Coa

stin

g’ o

n IT

iner

tiaH

eadl

ess

chic

ken

stra

tegy

Hea

dles

s ch

icke

n st

rate

gy

Com

paq

Com

paq

––M

erge

d w

ith H

P, g

ave

Alp

ha to

Inte

lM

erge

d w

ith H

P, g

ave

Alp

ha to

Inte

l

Tren

ds in

HP

CTr

ends

in H

PC

AM

DA

MD

––A

thlo

nA

thlo

nfir

st n

onfir

st n

on-- In

tel C

PU

to ta

ke te

chni

cal a

nd c

ost

Inte

l CP

U to

take

tech

nica

l and

cos

t le

ad in

des

ktop

mar

ket

lead

in d

eskt

op m

arke

t––

Dev

elop

ed 6

4bit

chip

D

evel

oped

64b

it ch

ip ––

exte

nd x

86ex

tend

x86

Com

patib

le w

ith a

ll ex

istin

g de

skto

p O

S a

nd s

oftw

are

Com

patib

le w

ith a

ll ex

istin

g de

skto

p O

S a

nd s

oftw

are

Offe

r ‘ch

eap’

upg

rade

to 6

4 bi

tsO

ffer ‘

chea

p’ u

pgra

de to

64

bits

Des

igne

d w

ith s

erve

r/MP

mar

ket i

n m

ind

Des

igne

d w

ith s

erve

r/MP

mar

ket i

n m

ind

––C

utC

ut-- d

own

vers

ion

for d

eskt

opdo

wn

vers

ion

for d

eskt

op

––H

igh

band

wid

th lo

w la

tenc

y da

ta c

onne

ctio

nsH

igh

band

wid

th lo

w la

tenc

y da

ta c

onne

ctio

nsH

yper

trans

port

Hyp

ertra

nspo

rt––

allo

ws

BW

to s

cale

with

no.

of C

PU

sal

low

s B

W to

sca

le w

ith n

o. o

f CP

Us

––S

imila

r cos

t to

desk

top

Sim

ilar c

ost t

o de

skto

pQ

uad

CP

U b

ox w

ith 8

GB

< $

10,0

00Q

uad

CP

U b

ox w

ith 8

GB

< $

10,0

00––

Wor

king

clo

sely

with

IBM

on

desi

gn a

nd m

anuf

actu

reW

orki

ng c

lose

ly w

ith IB

M o

n de

sign

and

man

ufac

ture

Sin

gle

AM

D

Sin

gle

AM

D O

pter

onO

pter

on

Qua

d A

MD

Q

uad

AM

D O

pter

onO

pter

onS

yste

mS

yste

m

2003

/200

4 H

PC

CP

Us

2003

/200

4 H

PC

CP

Us

Sour

ce: ‘

The

Bat

tle in

64

bit L

and,

200

3 an

d B

eyon

d’, P

aul D

eMon

e, w

ww

.real

wor

ldte

ch.c

om

2003

/200

4 H

PC

CP

Us

2003

/200

4 H

PC

CP

Us

Sour

ce: ‘

The

Bat

tle in

64

bit L

and,

200

3 an

d B

eyon

d’, P

aul D

eMon

e, w

ww

.real

wor

ldte

ch.c

om

Sum

mar

yS

umm

ary

Wid

e ra

nge

of h

ardw

are

and

softw

are

choi

ceW

ide

rang

e of

har

dwar

e an

d so

ftwar

e ch

oice

––N

ot a

ll ar

e cr

eate

d eq

ual

Not

all

are

crea

ted

equa

lK

now

you

r app

licat

ion

Kno

w y

our a

pplic

atio

n––

CP

U, M

emor

y bo

und?

CP

U, M

emor

y bo

und?

Mar

ket c

onso

lidat

ion,

M

arke

t con

solid

atio

n, c

omm

oditi

satio

nco

mm

oditi

satio

nV

ery

pow

erfu

l sys

tem

s av

aila

ble

Ver

y po

wer

ful s

yste

ms

avai

labl

e––

Rea

sona

ble

cost

, new

pos

sibi

litie

sR

easo

nabl

e co

st, n

ew p

ossi

bilit

ies