juhan kim, the kias cac january 23, 2011. to build linux cluster from the commodity hardware like...

37
How to Build Parallel Computers in a Lab Juhan Kim, The KIAS CAC January 23, 2011

Post on 19-Dec-2015

217 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

How to Build Parallel Computers in a Lab

Juhan Kim, The KIAS CAC

January 23, 2011

Page 2: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Moor’s Law

Court

esy

of

Gre

g B

runo

Page 3: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

The Cheapest Way to Get a Super Computer To build linux cluster

from the commodity hardware like cpu (x-eon,amd), network (gi-gabit,infiniband), etc

Page 4: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Overview of the TOP500.org http://www.top500.org/ Top 500 supercomputers from 1993 –

now Lists in order of LINPACK test (double

precision) Rpeak, Rmax

Page 5: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Top500 : Countries

Page 6: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Top500 : processor types

Page 7: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Top500 : network

Page 8: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Top500 : Operating systems

Page 9: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Top500: Applications

Page 10: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Top500: # of CPU cores

Page 11: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Cluster Hardware CPU

AMD Opteron Pentium (Xeon) Itanium2 benchmark test: http://www.spec.org/cpu/

12 integer benchmarks Video compression, quantum computing, path-finding, combinartorial optimization,

discrete event simulation, etc. 17 floating-point benchmarks

Fluid dynamics, quantum chemistry, general relativity, ray tracing, quantum chem-istry, weather prediction, biochemestry, etc.

mean of 12/17 resulting speeds/throughputs of integer/floating-point calculation No I/O & No network communication Spec[xx]2006 : to measure the speed of a core Spec[xx]2006_rate : to measure the throughput of a chip/chips

Network Commodity 1G/10G Infiniband

Page 12: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Intel X86

Most widely used processor

EM64T : 64bit sup-ported

Fastest integer arithmetic process

Fastest floating point arithmetic process

Page 13: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

AMD Opteron

x86_64 architec-ture

Good performance/cost in floating-point & integer arithmetic process

Page 14: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Itanum2

First released in 2001

Impressive Linpack efficiecy: ~90%

Page 15: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

SpecFP2006_rate

W300,000

W500,000

Page 16: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

SpecFP2006 rate

W1,000,000

$700

Page 17: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Processor GHz Cores SPECint2006

SPECfp2006 Price

Xeon E5620 2.4 4 31.6 37 0.5 MW

Opteron 4184 2.8 6 25.2 34.2 350 USD

Opteron 6174 2.2 12 21.5 30.7 1.5 MW

Intel Core i7-940 2.9 4 30.8 31.3 0.8 MW

Intel Itanium2(18MB)

1.6 2 11.5 17.7 700 USD

IBM Power7 (780)

- 16 44 (4.14GHz) 71.5 (3.86GHz) ???

Processor Summary (Processor Speed)

•Performance: • IBM Power >> Xeon > Opteron 4184 > Core i7 > Opteron 6174 > Ita-nium2

•Performance/cost :• Xeon > Opteron 4184 > Core i7 > Opteron 6174 > Itanium2

Page 18: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Processor GHz (Gflops)

Ncore Nthread

SPECint2006_rate/core

SPECfp2006_rate/core

Xeon E5620 2.4(38.4) 4 2 27.8 21.1(84.8/chip)

Opteron 4184 2.8(67.2) 6 1 19.8 15.1(90.6/chip)

Opteron 6174 2.2 (105.6) 12 1 16.1 13.0(156/chip)

Intel Core i7-940 2.9 (46.4) 4 2 29 21.5(86/chip)

Intel Itanium2(18MB)

1.6 (12.8) 2 1 11.7 7.25(14.5/chip)

IBM Power7 (780)

3.72 (89.3 (?))

6 4 40 35.5(142/chip)

Processor Summary (multi processing speed)

Page 19: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

200

400

600

800

1000

1200

1400

1600

1800

2000

0

20

40

60

80

100

120

140

SpecColumn1

Performance(thoughput)/cost

I7-860/920: best perf/costOpteron series: show a good performance/costI7-940, x5670: poor ratio

E5620I7-940

X5670

I7-860

Opteron6172Opteron6136

E5630x5550

I7-920

8cores

12cores

Page 20: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Off-the-self computer boxes Dell PowerEdge R610

1U rack mount 2-way Xeon E5620

(2.4Ghz, 4cores) 24GB Memory 160GB HardDisk 3,075 USD (5.38MWon) Specfp2006: 37 Specfp2006_rate: 169 Xeon E5620 vs. E5506

(2.13Ghz) 20 % cheaper 30% slower

Page 21: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Dell Opteron server

Dell PowerEdge R815 1U rack mount 2-way Opteron 6128

(2Ghz, 8cores) 32GB Memory 250GB disk 4,556 USD Specfp2006: 28 Specfp2006_rate: 233 Opteron 6128 vs. 6136

(2.4Ghz) 40% more expensive 20 % faster

Page 22: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

IBM Xeon Server

IBM x3650 M3 2-way Xeon E5620

(2.4Ghz, 4cores) 2U rack mount 24GB mem 146GB Disk 5,958 USD Specfp2006: 37 Specfp2006_rate:

169

Page 23: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

IBM Opteron server

IBM x3755 M3 3U rack mount 2-way Opteron 6128

(2Ghz, 8cores) 24GB Memory 300GB disk 7,729 USD Specfp2006: 28 Specfp2006_rate: 233 Opteron 6128 vs. 6136

(2.4Ghz) 10% more expensive 20 % faster

Page 24: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

IBM Power7 server IBM Power 710 Express

1-way socket Power7/ 3.Ghz (4cores) 4GB memory 150GB disk Specfp2006_rate: ??? 5,852 USD 8,120 USD (6cores,3.7Ghz;

16GB) 14,620 USD (8cores,

3.55Ghz; 16GB) Specfp2006_rate

213 (6core, 3.72Ghz, 8,120 USD)

248 (8cores, 3.55Ghz, 14,620 USD)

Page 25: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Comparisons

Throughput/cost1. Dell PowerEdge Xeon R610 : 0.055/$2. Dell PowerEdge Opteron R815 : 0.051/$3. IBM Opteron x3755 : 0.03/$4. IBM Xeon x3650 : 0.028/$5. IBM Power7 6-core : 0.026/$

Page 26: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Network

Page 27: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Myrinet

http://www.myri.com/myrinet/prod-uct_list.html

Throughput : 800 MB/s Latency: 2.7μs Cost: 20,000 USD for 16port

1,250 USD/port ~ 1.4 Mwon/port

Page 28: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Infiniband network (latency: 0.1 μsecond)

•Throughput (QDR): 40Gbps•Infiniband Chassis: 50MWon•Leaf (18port) : 10MW•Adapter card: 1MW•Cable: 0.8MW

•64nodes: 255MW•32nodes : 130MW (?)•Cost/port = 4MW/port

Page 29: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Quadrics

No more available Closed in 2009

Page 30: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

System Monitoring Tool Ganglia

Using lm_sensors CPU load, network usage,

Disk usage, temperature Providing statistics (daily,

weekly, monthly usages) How to install: http://

blog.naver.com/PostView.nhn?blogId=me2you2&logNo=100037667623&redirect=Dlog&widgetTypeCall=true

Page 31: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Adding temperature monitoring script to Ganglia# cat > /etc/

cron.d/cputemp

0-59/1 * * * * root /pack-ages/exam-ples/cputemp

^D#

#!/bin/bash# author: Mike Snitzer <[email protected]># desc: used to make lm_sensors metrics available to ganglia# /etc/sysconfig/ganglia is used to specify INTERFACECONFIG=/etc/sysconfig/ganglia[ -f $CONFIG ] && . $CONFIG

#default to eth0if [ -z "$MCAST_IF" ]; then MCAST_IF=eth0fiGMETRIC_BIN=/usr/bin/gmetric# establish a base commandline#GMETRIC="$GMETRIC_BIN -i $MCAST_IF"GMETRIC="$GMETRIC_BIN”SENSORS=/usr/bin/sensors# load the lm_sensors modulesmodule=`/sbin/lsmod | awk '{print $1}' | grep i2c-piix4`if [ -z "$module" ]; then /sbin/modprobe i2c-piix4 # lm87 is for supermicro P3TDLE, replace when appropriate /sbin/modprobe lm87 /sbin/modprobe coretempfi# send cpu temps if gmond is running`/sbin/service gmond status > /dev/null`if [ $? -eq 0 ]; then # send cpu temperatures let count=0 for temp in `${SENSORS} | grep Core | cut -b 8-18`; do $GMETRIC -t float -n "cpu${count}_temp" -u "C" -v $temp let count+=1 done

# send cpu fan speed let count=0 for fan in `${SENSORS} | grep fan1 | cut -b 9-17`; do $GMETRIC -t uint32 -n "cpu${count}_fan" -u "RPM" -v $fan let count+=1 donefi

Page 32: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Research Funds available in Ko-rea

http://www.nrf.re.kr/html/kr/busi-ness/business_01_01_02.html

Page 33: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Danawa Linux CPU Cluster (33n-odes)

Installation: Jan. 2010 Intel i7 Bloomfield 920 Single socket, 8GB memory 1G network SpecFP2006_rate= 78.9/4cores 4 cabinets (44U) 4 X 3kW UPS Danawa price: 51MWon (

http://www.danawa.com/) Company-proposed price:

150MWon 1month assembling by one per-

son 1month system installing by one

person Need a lot of man power

Page 34: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

GPU Cluster @ CAC 25 sets of CPU+GPU

machines Peak performance ~

10x64(dual socket, quad core) node CPU clusters

Total cost ~ 1.5 x 64 node CPU cluster

CPU nodes: IBM x3550 (dual Xeon E3640, 24 GB, 3x1Gbps)

20KWatt power

Page 35: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Fund vs. Cluster size

1. 32node Danawa cluster (SPecfp2006_rate: 2560)2. 64node IBM Xeon cluster (8960) + 3year warranty3. 32node Dell Opteron cluster (7488)4. 64node Dell Opteron cluster (14976)5. 24node Nvidia S2050 + IBM Xeon cluster (10x#2)

1 23 4 4i2i3i

5

Page 36: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Conclusion

Spec2006/Rpeak Xeon E5620> i7-940 > Power7 >

Opteron >> Itanium Spec2006/cost

i7 series>Xeon E-series>Opteron>Xeon X-series>>Itanium

Cost IBM > Dell >> Danawa

Page 37: Juhan Kim, The KIAS CAC January 23, 2011.  To build linux cluster from the commodity hardware like cpu (xeon,amd), network (gigabit,infiniband), etc

Conclusion 2/ Discussion

Interconnection Infiniband/Myrinet:

40/20% of total cost based on the IBM/Dell Cluster 80/60 % of total cost based on the Danawa Cluster

Disk Storage Lustre File System: a few 10s Mwon for 10TB

one (a few?) MDS (Meta Data Server) + many OSS (Object Storage Server)

SAN : very expensive