linux advanced managementblog.syszone.co.kr/attachment/2867923805.pdf6. hardware control remote...

70
IBM System x Enterprise Architecture Linux Advanced Management - 1 - Linux Advanced Management Kim Wan Hee , FTSS Last Update 2007/07/24

Upload: others

Post on 09-Apr-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 1 -

Linux Advanced Management

Kim Wan Hee , FTSS

Last Update 2007/07/24

Page 2: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 2 -

Index

1. xCAT (Extreme Cluster Administrat ion Toolkit ) 05

1.1. xCAT ์„ค์น˜ 07

1.2. xCAT ํ™˜๊ฒฝ ์„ค์ • 07

1.3. Time Service ๋“ฑ๋ก 08

1.4. Compute Node SOL (Serial Over LAN) Enable 08

1.5. xCAT Config File 09

1.6. xCAT Node Installation Process 10

1.7. ๋ชจ๋“  ๋…ธ๋“œ์˜ ๊ตฌ์„ฑ 11

1.8. /etc/hosts ํŒŒ์ผ ์ •์˜ 11

1.9. Blade Center Node Installation 11

1.10 xCAT ์ค‘์š” ๋ช…๋ น์–ด 21

2. HPC Software Stack Instal lat ion 24

2.1 Intel C Compiler 25

2.2 Intel Fortran Compiler 25

2.3 Intel MPI 25

2.4 Intel MKL (Math Kernel Library) 26

2.5 PBS Pro 26

2.6 PBS Pro Check 26

3. HPL Compile, Instal lat ion and Running BMT 28

3.1 HPC ์‹œ์Šคํ…œ์˜ ์„ฑ๋Šฅ์ˆ˜์น˜ - FLOPS 28

3.2 ์ด๋ก ์„ฑ๋Šฅ vs ์‹ค์ œ์„ฑ๋Šฅ โ€“ Rpeak vs Rmax 28

3.3 HPL (High Performance Linpack) 28

3.4 ์„ ํ˜•๋Œ€์ˆ˜ โ€“ BLAS , ATLAS 29

3.5 MPI (Message Passing Interface) 29

3.6 MPICH and LIbrary Installation 30

3.6.1 ์‚ฌ์ „ ์‹œ์Šคํ…œ ๊ตฌ์„ฑ ํ™˜๊ฒฝ 30

3.6.2 MPICH Installation 30

3.6.3 ATLAS Installation 34

3.6.4 ATLAS Cache Size 34

3.6.5 GotoBLAS Installation 35

3.6.7 HPL Installation 37

3.7 Run HPL 40

3.7.1 Compile 40

3.7.2 Send the Execution file 40

3.7.3 Preparation for MPI Job 40

3.7.4 xhpl ์‹คํ–‰ 40

Page 3: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 3 -

4 GPFS (General Parallel File System) 43

4.1 GPFS ๊ฐœ์š” 43

4.2 GPFS ํ™˜๊ฒฝ 43

4.3 GPFS ํŠน์„ฑ 44

4.4 GPFS (GENERAL PARALLEL FILE SYSTEM) ์„ค์น˜ 45

4.4.1 GPFS ์„ค์น˜ ์ „ ์ค€๋น„์‚ฌํ•ญ 45

4.4.2 GPFS ์„ค์น˜ ๋ฐ Portability Layer Buildํ•˜๊ธฐ 45

4.5 GPFS (GENERAL PARALLEL FILE SYSTEM) ๊ตฌ์„ฑ 46

4.5.1 GPFS test ์‹œ์Šคํ…œ ๊ตฌ์„ฑ๋„ 46

4.5.2 GPFS ๊ตฌ์„ฑ ์‹œ ๊ณ ๋ ค์‚ฌํ•ญ 47

4.5.3 GPFS Cluster ๊ตฌ์„ฑ 48

4.6 GPFS (GENERAL PARALLEL FILE SYSTEM) ์šด์˜ ๋ฐ ๊ด€๋ฆฌ 50

4.6.1 GPFS Filesystem ํ™•์ธ ๋ฐ ๊ด€๋ฆฌ 50

4.6.2 GPFS Node ๊ด€๋ฆฌ 53

4.6.3 GPFS Cluster config๋ณ€๊ฒฝ 54

4.7 GPFS(GENERAL PARALLEL FILE SYSTEM) ๊ตฌ์„ฑ๋ณ€๊ฒฝ ๋ฐ ์žฅ์• ๋ณต๊ตฌ 55

4.7.1 GPFS IP ๋ฐ host name ๋ณ€๊ฒฝ 55

4.7.2 GPFS cluster configuration data ํŒŒ์ผ ๋ณต๊ตฌ 56

5. XEN 57

5.1 Xen 57

5.2 Xen ์—์„œ ์ง€์›ํ•˜๋Š” ๊ฐ€์ƒํ™” ํ˜•ํƒœ 57

5.3 Xen ์˜ ๊ธฐ๋ณธ ๊ตฌ์กฐ ๋ฐ Memory Ballooning ์ด๋ž€? 57

5.4 Xen ๊ตฌ์„ฑ ์‹œ ํ™•์ธ ์‚ฌํ•ญ 58

5.5 virt-install ๋ช…๋ น์œผ๋กœ Xen para-virtualized guest install 58

5.6 virt-manager ๋ช…๋ น์œผ๋กœ Xen para-virtualized guest install 61

5.7 Virtual system Management 67

5.8 Command Base Management 70

Page 4: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 4 -

๋ณธ ๋ฌธ์„œ๋Š” IBM Residency Program ์— ์ฐธ์—ฌํ•˜์—ฌ ์ฃผ์‹  Business Partner ๋ถ„๋“ค๊ป˜์„œ ๋งŒ

๋“œ์‹  ์ž๋ฃŒ ์ž…๋‹ˆ๋‹ค. ๋ฐ”์˜์‹  ์‹œ๊ฐ„์ค‘์—๋„ 5์ผ๊ฐ„ ๊ต์œก์— ์ฐธ์„ ํ•ด์ฃผ์‹  Engineer ๋ถ„๋“ค๊ป˜ ๊ฐ

์‚ฌ ๋“œ๋ฆฝ๋‹ˆ๋‹ค.

Page 5: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 5 -

1. xCAT (Extreme Cluster Administration Toolkit)

xCAT (Extreme Cluster Administration Toolkit)์€ Linux Cluster์™€ ๊ฐ™์ด ๋งŽ์€

๋™์ผํ•œ ํ•˜๋“œ์›จ์–ด ์ปดํฌ๋„ŒํŠธ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ํ™˜๊ฒฝ์—์„œ ๊ด€๋ฆฌ์ž๊ฐ€ Build, Configure,

Administer, ๊ทธ๋ฆฌ๊ณ  Maintain ์ž‘์—…์„ ์ค‘์•™ ์ง‘์ค‘ํ™”๋œ ํ™˜๊ฒฝ์—์„œ ํšจ์œจ์ ์œผ๋กœ ๊ด€๋ฆฌ

๊ฐ€๋Šฅ์ผ€ ํ•˜๋Š” Perl/Unix Shell Script Base์˜ Tool์ž…๋‹ˆ๋‹ค. ์•„๋ž˜์™€ ๊ฐ™์€ ์‹œ์Šคํ…œ ํ™˜

๊ฒฝ์—์„œ ๊ตฌํ˜„์ด ๊ฐ€๋Šฅํ•˜๋‹ค.

1. High Performance (HPC): such as computing physics, seismic, CFD, FEA, weather, bioinformatics and other simulations. 2. Horizontal Scaling (HS): such as Web farms. 3. Administrative: A very convenient platform, although non-traditional, to install and administer a number of Linux machines. 4. Microsoft Windows and other Operating Systems: With xCAT's cloning and imaging support, it can be used to rapidly deploy and conveniently manage clusters with compute nodes that run Windows or any other operating system.

1. OS/Distribution support Any OS on compute nodes via OS agnostic imaging support. 2. Hardware Control Remote Power control (on/off state) through IBM Management Processor Network, BMC, and/or APC Master Switch. 3. Hardware Control Remote software reset (rpower). 4. Hardware Control Remote Network BIOS/firmware update and configuration on IBM hardware. 5. Hardware Control Remote OS console with pluggable support for a number terminal servers. 6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers. 7. Boot Control Ability to remotely change boot type (network or local disk) with syslinux. 8. Automated parallel install using scripted RedHat kickstart, SUSE Linux autoyast, on ia32, x86_64, ppc, and ia64. 9. Automated parallel install using imaging with other Linux distributions, Windows, and other operating systems. 10. Automated network installation with supported PXE NICs, with etherboot or BootP on supported NICs without PXE. 11. Monitoring hardware alerts and email notification with IBM's Management Processor Network and SNMP alerts. 12. Monitoring remote vitals such as fan speed, temperature, and more with IBM's Management Processor Network. 13. Monitoring remote hardware event logs with IBM's Management Processor Network/IPMI Interface. 14. Administration utilities such as parallel remote shell, ping, rsync, and copy. 15. Administration utilities such as remote hardware inventory with IBM's Management Processor Network. 16. Software Stack PBS and Maui schedulers to build scripts, documentation, automated setup, extra related utilities, and deep integration. 17. Software Stack Myrinet to automate setup and installation. 18. Software Stack MPI to build scripts, documentation, and automated setup for MPICH, MPICH-GM, and LAM. 19. Usability command line utilities for all cluster management functions. 20. Usability single operations can be applied in parallel to multiple nodes with very flexible and customizable group/range functionality. 21. Flexible support for various user defined node types. 22. Diskless support using warewulf.

Page 6: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 6 -

<xCAT์„ ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•œ ๊ธฐ๋ณธ HPC ํ™˜๊ฒฝ>

<HPC๋ฅผ ๊ตฌ์„ฑํ•˜๊ณ  ์žˆ๋Š” ์ค‘์š” ์ปดํฌ๋„ŒํŠธ>

Management node -- Management node functionality can reside on a single server or multiple servers. In a single server environment, the management server operates in standalone mode. The management node is connected to the compute nodes over the Management VLAN on the switch. The management server is where the xCAT software is installed and is used exclusively to control the compute nodes using various xCAT functionalities: OS installation, remote power control, firmware updates, Serial-over-LAN, etc. In our conceptual cluster, there is one management node. User/login nodes -- Ideally, the compute nodes of a cluster should not accept external connections and should only be accessible to system administrators through the management server. System users can log in to user nodes (or login nodes) in order to run their workloads on the cluster. Each user node consists of an image with full editing capabilities, the required development libraries, compilers, and everything else required to produce a cluster-enabled application and retrieve results. Storage servers and disks -- You can connect several storage servers to a diskbased backend using various mechanisms. Connecting storage to the cluster can be direct or through a storage area network (SAN) switch, either by fiber, copper, or a mixture of the two. These servers provide shared storage access to the other servers within the cluster. If data backup is required, connect

xCAT Server

Page 7: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 7 -

the backup device to the storage server using an extra copper or fiber link. For the example cluster, the storage back end is a single entity, providing shared file system access across the cluster. Compute nodes -- These nodes run the cluster workload, accepting jobs from the scheduler (if a job scheduler is used on the cluster). The compute nodes are the most disposable part of the cluster. The system administrator can easily reinstall or reconfigure them using the management server.

1.1. xCAT ์„ค์น˜

http://www.alphaworks.ibm.com/tech/xCAT ์—์„œ Download Link ์—์„œ ๊ด€๋ จ ํŒŒ์ผ Downloadํ•จ

- ๋ฐ›์€ ํŒŒ์ผ์„ /opt ์— ํ’€์–ด์คŒ

# cd /opt

# tar zxvpf /tmp/xcat-dist-core-1.2.0.tgz

# tar zxvpf /tmp/xcat-dist-oss-1.2.0.tgz

# tar zxvpf /tmp/xcat-dist-ibm-1.2.0.tgz (extract only if you have IBM xSeries nodes)

cp /opt/xcat/samples/etc/* /opt/xcat/etc

1.2. xCAT ํ™˜๊ฒฝ ์„ค์ •

#export XCATROOT=/opt/xcat

#cd $XCATROOT/sbin

#./setupxcat

#copycds ์„ค์น˜ ํ•  OS image๋ฅผ ์ €์žฅํ•จ.(xCAT์„ค์น˜ ํ›„ ์–ด๋Š ๋‹จ๊ณ„์—์„œ๋‚˜ ์ง„ํ–‰ ๊ฐ€๋Šฅ)

[root@x305 sbin]# cat /root/.bash_profile

# .bash_profile

# Get the aliases and functions

if [ -f ~/.bashrc ]; then

. ~/.bashrc

fi

# User specific environment and startup programs

PATH=$PATH:$HOME/bin

BASH_ENV=$HOME/.bashrc

USERNAME="root"

export USERNAME BASH_ENV PATH

# xCAT Profile

export XCATROOT=/opt/xcat

export XCATPREFIX=/opt/xcat

export PATH=$PATH:$XCATROOT/bin:$XCATROOT/sbin /etc/man.config ์— [ MANPATH /opt/xcat/man ] ์ถ”๊ฐ€

Page 8: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 8 -

1.3. Time Service ๋“ฑ๋ก

# mv -f /etc/ntp.conf /etc/ntp.conf.ORIG

# touch /etc/ntp.conf

# vim /etc/ntp.conf

-------------------------------------

server 127.127.1.0

fudge 127.127.1.0 stratum 10

driftfile /etc/ntp/drift

-------------------------------------

* Time์„œ๋ฒ„์—์„œ์˜ Timezone ์„ค์ •

setup

setclock OR hwclock -w

chkconfig --add ntpd

service ntpd start

* NTP ํด๋ผ์ด์–ธํŠธ ์„ค์ •

server mgt.cluster.com

driftfile /etc/ntp/drift

multicastclient

broadcastdelay 0.008

authenticate no

keys /etc/ntp/keys

trustedkey 65535

requestkey 65535

controlkey 65535

ntp์„œ๋ฒ„์—์„œ /etc/ntp๋””๋ ‰ํ† ๋ฆฌ์•ˆ์— ์žˆ๋Š” driftํŒŒ์ผ ๋ณต์‚ฌ

* NTP ๋ฐ๋ชฌ ํ…Œ์ŠคํŠธ

[root@blade329 STREAM]# ntpstat

synchronised to NTP server (172.20.0.1) at stratum 12

time correct to within 23 ms

polling server every 128 s

1.4. Compute Node SOL (Serial Over LAN) Enable

For IBM System x 3550, 3650, and 3455 Flash the Management Processor (BMC) to the latest version. Flash BIOS to the latest version. Remove the power cord for 10 seconds. Restore the power cord. Reboot and press F1 to enter the BIOS configuration. Configure the BIOS settings for optimum performance and then edit the Devices and I/O Ports as shown below: Devices and I/O Ports โ€ข Serial port A: Port 3F8, IRQ 4 โ€ข Serial port B: Disabled โ€ข Remote Console Redirection o Remote Console Active: Enabled o Remote Console COM Port: COM 1 o Remote Console Baud Rate: 19200 To use Remote Console Text Emulation: VT100/VT220 Configure the text emulation settings as listed below:

Page 9: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 9 -

โ€ข Remote Console Keyboard Emulation: VT100/VT220 โ€ข Remote Console After Boot: Enabled โ€ข Remote Console Flow Control: Hardware Configure the โ€œStartupโ€ settings as listed below: From the main menu, select Startup Sequence and configure the settings as follows: โ€ข First Startup Device: CD ROM โ€ข Second Startup Device: Diskette Drive 0 โ€ข Third Startup Device: Network โ€ข Forth Startup Device: Hard Disk โ€ข Wale On LAN: Disabled โ€ข Planer Ethernet PXE/DHCP: Planer Ethernet 1 43 โ€ข Boot Fail Count: Disabled

For IBM System BladeCenter Server Flash the Management Processor (BMC) to the latest version. Flash BIOS to the latest version. Remove the power cord for 10 seconds. Restore the power cord. Reboot and press F1 to enter the BIOS configuration. Configure the BIOS settings for optimum performance and then edit the Devices and I/O Ports as shown below: Devices and I/O Ports โ€ข Serial port A: Auto configured โ€ข Serial port B: Auto configured โ€ข Remote Console Redirection o Remote Console Active: Enabled o Remote Console COM Port: COM 2 o Remote Console Baud Rate: 19200 To use Remote Console Text Emulation: VT100/VT220 Configure the text emulation settings as listed below: โ€ข Remote Console Keyboard Emulation: VT100/VT220 โ€ข Remote Console After Boot: Enabled โ€ข Remote Console Flow Control: Hardware Configure the โ€œStartupโ€ settings as listed below: From the main menu, select Startup Sequence and configure the settings as follows: โ€ข First Startup Device: CD ROM โ€ข Second Startup Device: Diskette Drive 0 โ€ข Third Startup Device: Network โ€ข Forth Startup Device: Hard Disk โ€ข Wale On LAN: Disabled โ€ข Planer Ethernet PXE/DHCP: Planer Ethernet 1 43

โ€ข Boot Fail Count: Disabled

1.5. xCAT Config File

Cluster ์ •์˜, ํ™˜๊ฒฝ ๊ตฌ์„ฑํŒŒ์ผ์ด ์žˆ๋Š” ๊ฒฝ๋กœ๋Š” /opt/xcat/etc/* ์— ์žˆ์–ด์•ผ ํ•˜๋Š”๋ฐ ์••์ถ•์„ ํ’€๋ฉด etc

Directory ๋Š” ์—†์Šต๋‹ˆ๋‹ค. (์ด๋•Œ /opt/xcat/samples/etc/* ์— ์˜ˆ์ œ ํŒŒ์ผ์ด ์žˆ์œผ๋‹ˆ ์ฐธ๊ณ ํ•ด์„œ ๋งŒ๋“ค์–ด ์‚ฌ์šฉ์„

ํ•˜๋ฉด ๋ฉ๋‹ˆ๋‹ค.) ํ™˜๊ฒฝ์„ค์ •์˜ ๋” ์ž์„ธํ•œ ๋‚ด์šฉ์„ ์›ํ•œ๋‹ค๋ฉด ์•„๋ž˜์˜ Site ์˜ Redbook ์„ ์ฐธ๊ณ  ํ•˜๋ฉด ๋ฉ๋‹ˆ๋‹ค.

ํ™˜๊ฒฝ ๊ตฌ์„ฑ Directory ์—๋Š” ๊ฐ Node,Switch,Terminal server, Node NIC MAC ๋“ฑ xCat ๋‚ด๋ถ€ ๋…ธ๋“œ์˜ ๋ชจ

๋“  ์ •๋ณด๋“ค์ด ๋“ค์–ด ๊ฐ€๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.

Page 10: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 10 -

*๊ฐ ๊ตฌ์„ฑ ํŒŒ์ผ ์ด๋ฆ„ ๋ชฉ๋ก

site.tab / nodehm.tab / nodelist.tab / nodepos.tab / noderes.tab / nodetype.tab / passwd.tab / postscripts.tab /

postdeps.tab / snmptrapd.conf / networks.tab

mac.tab (loaded with non-collectable MACs, e.g. terminal servers, switches, RSAs, etc...)

*Terminal server ๊ด€๋ จ ๊ตฌ์„ฑ Table ์ด๋ฆ„ ๋ชฉ๋ก conserver.tab

conserver.cf

*MAC Address ์ •๋ณด๋ฅผ Cisco Switch ์—์„œ ์ˆ˜์ง‘์„ ํ•˜๊ธฐ ์œ„ํ•œ ๊ตฌ์„ฑ ์ •๋ณด

cisco.tab

summit48i.tab

blackdiamond.tab

*IBM eServer xSeries Management Processor(์˜ˆ RSA Card) mpa.tab

mp.tab

*APC Master Switch ๋ฅผ ์œ„ํ•œ ํ…Œ์ด๋ธ” apc.tab

*APC Master Switch Plus ๋ฅผ ์œ„ํ•œ ํ…Œ์ด๋ธ” apcp.tab

*xCAT flash support ๋ฅผ ์œ„ํ•œ ํ…Œ์ด๋ธ” nodemodel.tab

*EMP ์ง€์› ๊ตฌ์„ฑ ํ…Œ์ด๋ธ” emp.tab

*Baytech ์ง€์› ๊ตฌ์„ฑ ํ…Œ์ด๋ธ” baytech.tab

*xCat GPFS ์ง€์› ๊ตฌ์„ฑ ํ…Œ์ด๋ธ” gpfs.tab

*IPMI ์ง€์› ๊ตฌ์„ฑ ํ…Œ์ด๋ธ” (BMC๋ฅผ ์ด์šฉํ•œ node management) ipmi.tab

*Blade์„œ๋ฒ„ ์Šฌ๋กฏ์œ„์น˜์™€ switch port๋ฒˆํ˜ธ mapping ํ…Œ์ด๋ธ” bcnet.tab

1.6. xCAT Node Installation Process

[์„ค์น˜ ํ”„๋กœ์„ธ์Šค ๋‹ค์ด์–ด๊ทธ๋žจ]

Page 11: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 11 -

1.7. ๋ชจ๋“  ๋…ธ๋“œ์˜ ๊ตฌ์„ฑ

IANS:

Update all firmware to latest levels.

Configure firmware/BIOS/CMOS to NEVER prompt for anything.

Configure network to boot before HD.

Enable management processor.

Redirect POST/BIOS out serial if possible.

* rbootseq ๋ช…๋ น์œผ๋กœ Bladecenter boot order๋ณ€๊ฒฝ

* rbootseq noderange n,c,hd0,f

1.8. /etc/hosts ํŒŒ์ผ ์ •์˜

[root@mgt pxelinux.cfg]# vi /etc/hosts

# Do not remove the following line, or various programs

# that require network functionality will fail.

127.0.0.1 localhost.localdomain localhost ์—ฌ๊ธฐ hostname์ด ๋“ค์–ด๊ฐ€์ง€ ์•Š๋„๋ก ์ฃผ์˜

192.168.12.74 mgt mgt.cluster.com

192.168.70.125 bc1

192.168.70.127 sw1

192.168.70.128 sw2

192.168.12.1 blade1

192.168.12.2 blade2

192.168.12.3 blade3

192.168.12.4 blade4

1.9. Blade Center Node Installation

1) /opt/xcat/etc ์˜ ๊ตฌ์„ฑํŒŒ์ผ ํŽธ์ง‘

/opt/xcat/etc/site.tab Cluster ์ „์ฒด ํ™˜๊ฒฝ ๊ตฌ์„ฑ ํŽธ์ง‘

/opt/xcat/etc/nodelist.tab ๊ฐ Node ์˜ Group ํŽธ์ง‘

/opt/xcat/etc/noderes.tab ๊ฐ ๋…ธ๋“œ๋“ค์ด ์„ค์น˜ ๋ ๋•Œ tftp,nfs๋“ฑ์˜ ์„œ๋ฒ„ ๊ฒฝ๋กœ์ง€์ •

/opt/xcat/etc/nodetype.tab ๊ฐ Node ์˜ Type ์„ค์ •

(node๋ฅผ group์œผ๋กœ ์ •์˜ํ• ๋•Œ, nodelist ๋ฐ nodetype์— ๋“ฑ๋ก์„ ํ•ด์•ผ ํ•œ๋‹ค.)

/opt/xcat/etc/nodehm.tab Node Hardware Management ์„ค์ •

/opt/xcat/etc/nodemodel.tab ๊ฐ ๋…ธ๋“œ์˜ machine ์ •์˜

/opt/xcat/etc/passwd.tab ๊ฐ Management Node ์˜ Password ์„ค์ •

/opt/xcat/etc/mac.tab ๊ฐ ๋…ธ๋“œ์˜ Mac Address ์„ค์ •

/opt/xcat/etc/conserver.cf ๊ฐ ๋…ธ๋“œ์˜ conserver ์„ค์ •

/opt/xcat/etc/conserver.tab conserver์˜ํ•ด ๊ด€๋ฆฌ๋  ๋…ธ๋“œ๋ชฉ๋ก ์„ค์ •

/opt/xcat/etc/mpa.tab Management processor Adapter ๋ฐ module ์ •์˜

Page 12: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 12 -

/opt/xcat/etc/mp.tab Management processor Network์˜ ํ† ํด๋กœ์ง€ ์ •์˜

/opt/xcat/etc/bcnet.tab blade์„œ๋ฒ„ slot number์™€ blade switch๋ฒˆํ˜ธ ์ •์˜

/opt/xcat/etc/networks.tab dhcp์—์„œ ์ •์˜ํ•˜๋Š” ๋ชจ๋“  ๋„คํŠธ์›Œํฌ๋ฅผ ์ •์˜

* /opt/xcat/etc/์•ˆ์— ํŒŒ์ผ ์ˆ˜์ • ํ›„ setupxcat๋ช…๋ น์„ ์‹คํ–‰ํ•ด์•ผ ๋ณ€๊ฒฝ๋œ ์„ค์ •์ด ๋ฐ˜์˜๋จ

[root@mgt etc]# cat site.tab sshkeyver 1

rsh /usr/bin/ssh

rcp /usr/bin/scp

gkhfile /opt/xcat/etc/gkh

tftpdir /tftpboot

tftpxcatroot xcat

domain cluster.com

dnssearch cluster.com

nameservers 192.168.12.74

forwarders NA

nets 192.168.12.0:255.255.255.0,192.168.70.0:255.255.255.0

dnsdir /var/named

dnsallowq NA

domainaliasip NA

mxhosts mailhosts.cluster.com

mailhosts mgt

master mgt

homefs mgt:/home

localfs mgt:/usr/local

pbshome /var/spool/pbs

pbsprefix /opt/pbs

pbsserver mgt

scheduler maui

xcatprefix /opt/xcat

keyboard us

timezone US/Eastern

offutc -5

mapperhost mgt

serialmac 0

serialbps 19200

snmpc public

snmpd 192.168.12.74

poweralerts Y

timeservers mgt

logdays 7

installdir /install

clustername wopr

dhcpver 3

dhcpconf /etc/dhcpd.conf

dynamicr eth0,ia32,192.168.12.74,255.255.255.0,192.168.12.1,192.168.12.254

usernodes NA

usermaster mgt

nisdomain NA

nismaster NA

nisslaves NA

homelinks NA

chagemin 0

chagemax 60

chagewarn 10

Page 13: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 13 -

chageinactive 0

mpcliroot /opt/xcat/lib/mpcli

ipmimaxp 16

ipmitimeout 30

dpcliroot /opt/xcat/i686

ipmitool /opt/xcat/i686/sbin/ipmitool

myrimask 255.255.0.0

mpicli2root /opt/xcat/lib/mpicli2

ipmiretries 2

xcatdport 3001

[root@mgt etc]# cat nodelist.tab #node1 all,compute,opteron,rack1

blade1 all,compute,blade,box1

blade2 all,compute,blade,box1

blade3 all,compute,blade,box1

blade4 all,compute,blade,box1

sw1 sw

bc1 bc

[root@mgt etc]# cat noderes.tab all mgt,mgt,/install,1,N,N,N,N,N,N,N,eth0,eth0

[root@mgt etc]# cat nodetype.tab blade1 rhas4,x86_64,blade

blade2 rhas4,x86_64,blade

blade3 rhas4,x86_64,blade

blade4 rhas4,x86_64,blade

[root@mgt etc]# cat nodehm.tab #node1 openipmi,openipmi,NA,openipmi,NA,conserver,NA,openipmi,rcons,pxe,bcm5700,vnc,Y,openipmi,NA,9600

blade1 mp,mp,NA,mp,mp,conserver,mp,mp,bcmm,pxe,bcm5700,NA,N,mp,mp,NA Blade Center์šฉ

blade2 mp,mp,NA,mp,mp,conserver,mp,mp,bcmm,pxe,bcm5700,NA,N,mp,mp,NA

blade3 mp,mp,NA,mp,mp,conserver,mp,mp,bcmm,pxe,bcm5700,NA,N,mp,mp,NA

blade4 mp,mp,NA,mp,mp,conserver,mp,mp,bcmm,pxe,bcm5700,NA,N,mp,mp,NA

node001 ipmi,ipmi,ipmi,ipmi,ipmi,conserver,NA,ipmi,switch,pxe,bcm5700,vnc,Y,ipmi,NA,19200 BMC์šฉ

node01 mp,mp,mp,mp,mp,conserver,rcons,mp,rcons,pxe,eepro100,vnc,N RSA์šฉ

bc1 NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,N,NA,NA

[root@mgt etc]# cat nodemodel.tab blade1 blade

blade2 blade

blade3 blade

blade4 blade

[root@mgt etc]# cat mac.tab blade1-eth0 00:11:25:9E:A7:BA

blade1-eth1 00:11:25:9E:A7:BB

blade2-eth0 00:11:25:9E:A7:B0

blade2-eth1 00:11:25:9E:A7:B1

Page 14: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 14 -

blade3-eth0 00:11:25:9E:A7:DE

blade3-eth1 00:11:25:9E:A7:DF

blade4-eth0 00:11:25:9E:AA:A2

blade4-eth1 00:11:25:9E:AA:A3

[root@mgt etc]# cat conserver.cf LOGDIR=/var/log/consoles

blade1:|sol.bc blade1::&: Blade Center์šฉ

blade2:|sol.bc blade2::&:

blade3:|sol.bc blade3::&:

blade4:|sol.bc blade4::&:

node001:|sol.e326 node001::&: BMC์šฉ

node01:!ts01:3001:&: RSA์šฉ

%%

trusted: 127.0.0.1

[root@mgt etc]# cat conserver.tab # conserver.tab

#

# This table defines the relationship between nodes and console servers.

#

# <conserver> hostname of the console server

# <console> name of the console that corresponds to conserver.cf

#<node> <conserver>,<console>

blade1 localhost,blade1

blade2 localhost,blade2

blade3 localhost,blade3

blade4 localhost,blade4

conserver.tab~ mpa.tab nodehm.tab~ nodetype.tab site.tab.ORG

[root@mgt etc]# cat mpa.tab

# mpa.tab

#

# This table lists the Management Processor Adapters and Modules.

#

# <type> asma,rsa,rsa2,bc

# <name> internal name (*)

# <number> internal number,unique and >10000 (asma, rsa only)

# <command> telnet,mpcli,http(hawk only) (bc must be http)

# <reset> http(hawk only),mpcli,NA

# <rvid> telnet(asma only),NA

# <dhcp> Y/N (rsa only)

# <gateway> default gateway IP address or NA or using DHCP

# <dns> DNS server or NA if using DHCP

#

# NOTE: command and reset method should not be the same, if the

# command interface is locked of unavailable, a different

# interface must be used. Please read the

# managementprocessor-HOWTO.html

#

# (*)internal name should be the node name if the mpa is the

# primary management adapter for that node

Page 15: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 15 -

#

#<node> <type>,<name>,<number>,<command>,<reset>,<rvid>,<dhcp>,<gateway>,<dns>

#

bc1 bc,bc1,NA,http,http,mpcli,NA,NA,NA

[root@mgt etc]# cat mpa.tab # mpa.tab

#

# This table lists the Management Processor Adapters and Modules.

#

# <type> asma,rsa,rsa2,bc

# <name> internal name (*)

# <number> internal number,unique and >10000 (asma, rsa only)

# <command> telnet,mpcli,http(hawk only) (bc must be http)

# <reset> http(hawk only),mpcli,NA

# <rvid> telnet(asma only),NA

# <dhcp> Y/N (rsa only)

# <gateway> default gateway IP address or NA or using DHCP

# <dns> DNS server or NA if using DHCP

#

# NOTE: command and reset method should not be the same, if the

# command interface is locked of unavailable, a different

# interface must be used. Please read the

# managementprocessor-HOWTO.html

#

# (*)internal name should be the node name if the mpa is the

# primary management adapter for that node

#

#<node> <type>,<name>,<number>,<command>,<reset>,<rvid>,<dhcp>,<gateway>,<dns>

#

bc1 bc,bc1,NA,http,http,mpcli,NA,NA,NA Blade Center์šฉ

bmc001 bmc,bmc001,NA,mpcli,mpcli,mpcli,NA,172.29.0.1,NA BMC์šฉ

rsa01 rsa,rsa01,10001,mpcli,mpcli,NA,N,NA RSA์šฉ

[root@mgt etc]# cat mp.tab # mp.tab

#

# This table describes the topology of the Management Processor Network. This

# enables xcat to contact the management processors via the mpa connected to

# the same chain.

#

# <mpa> name of the mpa connected to this node

# <name> internal name of the mp adapter (*)

#

# (*)this field should be NA if the mpa is the primary management

# adapter for that node

#

#<node> <mpa>,<name>

#node10 rsa1,node10

#

# This table maps nodes to node chassis.

#

# <bc> hostname of the node chassis management module.

# <bay> position of the node. DO NOT PAD WITH ZEROS.

#

Page 16: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 16 -

#<node> <bc>,<bay>

blade1 bc1,1 Blade Center์šฉ

blade2 bc1,2

blade3 bc1,3

blade4 bc1,4

node001 bmc001,node001 BMC์šฉ

node01 rsa01,node01 RSA์šฉ

[root@mgt etc]# cat bcnet.tab blade1 sw1,1

blade2 sw1,2

blade3 sw1,3

blade4 sw1,4

[root@mgt etc]# cat networks.tab # networks.tab

#

#network mask,gateway,dns,dns,dns,...

192.168.12.0 255.255.255.0,NA,192.168.12.74

192.168.70.0 255.255.255.0,NA,192.168.12.74

169.254.0.0 255.255.0.0,NA,NA

2) blade์„œ๋ฒ„ kickstart file ์ƒ์„ฑ(kicstart templateํŒŒ์ผ์„ blade.tmpl์ด๋ฆ„์œผ๋กœ ๋ณต์‚ฌํ•˜์—ฌ ์‚ฌ์šฉํ•œ๋‹ค.)

[root@mgt base]# cd /opt/xcat/install/rhas4/x86_64/base

[root@mgt base]# cp all.tmpl blade.tmpl

3) makedns -- site.tab ๋ฐ hostํŒŒ์ผ์„ ์ฐธ์กฐํ•˜์—ฌ named.confํŒŒ์ผ ์ƒ์„ฑ

[root@mgt xcat]# makedns

Stopping named: [ OK ]

named: no process killed

Starting named: [ OK ]

4) makedhcp โ€“ site.tab,networks.tab,hostsํŒŒ์ผ์„ ์ฐธ์กฐํ•˜์—ฌdhcp.confํŒŒ์ผ ์ƒ์„ฑ

[root@mgt xcat]# makedhcp โ€“new

updating existing /opt/xcat/etc/networks.tab

skipping 172.30.0.0 (entry exists)

skipping 172.29.0.0 (entry exists)

skipping 169.254.0.0 (entry exists)

skipping 172.20.0.0 (entry exists)

Saving orginal /etc/dhcpd.conf as /etc/dhcpd.conf.ORIG

added network eth0

added network eth1

updated /etc/sysconfig/dhcpd with DHCPD_INTERFACE or DHCPDARGS="eth0 eth1"

added network all

added subnet 172.20.0.0/255.255.0.0

added subnet 172.29.0.0/255.255.0.0

added subnet 172.30.0.0/255.255.0.0

added subnet 169.254.0.0/255.255.0.0

adding dynamic eth0

Shutting down dhcpd: [ OK ]

Starting dhcpd: [ OK ]

Page 17: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 17 -

5) mkstage -- stage1

[root@mgt stage]# cd /opt/xcat/stage/

[root@mgt stage]# ./mkstage

- mkstage๋ช…๋ น์œผ๋กœ /tftpboot/xcat๋””๋ ‰ํ† ๋ฆฌ์•ˆ์— pxelinux์œผ๋กœ os๋ฅผ ์„ค์น˜ํ•˜๊ธฐ ์œ„ํ•œ kernel๋ฐ ramdisk image, ์„ค์ •ํŒŒ์ผ๋“ค์ด ๋งŒ๋“ค

์–ด์ง

6) getmacs <nodereange> -- stage2

* blade์„œ๋ฒ„์˜ mac address๋ฅผ ์ˆ˜์ง‘ํ•˜์—ฌ mac.tab์— ์ถ”๊ฐ€ํ•œ๋‹ค.

[root@mgt stage]# getmacs box1

Please reset nodes: blade1 blade2 blade3 blade4 blade5 blade6 blade7 blade8 blade9

blade10 blade11 blade12 blade13 blade14

Press [Enter] when ready...

Saving output to mac-16754.lst in current directory /opt/xcat/stage.

blade1-eth0 00:14:5E:D5:D0:46

blade1-eth1 00:14:5E:D5:D0:48

blade10-eth0 00:14:5E:D5:E3:54

blade10-eth1 00:14:5E:D5:E3:56

blade11-eth0 00:14:5E:D5:E4:F2

blade11-eth1 00:14:5E:D5:E4:F4

blade12-eth0 00:14:5E:D5:CF:86

blade12-eth1 00:14:5E:D5:CF:88

blade13-eth0 00:14:5E:D5:D3:2E

blade13-eth1 00:14:5E:D5:D3:30

blade14-eth0 00:14:5E:D5:E2:28

blade14-eth1 00:14:5E:D5:E2:2A

blade2-eth0 00:14:5E:D5:DB:86

blade2-eth1 00:14:5E:D5:DB:88

blade3-eth0 00:14:5E:D5:D9:E8

blade3-eth1 00:14:5E:D5:D9:EA

blade4-eth0 00:14:5E:D5:DB:E6

blade4-eth1 00:14:5E:D5:DB:E8

blade5-eth0 00:14:5E:D5:D3:B2

blade5-eth1 00:14:5E:D5:D3:B4

blade6-eth0 00:14:5E:D5:E4:08

blade6-eth1 00:14:5E:D5:E4:0A

Auto merge mac-16754.lst with /opt/xcat/etc/mac.tab(y/n)? y

7) makedhcp --allmac

* ์ด์ „์— ๋งŒ๋“ค์—ˆ๋˜ dhcpd.confํŒŒ์ผ์— ๊ฐ host์˜ mac address๋ฅผ ์ถ”๊ฐ€ํ•˜์—ฌ ์žฌ ์ƒ์„ฑ

[root@mgt stage]# makedhcp --allmac

updating existing /opt/xcat/etc/networks.tab

skipping 172.30.0.0 (entry exists)

skipping 172.29.0.0 (entry exists)

skipping 169.254.0.0 (entry exists)

skipping 172.20.0.0 (entry exists)

updated /etc/sysconfig/dhcpd with DHCPD_INTERFACE or DHCPDARGS="eth0 eth1"

8) mpacheck โ€“l <mm range>(BladeCenter mm ์ฒดํฌ)

[root@mgt stage]# mpacheck -l bc1

bc1: Host IP Address: 172.30.101.5

bc1: Subnet mask: 255.255.0.0

Page 18: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 18 -

bc1: Gateway IP Address: 0.0.0.0

bc1: Data Rate: Auto

bc1: Duplex Mode: Auto

bc1: SNMP Traps: Enabled

bc1: SNMP Agent: Enabled

bc1: SNMP Community Name: public

bc1: SNMP IP Address 1: 0.0.0.0

9) mpname <noderange> -- stage3

* ๊ฐ blade๋“ค์˜ hardeware management processor๋ฅผ rename ํ•œ๋‹ค.

[root@mgt stage]# mpname box1

blade1: blade bc1,1 renamed from blade1 to blade1

blade10: blade bc1,10 renamed from blade10 to blade10

blade11: blade bc1,11 renamed from blade11 to blade11

blade12: blade bc1,12 renamed from blade12 to blade12

blade13: blade bc1,13 renamed from blade13 to blade13

blade14: blade bc1,14 renamed from blade14 to blade14

blade2: blade bc1,2 renamed from blade2 to blade2

blade3: blade bc1,3 renamed from blade3 to blade3

blade4: blade bc1,4 renamed from blade4 to blade4

blade5: blade bc1,5 renamed from blade5 to blade5

blade6: blade bc1,6 renamed from blade6 to blade6

blade7: blade bc1,7 renamed from blade7 to blade7

blade8: blade bc1,8 renamed from blade8 to blade8

blade9: blade bc1,9 renamed from blade9 to blade9

10) mpascan <mm range>

* ๋ชจ๋“  blade์„œ๋ฒ„ ์ด๋ฆ„๋“ค์ด ์„ค์ •tab์— ์ •์˜๋˜์—ˆ๋Š”์ง€๋ฅผ ํ™•์ธ

[root@mgt stage]# mpascan bc1

bc1: blade1 1

bc1: blade10 10

bc1: blade11 11

bc1: blade12 12

bc1: blade13 13

bc1: blade14 14

bc1: blade2 2

bc1: blade3 3

bc1: blade4 4

bc1: blade5 5

bc1: blade6 6

bc1: blade7 7

bc1: blade8 8

bc1: blade9 9

11) RHEL AS4 CD ๋‚ด์šฉ์„ Management node์— ๋ณต์‚ฌํ•œ๋‹ค.

[root@mgt etc]# copycds

Please insert CD 1 and press [Enter]...

- ์ž๋™์ ์œผ๋กœ /install/rhas4/x86_64 ๋””๋ ‰ํ† ๋ฆฌ ์ƒ์„ฑ๊ณผ ํ•จ๊ป˜ CD๋‚ด์šฉ์„ ๋””๋ ‰ํ† ๋ฆฌ์— ๋ณต์‚ฌํ•œ๋‹ค.

Page 19: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 19 -

12) post ๋””๋ ‰ํ† ๋ฆฌ ๋ณต์‚ฌ

[root@mgt /]# cd /opt/xcat

[root@mgt xcat]# find post -print | cpio -dump /install

13) management node ssh key์ƒ์„ฑ

[root@mgt xcat]# gensshkeys root

[root@mgt xcat]# cp /root/.ssh/* /install/post/.ssh/ -f

- * management node ์˜ ssh โ€“ key ๋ฅผ ์ƒ์„ฑํ•˜์—ฌ /install ์— ๋†“๊ณ  ๊ฐ ๊ณ„์‚ฐ ๋…ธ๋“œ๋“ค์ด ์„ค์น˜๋ฅผ ํ• ๋•Œ์—

~/.ssh/๋””๋ ‰ํ† ๋ฆฌ์•ˆ์— ๋ณต์‚ฌ๋ฅผ ํ•ด์„œ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•จ

14) conserver ๋ฐ๋ชฌ ์‹œ์ž‘

[root@mgt /]# service conserver start

Starting Console Server: [ OK ]

15) NFS ์„œ๋ฒ„ ์„ค์ •

[root@mgt /]# vi /etc/exports

/opt/xcat *(ro,no_root_squash,sync)

/install *(ro,no_root_squash,sync)

/tftpboot *(ro,no_root_squash,sync)

[root@mgt /]# chkconfig nfs on

[root@mgt /]# service nfs start

Starting NFS services: [ OK ]

Starting NFS quotas: [ OK ]

Starting NFS daemon: [ OK ]

Starting NFS mountd: [ OK ]

[root@mgt /]# exportfs

/opt/xcat <world>

/tftpboot <world>

/install <world>

16) rinstall <node range>

* /tftpboot/pxelinux.cfg/์•ˆ์— node์ด๋ฆ„๋ณ„๋กœ pxelinux๊ด€๋ จ ํŒŒ์ผ๋“ค์ด ์ƒ์„ฑ๋˜๋ฉด์„œ

node์„ค์น˜๊ฐ€ ์ง„ํ–‰๋œ๋‹ค.

[root@mgt /]# rinstall blade253

blade253: install rhas4-x86_64-blade-all

blade253: ********

[root@mgt pxelinux.cfg]# ls

AC14 AC147709 AC14770D AC147C02 AC147C06 blade256

AC140A08 AC14770A AC14770E AC147C03 AC147C07 blade243

AC14760A AC14770B AC147908 AC147C04 AC147C08 blade254

AC147708 AC14770C AC147C01 AC147C05 AC147C09 blade255 default

Page 20: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 20 -

17) tty console๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ node์„ค์น˜ ํ™•์ธ

[root@mgt ~]# wcons โ€“t blade253 (console open)

[root@mgt ~]# wkill blade253 (console close)

- console font ๋ฐ size์กฐ์ ˆ

* ctrol key + mouse right button ํด๋ฆญ

18) makesshgkh <noderange>

* ์„ค์น˜ ์™„๋ฃŒ ํ›„ node๋“ค์˜ ssh key๊ฐ’์„ ์Šค์บ”ํ•˜์—ฌ ์ •ํ•ฉ์„ฑ์„ ์ฒดํฌํ•œ๋‹ค.

[root@mgt ~]# makesshgkh blade1-blade14

Scanning keys, please wait...

Page 21: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 21 -

1.10 xCAT ์ค‘์š” ๋ช…๋ น์–ด

xCAT ๊ด€๋ จํ•œ ๋ชจ๋“  ๋ช…๋ น์–ด๋“ค์€ /opt/xcat/sbin ๋˜๋Š” /opt/xcat/bin ๋””๋ ‰ํ† ๋ฆฌ ์•ˆ์— ์ •์˜๋˜์–ด ์žˆ๋‹ค.

ํ•˜์ง€๋งŒ ๋ช‡๋ช‡ ๋ช…๋ น์–ด๋“ค์€ xCAT ์Šคํฌ๋ฆฝํŠธ์—์„œ๋งŒ ์‚ฌ์šฉ๋˜๋ฉฐ, ๊ทธ ์™ธ ๋‹ค๋ฅธ ๋ช…๋ น์–ด๋“ค์€ ํด๋Ÿฌ์Šคํ„ฐ ํ™˜๊ฒฝ์„ ๊ด€๋ฆฌ

ํ•˜๋Š” ๋‹ด๋‹น์ž๋‚˜ ํด๋Ÿฌ์Šคํ„ฐ๋ฅผ ์ด์šฉํ•˜์—ฌ Job์„ ์ˆ˜ํ–‰ํ•˜๊ณ ์ž ํ•˜๋Š” User์— ์˜ํ•ด ์‚ฌ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค. ์—ฌ๊ธฐ์„œ๋Š” ์ฃผ

๋กœ ์‚ฌ์šฉ๋˜๋Š” ๋ช…๋ น์–ด์— ๋Œ€ํ•ด ์„ค๋ช…ํ•˜๊ณ ์ž ํ•œ๋‹ค.

1. rpower - Remote power control

xCAT์˜ ์„ค์ •ํŒŒ์ผ์ธ nodelist.tab ํŒŒ์ผ์— ์ •์˜๋˜์–ด ์žˆ๋Š” node๋“ค์˜ ์ „์›๊ด€๋ฆฌ๋ฅผ ์›๊ฒฉ์—์„œ On/Off/Reset

ํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•จ

Synopsis rpower [noderange] [on|off|stat|state|reset|boot|cycle]

Options on Turn power on

off Turn power off

stat | state Return the current power state

reset Send a hardware reset

boot If off, then power on. If on, then hard reset

cycle Power off, then on

Examples [root@mgt ~]# rpower blade1-blade14 stat blade1~14๊นŒ์ง€ ์ „์›์ƒํƒœ ํ™•์ธ

[root@mgt ~]# rpower blade7 on blade7 ์„œ๋ฒ„ ์ „์›์„ On ์‹œํ‚ด

[root@mgt ~]# rpower box22 reset Chassis 22๋ฒˆ ์ „ ์„œ๋ฒ„์˜ ์ „์›์„ Reboot ์‹œํ‚ด

[root@mgt ~]# rpower all off ์ „ ์‹œ์Šคํ…œ์˜ ์ „์›์„ Off ์‹œํ‚ด

2. rinstall - Remote network install

xCAT์˜ ์„ค์ •ํŒŒ์ผ์ธ nodetype.tab ํŒŒ์ผ์— ์ •์˜๋˜์–ด ์žˆ๋Š” ์‚ฌํ•ญ์— ๋งž๊ฒŒ node๋“ค์˜ ์„ค์น˜๋ฅผ ์›๊ฒฉ์—์„œ

์ง„ํ–‰ ์‹œํ‚ด. (๋‹จ, ์„ค์น˜ ์ง„ํ–‰๋˜๋Š” ํ™”๋ฉด์€ ํ™•์ธํ•  ์ˆ˜ ์—†์œผ๋ฉฐ, ์ง„ํ–‰์‚ฌํ•ญ์„ ๋ณด๊ณ ์ž ํ•  ๊ฒฝ์šฐ console server๋ฅผ

ํ†ตํ•ด์„œ ๋ณผ ์ˆ˜ ์žˆ์Œ.)

Synopsis rinstall [noderange]

Examples [root@mgt ~]# rinstall blade1-blade14 blade1~14๊นŒ์ง€ ์›๊ฒฉ ์„ค์น˜ ์ง„ํ–‰

[root@mgt ~]# rinstall box10 Chassis 11๋ฒˆ ์ „ ์„œ๋ฒ„์˜ ์›๊ฒฉ ์„ค์น˜ ์ง„ํ–‰

3. wcons - Windowed remote console

xCAT์˜ ์„ค์ •ํŒŒ์ผ์ธ conserver.tab ํŒŒ์ผ์— ์ •์˜๋˜์–ด ์žˆ๋Š” node๋“ค์˜ Serial ์—ฐ๊ฒฐ ํ™”๋ฉด์„ ์ œ๊ณตํ•˜์—ฌ ์คŒ.

Synopsis rpower [noderange] [on|off|stat|state|reset|boot|cycle]

Options -t|-tile|--tile n ์ฐฝ์˜ ์™ผ์ชฝ ์œ„์—์„œ๋ถ€ํ„ฐ ์™ผ์ชฝ์—์„œ ์˜ค๋ฅธ์ชฝ์œผ๋กœ ํ•ด๋‹น node๋“ค์˜ xWindow

์ฐฝ์„ ๋ณด์—ฌ์ค€๋‹ค. โ€˜nโ€™ ์€ ํ•œ ํ–‰์—์„œ ๋ณด์—ฌ์ค„ ์ˆ˜ ์žˆ๋Š” tile์˜ ์ˆ˜๋ฅผ ์ •์˜ํ•จ.

-f|-font|--font font wcons ํ™”๋ฉด๋“ค์˜ font๋ฅผ ์ง€์ •ํ•จ. (์ฃผ๋กœ ํ™”๋ฉด์ฐฝ์—์„œ ctrl + right click์œผ

๋กœ ์กฐ์ ˆ์ด ๊ฐ€๋Šฅํ•˜๋‹ค.)

wcons๊ฐ€ ์ง€์›ํ•˜๋Š” font alias๋Š” ์•„๋ž˜์™€ ๊ฐ™๋‹ค.

verysmall | vs = nil2

small|s = 5x8

medium | med|m = 6x13

Page 22: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 22 -

large | l | big | b = 7x13

verylarge | vl | verybig | vb = 10x20

Examples [root@mgt ~]# wcons blade7 blade7์˜ Serial Consoleํ™”๋ฉด์„ ๋ณด์—ฌ์คŒ.

[root@mgt ~]# wcons โ€“t 3 box1 Chassis 1๋ฒˆ์˜ Serial Consoleํ™”๋ฉด์„ 3๊ฐœ ๋‹จ์œ„๋กœ ๋ณด์—ฌ์คŒ.

* ํ™”๋ฉด ํฌ๊ธฐ ์กฐ์ ˆ (wcons ํ™”๋ฉด์—์„œ ctrl + mouse right click)

4. wkill - Windowed remote console kill

wcons ๋ช…๋ น์–ด๋ฅผ ํ†ตํ•ด ๋„์šด ์ฐฝ์„ ์—†์• ๊ณ ์ž ํ•  ๋•Œ ์‚ฌ์šฉ.

Synopsis wkill [noderange]

Examples [root@mgt ~]# wkill blade1-blade14 blade1~14์˜ Console ํ™”๋ฉด์„ ์—†์•ค๋‹ค.

[root@mgt ~]# wkill box10 Chassis 10๋ฒˆ์— ํ•ด๋‹นํ•˜๋Š” Console ํ™”๋ฉด์„ ์—†์•ค๋‹ค.

5. winstall - Windowed remote network install

xCAT์˜ ์„ค์ •ํŒŒ์ผ์ธ nodetype.tab ํŒŒ์ผ์— ์ •์˜๋˜์–ด ์žˆ๋Š” ์‚ฌํ•ญ์— ๋งž๊ฒŒ node๋“ค์˜ ์„ค์น˜๋ฅผ ์›๊ฒฉ์—์„œ

์ง„ํ–‰ ์‹œํ‚ค๋ฉฐ, winstall ๋ช…๋ น์–ด ์‹คํ–‰ ์‹œ wcons ๋ช…๋ น์–ด๋ฅผ ๋™์‹œ์— ์‹คํ–‰์‹œ์ผœ ์„ค์น˜ ํ™”๋ฉด์„ Serial

Consoleํ™”๋ฉด์„ ํ†ตํ•ด์„œ ํ™•์ธ ๊ฐ€๋Šฅ์ผ€ ํ•จ.

Synopsis winstall [{-t|-tile|--tile} n | {-t|-tile|--tile}=n] [{-f|-font|--font} font | {-f|-font|-

Page 23: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 23 -

-font}=font] noderange

Options -t|-tile|--tile n ์ฐฝ์˜ ์™ผ์ชฝ ์œ„์—์„œ๋ถ€ํ„ฐ ์™ผ์ชฝ์—์„œ ์˜ค๋ฅธ์ชฝ์œผ๋กœ ํ•ด๋‹น node๋“ค์˜

xWindow ์ฐฝ์„ ๋ณด์—ฌ์ค€๋‹ค. โ€˜nโ€™ ์€ ํ•œ ํ–‰์—์„œ ๋ณด์—ฌ์ค„ ์ˆ˜ ์žˆ๋Š” tile์˜

์ˆ˜๋ฅผ ์ •์˜ํ•จ.

-f|-font|--font font wcons ํ™”๋ฉด๋“ค์˜ font๋ฅผ ์ง€์ •ํ•จ. (์ฃผ๋กœ ํ™”๋ฉด์ฐฝ์—์„œ ctrl + right

click์œผ๋กœ ์กฐ์ ˆ์ด ๊ฐ€๋Šฅํ•˜๋‹ค.)

wcons๊ฐ€ ์ง€์›ํ•˜๋Š” font alias๋Š” ์•„๋ž˜์™€ ๊ฐ™๋‹ค.

verysmall | vs = nil2

small|s = 5x8

medium | med|m = 6x13

large | l | big | b = 7x13

verylarge | vl | verybig | vb = 10x20

Examples [root@mgt ~]# winstall blade1-blade14

blade1~14๊นŒ์ง€ ์›๊ฒฉ ์„ค์น˜๋ฅผ ์ง„ํ–‰ํ•˜๊ณ  Serial Console ํ™”๋ฉด์„ ๋„์›Œ์คŒ

[root@mgt ~]# winstall box22

Chassis 22๋ฒˆ ์ „ ์„œ๋ฒ„์— ๋Œ€ํ•œ ์„ค์น˜๋ฅผ ์ง„ํ–‰ํ•˜๊ณ  Serial Console ํ™”๋ฉด์„ ๋„์›Œ์คŒ

6. psh - Parallel remote shell

Cluster User๋“ค์ด ๊ฐ€์žฅ ๋งŽ์ด ์‚ฌ์šฉํ•˜๋Š” ๋ช…๋ น์–ด๋กœ, xCAT์— ์„ค์ •๋˜์–ด ์žˆ๋Š” node๋“ค์— ๋™์‹œ์— ๋™์ผํ•œ

๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•˜๊ณ ์ž ํ•  ๋•Œ ์‚ฌ์šฉ (rsh๋ฅผ ์‚ฌ์šฉํ• ์ง€ ssh๋ฅผ ์‚ฌ์šฉํ• ์ง€๋Š” site.tab ํŒŒ์ผ์— ์ •์˜๋˜์–ด์žˆ์Œ)

Synopsis psh [-s] [noderange|me|pbs job id] [command]

Options -s Issues the commands serially.

Examples [root@mgt ~]# psh box1-box5 ls Chassis 1~5๋ฒˆ์— ํ•ด๋‹น๋˜๋Š” ์„œ๋ฒ„๋“ค์—๊ฒŒ ๋™์‹œ์— ls๋ช…๋ น์‹คํ–‰

[root@mgt ~]# psh blade3,blade5 date Blade 3,5๋ฒˆ ์„œ๋ฒ„์—๊ฒŒ ๋™์‹œ์— date ๋ช…๋ น์‹คํ–‰

7. pping - Parallel ping

xCAT์— ์„ค์ •๋˜์–ด ์žˆ๋Š” node๋“ค์— ๋™์‹œ์— ping ๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•˜๊ณ ์ž ํ•  ๋•Œ ์‚ฌ์šฉ

Synopsis pping [-s] [noderange]

Options -s Issues the commands serially.

Examples [root@mgt ~]# pping blade1-blade20 Blade 1~20๋ฒˆ์— ํ•ด๋‹น๋˜๋Š” ์„œ๋ฒ„์— ping๋ช…๋ น ๋™์‹œ์‹คํ–‰

[root@mgt ~]# pping box25 Chassis 25๋ฒˆ ์„œ๋ฒ„์— ping๋ช…๋ น ๋™์‹œ์‹คํ–‰

8. prcp - Parallel remote copy

Cluster User๋“ค์ด ๊ฐ€์žฅ ๋งŽ์ด ์‚ฌ์šฉํ•˜๋Š” ๋ช…๋ น์–ด๋กœ, xCAT์— ์„ค์ •๋˜์–ด ์žˆ๋Š” node๋“ค์— ๋™์‹œ์— ๋™์ผํ•œ

๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•˜๊ณ ์ž ํ•  ๋•Œ ์‚ฌ์šฉ (rsh๋ฅผ ์‚ฌ์šฉํ• ์ง€ ssh๋ฅผ ์‚ฌ์šฉํ• ์ง€๋Š” site.tab ํŒŒ์ผ์— ์ •์˜๋˜์–ด์žˆ์Œ)

9. rinv - Remote hardware inventory

Onboard์— ์žˆ๋Š” ๊ด€๋ฆฌ ํ”„๋กœ์„ธ์Šค๋ฅผ ํ†ตํ•ด ํ•˜๋“œ์›จ์–ด ๊ตฌ์„ฑ ์ •๋ณด๋ฅผ ํ™•์ธ.

Synopsis rinv [noderange] [pci|config|model|serial|asset|vpd|bios|mprom|all]

Options pci PCI bus ์ •๋ณด ์ถ”์ถœ.

config Processor ์ˆ˜๋Ÿ‰, speed, total memory, DIMM locations ์ •๋ณด ์ถ”์ถœ

model model number ์ถ”์ถœ.

serial serial number ์ถ”์ถœ.

asset Eth0 MAC address ์ถ”์ถœ

vpd BIOS level, management processor, firmware level ์ถ”์ถœ

Page 24: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 24 -

bios BIOS level ์ •๋ณด ์ถ”์ถœ.

mprom management processor firmware level ์ •๋ณด ์ถ”์ถœ

all ์œ„์˜ ๋ชจ๋“  ์ •๋ณด ์ถ”์ถœ

Examples [root@mgt ~]# rinv blade5 bios blade5์˜ BIOS ์ •๋ณด ์ถ”์ถœ

[root@mgt ~]# rinv box23 all Chassis 23๋ฒˆ์˜ ๋ชจ๋“  ํ•˜๋“œ์›จ์–ด ๊ตฌ์„ฑ ์ •๋ณด ์ถ”์ถœ

10. bwatch โ€“ Blade Watch

๊ฐ Blade Server์— ๋Œ€ํ•œ ๊ฐ„๋‹จํ•œ ์‹œ์Šคํ…œ ์‚ฌ์šฉ ์ •๋ณด๋ฅผ GUIํ™”๋ฉด์„ ํ†ตํ•ด ํ™•์ธ

Synopsis bwatch [noderange]

Examples [root@mgt ~]# bwatch box1 Chassis 1๋ฒˆ์— ํ•ด๋‹น๋˜๋Š” ์„œ๋ฒ„๋“ค์˜ ์‹œ์Šคํ…œ ์ •๋ณด ํ™•์ธ

[root@mgt ~]# bwatch blade3 Blade 3๋ฒˆ ์„œ๋ฒ„์˜ ์‹œ์Šคํ…œ ์ •๋ณด ํ™•์ธ

11. rbootseq - Remote boot sequence

BIOS์ƒ์—์„œ boot sequence๋ฅผ ๋ณ€๊ฒฝํ•˜์ง€ ์•Š๊ณ  ๋ช…๋ น์–ด๋ฅผ ํ†ตํ•ด boot priority๋ฅผ ๋ณ€๊ฒฝ.

Synopsis Rbootseq [noderange] [{f|floppy,c|cdrom,n|network,hd{0,1}|harddisk{0,1}}|list]

Options f|floppy Floppy disk

c|cdrom CD-Rom

n|network PXE Network

hd{0,1}|harddisk{0,1} Hard Disk.

list ํ˜„์žฌ boot sequence ํ™•์ธ

Examples [root@mgt ~]# rbootseq box1 n,c,hd0,f

Chassis 1๋ฒˆ์˜ Boot Sequence๋ฅผ ๋„คํŠธ์›Œํฌ, CD-Rom, Hard Disk 0๋ฒˆ, Floppy Disk์œผ๋กœ ๋ณ€๊ฒฝ

[root@mgt ~]# rbootseq box23 list Chassis 23๋ฒˆ์˜ ๋ชจ๋“  ์„œ๋ฒ„๋“ค์˜ Boot Sequence ํ™•์ธ

Page 25: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 25 -

2. HPC Software Stack Installation Guide

2.1 Intel C Compiler

intel CD๋ฅผ ์‚ฝ์ž…ํ•˜๊ณ  CD์— ์žˆ๋Š” ๋ฐ”์ด๋„ˆ๋ฆฌ๋ฅผ ํŠน์ • ๋””๋ ‰ํ† ๋ฆฌ๋กœ ๋ณต์‚ฌํ•œ๋‹ค. tar ์••์ถ•์„ ํ‘ผํ›„ ์ธ์Šคํ†จ ํ”„๋กœ๊ทธ๋žจ์„ ์‹คํ–‰์‹œํ‚จ๋‹ค. [root@dmgt tmp]# tar xfz l_cc_c_9.1.045.tar.gz [root@dmgt tmp]# cd l_cc_c_9.1.045 [root@dmgt l_cc_c_9.1.045]# ./install.sh install ๋ฉ”๋‰ด์—์„œ 1๋ฒˆ์„ ์„ ํƒํ•œ๋‹ค์Œ, ๋‹ค์‹œ ํ•œ๋ฒˆ 1๋ฒˆ์„ ์„ ํƒํ•˜๊ณ , ๋ผ์ด์„ผ์ŠคํŒŒ์ผ์˜ ์œ„์น˜๋ฅผ ์ ์–ด์ค€๋‹ค. Please provide the license file name with full path (/<path>/<name>.lic) b. Go Back. x. Exit. License file path : /opt/intel/cpplicense.lic ์ดํ›„ accept๋ฅผ ์ž…๋ ฅํ•˜๋ฉด ์ปดํŒŒ์ผ๋Ÿฌ๊ฐ€ ์„ค์น˜๋œ๋‹ค. ์„ค์น˜๋ฅผ ๋งˆ์นœํ›„, /opt/intel/cc๋ผ๋Š” ๋””๋ ‰ํ† ๋ฆฌ๊ฐ€ ์„ค์น˜๋˜์—ˆ๋Š”์ง€ ํ™•์ธํ•œ ๋‹ค์Œ ์•„๋ž˜์˜ ๋ช…๋ น์–ด๋กœ ์ปดํŒŒ์ผ๋Ÿฌ๊ฐ€ ์ •์ƒ์ ์œผ๋กœ ์„ค์น˜๋˜์—ˆ๋Š”์ง€ ํ™•์ธํ•œ๋‹ค. [root@dmgt l_cc_c_9.1.045]source /opt/intel/cce/9.1.042/bin/iccvars.sh [root@blade329 ~]# icc icc: Command line error: no files specified; for help type "icc -help

2.2 Intel Fortran Compiler

intel CD๋ฅผ ์‚ฝ์ž…ํ•˜๊ณ  CD์— ์žˆ๋Š” ๋ฐ”์ด๋„ˆ๋ฆฌ๋ฅผ ํŠน์ • ๋””๋ ‰ํ† ๋ฆฌ๋กœ ๋ณต์‚ฌํ•œ๋‹ค. tar ์••์ถ•์„ ํ‘ผํ›„ ์ธ์Šคํ†จ ํ”„๋กœ๊ทธ๋žจ์„ ์‹คํ–‰์‹œํ‚จ๋‹ค. [root@dmgt tmp]# tar xfz l_fc_c_9.1.040.tar.gz [root@dmgt tmp]# cd l_fc_c_9.1.040 [root@dmgt l_fc_c_9.1.040]# ./install.sh install ๋ฉ”๋‰ด์—์„œ 1๋ฒˆ์„ ์„ ํƒํ•œ๋‹ค์Œ, ๋‹ค์‹œ ํ•œ๋ฒˆ 1๋ฒˆ์„ ์„ ํƒํ•˜๊ณ , ๋ผ์ด์„ผ์ŠคํŒŒ์ผ์˜ ์œ„์น˜๋ฅผ ์ ์–ด์ค€๋‹ค. Please provide the license file name with full path (/<path>/<name>.lic)

b. Go Back. x. Exit.

License file path : /opt/intel/fclicense.lic ์ดํ›„ accept๋ฅผ ์ž…๋ ฅํ•˜๋ฉด ์ปดํŒŒ์ผ๋Ÿฌ๊ฐ€ ์„ค์น˜๋œ๋‹ค. ์„ค์น˜๋ฅผ ๋งˆ์นœํ›„, /opt/intel/fc๋ผ๋Š” ๋””๋ ‰ํ† ๋ฆฌ๊ฐ€ ์„ค์น˜๋˜์—ˆ๋Š”์ง€ ํ™•์ธํ•œ ๋‹ค์Œ ์•„๋ž˜์˜ ๋ช…๋ น์–ด๋กœ ์ปดํŒŒ์ผ๋Ÿฌ๊ฐ€ ์ •์ƒ์ ์œผ๋กœ ์„ค์น˜๋˜์—ˆ๋Š”์ง€ ํ™•์ธํ•œ๋‹ค. [root@dmgt tmp]source / source /opt/intel/cce/9.1.042/bin/iccvars.sh [root@ dmgt tmp]# ifc ifc: warning: The Intel Fortran driver is now named ifort. You can suppress this message with '-quiet' Ifort: Command line error: no files specified; for help type "ifort -help"

2.3 Intel MPI

intel CD๋ฅผ ์‚ฝ์ž…ํ•˜๊ณ  CD์— ์žˆ๋Š” ๋ฐ”์ด๋„ˆ๋ฆฌ๋ฅผ ํŠน์ • ๋””๋ ‰ํ† ๋ฆฌ๋กœ ๋ณต์‚ฌํ•œ๋‹ค. tar ์••์ถ•์„ ํ‘ผํ›„ ์ธ์Šคํ†จ ํ”„๋กœ๊ทธ๋žจ์„ ์‹คํ–‰์‹œํ‚จ๋‹ค. [root@dmgt tmp]# tar xfz l_mpi_p_3.0.033.tar.gz [root@dmgt tmp]# cd l_mpi_p_3.0.033 [root@dmgt l_mpi_p_3.0.033]# ./install.sh install ๋ฉ”๋‰ด์—์„œ 1๋ฒˆ์„ ์„ ํƒํ•œ๋‹ค์Œ, ๋‹ค์‹œ ํ•œ๋ฒˆ 1๋ฒˆ์„ ์„ ํƒํ•˜๊ณ , ๋ผ์ด์„ผ์ŠคํŒŒ์ผ์˜ ์œ„์น˜๋ฅผ ์ ์–ด์ค€๋‹ค.

Page 26: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 26 -

Please provide the license file name with full path (/<path>/<name>.lic)

b. Go Back. x. Exit.

License file path : /opt/intel/mpilicense.lic ์ดํ›„ accept๋ฅผ ์ž…๋ ฅํ•˜๋ฉด mpi๊ฐ€ ์„ค์น˜๋œ๋‹ค. ์„ค์น˜๋ฅผ ๋งˆ์นœํ›„, /opt/intel/mpirun๋ผ๋Š” ๋””๋ ‰ํ† ๋ฆฌ๊ฐ€ ์„ค์น˜๋˜์—ˆ๋Š”์ง€ ํ™•์ธํ•œ ๋‹ค์Œ ์•„๋ž˜์˜ ๋ช…๋ น์–ด๋กœ mpi๊ฐ€ ์ •์ƒ์ ์œผ๋กœ ์„ค์น˜๋˜์—ˆ๋Š”์ง€ ํ™•์ธํ•œ๋‹ค. [root@dmgt tmp]source source /opt/intel/mpi/3.0/bin64/mpivars.sh [root@blade329 bin]# mpiexec | more usage: mpiexec [-h or -help or --help] # get this message

2.4 Intel MKL (Math Kernel Library)

intel CD๋ฅผ ์‚ฝ์ž…ํ•˜๊ณ  CD์— ์žˆ๋Š” ๋ฐ”์ด๋„ˆ๋ฆฌ๋ฅผ ํŠน์ • ๋””๋ ‰ํ† ๋ฆฌ๋กœ ๋ณต์‚ฌํ•œ๋‹ค. tar ์••์ถ•์„ ํ‘ผํ›„ ์ธ์Šคํ†จ ํ”„๋กœ๊ทธ๋žจ์„ ์‹คํ–‰์‹œํ‚จ๋‹ค. [root@dmgt tmp]# tar xfz l_mkl_p_9.0.018.tar.gz [root@dmgt tmp]# cd l_mkl_p_9.0.018 [root@dmgt l_mkl_p_9.0.018]# ./install.sh install ๋ฉ”๋‰ด์—์„œ 1๋ฒˆ์„ ์„ ํƒํ•œ๋‹ค์Œ, ๋‹ค์‹œ ํ•œ๋ฒˆ 1๋ฒˆ์„ ์„ ํƒํ•˜๊ณ , ๋ผ์ด์„ผ์ŠคํŒŒ์ผ์˜ ์œ„์น˜๋ฅผ ์ ์–ด์ค€๋‹ค. Please provide the license file name with full path (/<path>/<name>.lic)

b. Go Back. x. Exit.

License file path : /opt/intel/mkllicense.lic ์ดํ›„ accept๋ฅผ ์ž…๋ ฅํ•˜๋ฉด MKL์ด ์„ค์น˜๋œ๋‹ค.

์„ค์น˜๋ฅผ ๋งˆ์นœํ›„ /opt/intel/mkl์ด ์ƒ์„ฑ๋˜์—ˆ๋‚˜๋ฅผ ํ™•์ธํ•œ๋‹ค.

2.5 PBS Pro

์„ค์น˜๋ฅผ ์›ํ•˜๊ณ ์žํ•˜๋Š” ๋…ธ๋“œ์— dmgt์˜ /root/pspace/PBSPro_7.1๋ฅผ ๋ณต์‚ฌํ•ด ์ค€๋‹ค. root@dmgt#> scp /root/pspace/PBSPRo_7.1 blade1:/tmp ๋ณต์‚ฌํ•œ ๋…ธ๋“œ์— ๋กœ๊ทธ์ธํ•œํ›„ ๋ณต์‚ฌํ•œ ๋””๋ ‰ํ† ๋ฆฌ๋กœ ์ด๋™ํ•œ๋‹ค. root@dmgt#> ssh blade1 ๋ณต์‚ฌ๋œ ๋””๋ ‰ํ† ๋ฆฌ๋กœ ์ด๋™ํ•œํ›„ ์„ค์น˜ ๋ช…๋ น์„ ์ˆ˜ํ–‰ํ•œ๋‹ค. root@blade1# PBSPro_7.1> ./INSTALL < install_mom ๊ฐ™์€ ๋””๋ ‰ํ† ๋ฆฌ์˜ pbs.conf(๋””๋ฒ„๊ทธ ๋…ธ๋“œ์ผ ๊ฒฝ์šฐ pbs_debug.conf )๋ฅผ /etc/์•„๋ž˜๋กœ ๋ณต์‚ฌํ•œํ›„ pbs restart์‹œ์ผœ์ค€๋‹ค. root@blade1# PBSPro_7.1> cp pbs.conf /etc/pbs.conf root@blade1# PBSPro_7.1> service pbs restart

2.6 PBS Pro Check

Pbs server์™€์˜ ํ†ต์‹ ํ™•์ธ

[root@blade328 scripts]# qstat -Bf

Server: schedule1

server_state = Active

server_host = schedule1

scheduling = True

total_jobs = 1

Page 27: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 27 -

state_count = Transit:0 Queued:0 Held:1 Waiting:0 Running:0 Exiting:0 Begun

:0

acl_roots = root@schedule2,root@schedule1

default_queue = workq

log_events = 511

mail_from = adm

query_other_jobs = True

resources_default.ncpus = 1

default_chunk.ncpus = 1

scheduler_iteration = 600

FLicenses = 0

resv_enable = True

node_fail_requeue = 310

max_array_size = 10000

pbs_version = PBSPro_7.1.4.63140

pbs ๋ฐ๋ชฌ ์ƒํƒœํ™•์ธ

[root@blade328 scripts]# pbsnodes blade328

blade328

Host = blade328

ntype = PBS

state = free

license = l

pcpus = 4

resources_available.arch = linux

resources_available.host = blade328

resources_available.mem = 8162496kb

resources_available.ncpus = 4

resources_assigned.mem = 0kb

resources_assigned.ncpus = 0

resv_enable = True

Page 28: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 28 -

3. HPL Compile, Installation and Running BMT

3.1 HPC ์‹œ์Šคํ…œ์˜ ์„ฑ๋Šฅ์ˆ˜์น˜ - FLOPS

์Šˆํผ์ปดํ“จํ„ฐ์—์„œ์˜ ์„ฑ๋Šฅ์€ FLOPS(Floationg-point Operations Per Second : ์ดˆ๋‹น ์‹ค์ˆ˜์—ฐ์‚ฐ ํšŒ์ˆ˜) 1์ดˆ์—

๋ง์…ˆ, ๋บ„์…ˆ, ๊ณฑ์…‰, ๋‚˜๋ˆ—์…ˆ ๋“ฑ์˜ ์‹ค์ˆ˜ ๊ณ„์‚ฐ์„ ์ด ๋ช‡ ๋ฒˆ ํ•  ์ˆ˜ ์žˆ๋Š”์ง€๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” ๊ฐ’์ด๋‹ค. ๋งŒ์•ฝ 1์ดˆ์— ์‹ค

์ˆ˜๋ผ๋ฆฌ ๊ณฑ์…ˆ์„ 2๋ฒˆ์”ฉ ํ•  ์ˆ˜ ์žˆ๋‹ค๋ฉด ๊ทธ๋•Œ์˜ ๊ณ„์‚ฐ ์†๋„๋Š” 2FLOPS๊ฐ€ ๋˜๋Š” ๊ฒƒ์ด๋‹ค.

3.2 ์ด๋ก ์„ฑ๋Šฅ vs ์‹ค์ œ์„ฑ๋Šฅ ใ€€ Rpeak vs Rmax

๊ทธ๋Ÿผ ์ด์ œ ์„ฑ๋Šฅ์— ๋Œ€ํ•ด ์•Œ์•„๋ดค์œผ๋‹ˆ ์–ด๋–ป๊ฒŒ ์„ฑ๋Šฅ์„ ์ธก์ •ํ•˜๋Š”๊ฐ€?์— ๋Œ€ํ•œ ํ•ด๋‹ต์ด ๋‚˜์™€์•ผ ํ•œ๋‹ค. ์„ฑ๋Šฅ์€ ํฌ

๊ฒŒ ์ด๋ก ์„ฑ๋Šฅ๊ณผ ์‹ค์ œ์„ฑ๋Šฅ์œผ๋กœ ๋‚˜๋‰œ๋‹ค. ์ด๋ก ์„ฑ๋Šฅ(Rpeak)์€ ๋ง ๊ทธ๋Œ€๋กœ ์ด๋ก ์ ์ธ ์„ฑ๋Šฅ์ด๊ณ  ์‹ค์ œ์„ฑ๋Šฅ(Rmax)

์€ ์—ฌ๋Ÿฌ๊ฐ€์ง€ ํ…Œ์ŠคํŠธ๋ฅผ ํ†ตํ•ด ์–ป์–ด๋‚ธ ์‹ค์ œ๋กœ ์ธก์ •๋œ ์„ฑ๋Šฅ์ด๋‹ค. ๊ทธ๋Ÿผ ์‹ค์ œ์„ฑ๋Šฅ์„ ๊ตฌํ•˜๊ธฐ ์ „์— ๋‚ด ํ•˜๋“œ์›จ์–ด

(CPU)๊ฐ€ ๋‚ผ ์ˆ˜ ์žˆ๋Š” ์ด๋ก ์ ์ธ ์„ฑ๋Šฅ์„ ๊ณ„์‚ฐํ•ด ๋ณด๋„๋ก ํ•˜์ž. KIST ์Šˆํผ์ปดํ“จํŒ…์„ผํ„ฐ์˜ QnA์— ์˜ฌ๋ผ์˜จ ๊ฒƒ์„

์ธ์šฉํ•˜์ž๋ฉด, ์ด๋ก  ์„ฑ๋Šฅ์น˜, peak performance(Rpeak)๋Š” ์•„๋ž˜์˜ ์‹์„ ํ†ตํ•ด ๊ตฌํ•  ์ˆ˜ ์žˆ๋‹ค.

Flop/s=(cycle/s)*(Flop/cycle)*(Number of pipes)

Pentium-IV 3GHz์ผ ๊ฒฝ์šฐ ia32(Itaium์„ ์ œ์™ธํ•œ intel, AMD์˜ ๋Œ€๋ถ€๋ถ„์˜ CPU) ์•„ํ‚คํ…์ฒ˜ ๊ณ„์—ด์˜ CPU๋กœ

์„œ 1 cycle๋‹น (double precision) floating point ์—ฐ์‚ฐ์€ ์ตœ๋Œ€ 2๋ฒˆ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, vector ๊ณ„์—ด์ด ์•„๋‹ˆ

๋ฏ€๋กœ number of pipes๋Š” 1์ด๋‹ค. ๋”ฐ๋ผ์„œ ์œ„์‹์„ ์ ์šฉํ•˜๋ฉด

3(GHz/sec) * 2(Flop/Hz) * 1 = 6 GFlops

๋”ฐ๋ผ์„œ ์œ„์™€ ๊ฐ™์€ ๋…ธ๋“œ๊ฐ€ 32๊ฐœ ์ผ ๊ฒฝ์šฐ

32*6 = 192 Gflops ๊ฐ€ ๋‚˜์˜จ๋‹ค.

CPU๊ฐ€ Woodcrest ๋ชจ๋ธ์ผ ๋•Œ๋Š” 3(GHz/sec) * 4(Flop/Hz) * 1 = 12 GFlops๋กœ ๊ณ„์‚ฐ์ด ๋œ๋‹ค. ๋งˆ์ดํฌ๋กœ ํ”„

๋กœ์„ธ์Šค์˜ ์•„ํ‚คํ…์ณ ๋ณ€ํ™”๋กœ ์ธํ•˜์—ฌ ํ•œ ํด๋Ÿญ๋‹น 4๊ฐœ์˜ fp๋ช…๋ น์–ด๋ฅผ ์ฒ˜๋ฆฌ ๊ฐ€๋Šฅํ•˜๊ฒŒ ๋˜์—ˆ๋‹ค.

1. ๋ช…๋ น์–ด(instruction, mictor-op code)๋ฅผ 1Clock๋‹น 3๊ฐœ์—์„œ 4๊ฐœ fetch, decode, issue, execution & retire 2. ๊ณผ๊ฑฐ์—๋Š” fp ์—ฐ์‚ฐ์‹œ add or mul/div ์˜€๋Š”๋ฐ ์ง€๊ธˆ์€ add + mul/div ์ธ๊ตฌ์กฐ๋กœ๋ณ€๊ฒฝ๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ์ฆ‰ ๊ณผ๊ฑฐ์—๋Š” add ์™€ mul/div ๋ฅผ ๋™์‹œ์— ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์—†์—ˆ๋Š”๋ฐ(๋‘˜์ค‘์—ํ•˜๋‚˜๋งŒ์ฒ˜๋ฆฌ), ์ง€๊ธˆ์€ add์™€ mul/div๋ฅผ ๋™์‹œ์— ์ฒ˜๋ฆฌ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ํ–‰๋ ฌ์—ฐ์‚ฐ ๊ณผ ๊ฐ™์€ HPC application์—์„œ 4๊ฐœ์˜ fp ์ฒ˜๋ฆฌ๊ฐ€ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.

3.3 HPL (High Performance Linpack)

ํ”ํžˆ ํด๋Ÿฌ์Šคํ„ฐ ์‹œ์Šคํ…œ์—์„œ ๋ฒค์น˜๋งˆํฌ๋Š” LINPACK ์„ ์‚ฌ์šฉํ•˜๊ฑฐ๋‚˜ HPL(High-Performance Linpack

Benchmark)์„ ํ†ตํ•ด์„œ ์‹œ์Šคํ…œ์˜ ์‹ค์ œ ์„ฑ๋Šฅ์„ ์ธก์ •ํ•˜๊ฒŒ ๋œ๋‹ค. LINPACK ๋ฒค์น˜๋งˆํฌ ์—์„œ ์ค‘์ ์ ์œผ๋กœ

์‚ฌ์šฉ๋˜๋Š” ๋ฃจํ‹ด๋“ค์€ Gauss ์†Œ๊ฑฐ๋ฒ•์„ ์ด์šฉํ•œ N ๊ฐœ์˜ ์„ ํ˜•๋ฐฉ์ •์‹ ์˜ ํ•ด๋ฅผ ๊ตฌํ•˜๋Š” ๊ฒƒ์œผ๋กœ BLAS (Basic

Linear Algebra Subprograms) ์— ํฌํ•จ๋˜์–ด ์žˆ๋‹ค. BLAS ๋Š” LINPACK ๋ฒค์น˜๋งˆํฌ ์—์„œ ๊ฐ€์žฅ ๊ธฐ๋ณธ์ด ๋˜๋Š”

๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋กœ์จ ๊ธฐ๋ณธ์ ์ธ ์„ ํ˜•๋Œ€์ˆ˜ ์—ฐ์‚ฐํ•จ์ˆ˜ ๋“ค์„ ๊ตฌํ˜„ํ•ด๋†“์€ ์ง‘ํ•ฉ์ด๋‹ค. ์ด BLAS ๋ฅผ ์ด์šฉํ•ด

๋ฒค์น˜๋งˆํ‚น์„ ํ• ์ˆ˜๋„ ์žˆ์ง€๋งŒ ATLAS (Automatically Tuned Linear Algebra Software) ๋ฅผ ์ด์šฉํ•˜์—ฌ ํ•ด๋‹น

ํ”Œ๋žซํผ์— ์ตœ์ ํ™”๋œ ๋ฃจํ‹ด ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ์ƒ์„ฑ ํ•  ์ˆ˜๋„ ์žˆ๋‹ค. HPL ์„ ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” MPI implementation

๊ณผ BLAS implementation ์„ ํ•„์š”๋กœ ํ•œ๋‹ค.

Page 29: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 29 -

3.4 ์„ ํ˜•๋Œ€์ˆ˜ BLAS , ATLAS

BLAS (Basic Linear Algebra Subprograms)

์„ ํ˜•๋Œ€์ˆ˜(Linear Algebra) ๋ฌธ์ œ์˜ ํ•ด๋ฅผ ํšจ์œจ์ ์œผ๋กœ ๊ตฌํ•˜๊ธฐ ์œ„ํ•œ ๋ฐฉ๋ฒ•์˜ ํ•˜๋‚˜๋Š” Basic Linear Algebra

Subprograms(BLAS)๋ฅผ ์ด์šฉํ•˜๋Š” ๊ฒƒ์ด๋‹ค. BLAS ๋Š” blocking ๊ธฐ๋ฒ•์„ ๋ฐ”ํƒ•์œผ๋กœ ํ•˜์—ฌ ๊ธฐ๋ณธ์ ์ธ vector ์™€

matrix ์—ฐ์‚ฐ์„ ์ˆ˜ํ–‰ํ•˜๋Š” ์—ญํ• ์„ ํ•œ๋‹ค. BLAS ์—๋Š” ์—ฐ์‚ฐ์˜ ์ข…๋ฅ˜์— ๋”ฐ๋ผ Level 1, 2, 3 BLAS ๋กœ

๋‚˜๋‰˜์–ด์ง„๋‹ค. Level 1 BLAS ๋Š” vector-vector ์—ฐ์‚ฐ์„ ์ˆ˜ํ–‰ํ•˜๊ณ , Level 2 BLAS ๋Š” matrix-vector ์—ฐ์‚ฐ,

Level 3 BLAS ๋Š” matrix-matrix ์—ฐ์‚ฐ์„ ํ•˜๋Š”๋ฐ ์‚ฌ์šฉ๋˜์–ด์ง„๋‹ค. BLAS ๋Š” ๋›ฐ์–ด๋‚œ ํšจ์œจ๊ณผ ์šฐ์ˆ˜ํ•œ

์ด์‹์„ฑ์„ ๋ฐ”ํƒ•์œผ๋กœ LAPACK ๊ณผ ๊ฐ™์€ Linear Algebra Software ์˜ ๊ฐœ๋ฐœ์— ์‚ฌ์šฉ๋˜๊ณ  ์žˆ๋‹ค. ํ˜„์žฌ ๊ฐ๊ฐ์˜

architecture ์— hand-optimized BLAS ๊ฐ€ software vendor ๋“ค์— ์˜ํ•ด์„œ ๋งŒ๋“ค์–ด์ ธ ์žˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ์ƒˆ๋กœ์šด

architecture ๋งˆ๋‹ค ์ตœ์ ํ™”๋œ BLAS ๋ฅผ ๋งŒ๋“œ๋Š” ์ž‘์—…์€ ์‹œ๊ฐ„์ด ๋งŽ์ด ๊ฑธ๋ฆด ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ๊ธˆ์ „์ ์ธ ๋ฌธ์ œ๋„

๊ฐ€์ง€๊ณ  ์žˆ ๋‹ค. ์ด๋กœ ์ธํ•ด Pentium/Linux architecture ๋ฅผ ์œ„ํ•œ ํšจ์œจ์ ์ธ matrix multiply ๋ฅผ ํ•  ์ˆ˜ ์žˆ๋Š”

BLAS ๊ฐ€ ์•„์ง ๋งŒ๋“ค์–ด์ง€์ง€ ์•Š์•˜๋‹ค. ๋”ฐ๋ผ์„œ ์˜ค๋Š˜๋‚ ์˜ ๋‹ค์–‘ํ•œ Architecture ์—์„œ ํšจ์œจ์ ์œผ๋กœ ์‚ฌ์šฉ๋  ์ˆ˜

์žˆ๋Š” basic linear algebra routines ๋ฅผ ๋งŒ๋“ค์–ด์ฃผ๋Š” software ๊ฐ€ ํ•„์š”ํ•˜๊ฒŒ ๋˜์—ˆ๋‹ค. ์ด๋Ÿฌํ•œ ๋ชฉ์ ์„

๋‹ฌ์„ฑํ•˜๊ธฐ ์œ„ํ•˜์—ฌ ๋งŒ๋“  ์†Œํ”„ํŠธ์›จ์–ด๊ฐ€ ๋ฐ”๋กœ ATLAS(Automatically Tuned Linear Algebra Software)์ด๋‹ค.

ATLAS (Automatically Tuned Linear Algebra Software)

ATLAS ๋Š” ๊ทธ ์ด๋ฆ„์—์„œ๋„ ์•Œ ์ˆ˜ ์žˆ๋“ฏ์ด ์˜ค๋Š˜๋‚ ์˜ microprocessor ์—์„œ ๊ณ ๋„์˜ ํšจ์œจ์„ฑ์„ ๋‹ฌ์„ฑํ•  ์ˆ˜

์žˆ๋Š” basic linear algebra routine ์„ ์Šค์Šค๋กœ ๋งŒ๋“ค ์ˆ˜ ์žˆ๋Š” software ์ด๋‹ค. ๋”ฐ๋ผ์„œ ํ”„๋กœ๊ทธ๋žจ์˜ ํŠน์„ฑ์—

๋”ฐ๋ผ, ๊ฐ•ํ•œ ์ด์‹์„ฑ๊ณผ architecture ์˜ ํŠน์„ฑ์— ๋งž๋Š” BLAS implementation ์„ ์ œ๊ณตํ•˜๊ณ  ์žˆ๋‹ค. ์ง€๊ธˆ๋„

๊พธ์ค€ํžˆ ์ƒˆ๋กœ์šด ๋ฒ„์ „์˜ ATLAS ๊ฐ€ ๋งŒ๋“ค ์–ด์ง€๊ณ  ์žˆ์œผ๋ฉฐ, C ์™€ Fortran77 interface ๋ฅผ ์ œ๊ณตํ•˜๊ณ  ์žˆ๋‹ค.

ํ˜„์žฌ ์•ˆ์ •๋ฒ„์ „์€ 3.2.1 ๊นŒ์ง€, ๊ฐœ๋ฐœ์ž๋ฒ„์ „์€ 3.3.1 ๊นŒ์ง€ ๋งŒ๋“ค์–ด์ ธ ์žˆ๋‹ค. ATLAS 3.2.1 ์—์„œ๋Š” Intel ์˜

SSE ์™€ AMD ์˜ 3DNOW ๋ฅผ ์ด์šฉํ•˜์—ฌ library ๋ฅผ ๋งŒ๋“ค๊ณ  ์žˆ์œผ๋ฉฐ, ATLAS 3.3.1 ์—์„œ๋Š” Intel ์˜ SSE,

SSE2 ์™€ AMD ์˜ 3DNOW, 3DNOW2 ๋ฅผ ๊ฐ๊ฐ ์ด์šฉํ•˜๋„๋ก ๋˜์–ด์žˆ๋‹ค. ๋”ฐ๋ผ์„œ ATLAS 3.2.1 ์„ ์‚ฌ์šฉํ•œ

Pentium III ์—์„œ๋Š” ๋‹จ์ •๋„ ๋ถ€๋™์†Œ์ˆ˜์ (single precision floating point) ์—ฐ์‚ฐ์˜ ์„ฑ๋Šฅ์ด ํ™•์—ฐํžˆ ์ข‹์•„์ง€๊ณ ,

ATLAS 3.3.1 ์„ ์‚ฌ์šฉํ•œ Pentium4 ์‹œ์Šคํ…œ์—์„œ SSE2 ๊ธฐ๋Šฅ์„ ํƒ์ง€ํ•˜์—ฌ ์‚ฌ์šฉํ•จ์œผ๋กœ์„œ ๋ฐฐ์ •๋„

๋ถ€๋™์†Œ์ˆ˜์ (double precision floating point) ์—ฐ์‚ฐ์—์„œ์˜ ์„ฑ๋Šฅ์ด ํš๊ธฐ์ ์œผ๋กœ ๊ฐœ์„ ๋˜๋Š” ๊ฒƒ์„ ํ™•์ธํ•  ์ˆ˜

์žˆ๋‹ค.

3.5 MPI (Message Passing Interface)

MPI ์˜ ์ข…๋ฅ˜์—๋Š” ์—ฌ๋Ÿฌ ๊ฐ€์ง€๊ฐ€ ์žˆ์ง€๋งŒ, Linux ์šฉ์€ ํฌ๊ฒŒ ๋‘๊ฐ€์ง€ ์ด๋‹ค. ๊ทธ ํ•˜๋‚˜๊ฐ€ LAM/MPI ์ด๋ฉฐ, ๋˜

๋‹ค๋ฅธ ํ•˜๋‚˜๋Š” MPICH ์ด๋‹ค. ์ด๋Ÿฌํ•œ MPI Library ์˜ ์„ค์น˜๋Š” GNU Compiler ๋ฅผ ์„ค์น˜ํ•  ๋•Œ ์™€ ๊ฐ™์€

configure , make, make install ์˜ ์ผ๋ จ์˜ ๊ณผ์ •์„ ๊ฑฐ์น˜๋ฉฐ, ๋‹ค๋ฅธ ์ ์€ configure ํ•  ๋•Œ์˜ option ์˜

์ฐจ์ด์ด๋‹ค. ํ‘œ 1 ์€ ์‚ฌ์šฉ๋˜์–ด์ง„ compiler ๊ทธ๋ฆฌ๊ณ  ์‚ฌ์šฉ๋˜์–ด์ง„ option ์— ๋”ฐ๋ผ ์„ค์น˜๋˜์–ด์ง„ CUP ์— ๋Œ€ํ•œ

MPI ๋ฅผ ๋‚˜ํƒ€๋‚ด์—ˆ๋‹ค.

MPI OPTION Compiler ๋น„๊ณ 

lam-gcc

(lam-6.5.6) --prefix=/usr/local/lam-gcc

gcc-

2.95.3

Smp machine:

--with-

Page 30: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 30 -

lam-intel

(lam-6.5.6)

--prefix=/usr/local/lam --with-cc=icc --

with-fc=ifc --with-cflags=โ€™-unroll -axW -

tpp7 -alignโ€™ --with-fflags=โ€™-unroll -axW -

tpp7 -alignโ€™

Intel

Compiler

rpi=usysv

์ถ”๊ฐ€

mpich-gcc

(mpich-

1.2.4)

--prefix=/usr/local/mpich-gcc gcc-

2.95.3

mpich-intel

(mpich-

1.2.4)

export CC=icc

export FC=ifc

export F90=ifc

--prefix=/usr/local/mpich-intel โ€“with-

comm=shared

Intel

Compiler

ํ‘œ 1 MPI Library

3.6 MPICH and LIbrary Installation

3.6.1 ์‚ฌ์ „ ์‹œ์Šคํ…œ ๊ตฌ์„ฑ ํ™˜๊ฒฝ

์‹œ์Šคํ…œ ์šด์˜ํ™˜๊ฒฝ์€ x86_64๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜๋Š” ๋ฆฌ๋ˆ…์Šค ์‹œ์Šคํ…œ์œผ๋กœ MPI์šฉ ๋„คํŠธ์›์€ ๊ธฐ๊ฐ€๋น„ํŠธ ์ด๋”๋„ท์„

์‚ฌ์šฉํ•˜์—ฌ ์šด์˜์ฒด์ œ๋Š” Redhat Enterprise Linux Update 4๋ฅผ ์ด์šฉํ•˜์—ฌ ์‹œ์Šคํ…œ์„ ๊ตฌ์„ฑํ•˜์˜€๋‹ค. ์‹œ์Šคํ…œ

์šด์˜ํ™˜๊ฒฝ์— ๋Œ€ํ•œ ๊ตฌ์„ฑ์€ ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค.

3.6.2 MPICH Installation

GNU Compiler์ธ GCC ์ปดํŒŒ์ผ๋Ÿฌ์— ๋Œ€ํ•œ ํ™˜๊ฒฝ๋ณ€์ˆ˜ ๋ฐ ์„ค์น˜์œ„์น˜ Path๋“ฑ์— ๋Œ€ํ•œ ํ™˜๊ฒฝ๋ณ€์ˆ˜๊ฐ€

์ ์ ˆํ•˜๊ฒŒ ๊ตฌ์„ฑ๋˜์–ด ์žˆ๋Š”์ง€ ํ™•์ธํ•˜์—ฌ MPICH ์— ๋Œ€ํ•œ ์„ค์น˜๋ฅผ ์ง„ํ–‰ํ•œ๋‹ค

# gcc โ€“v Reading specs from /usr/lib/gcc/x86_64-redhat-linux/3.4.6/specs Configured with: ../configure --prefix=/usr --mandir=/usr/share/man --infodir=/usr/share/info --

Page 31: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 31 -

enable-shared --enable-threads=posix --disable-checking --with-system-zlib --enable-__cxa_atexit --disable-libunwind-exceptions --enable-java-awt=gtk --host=x86_64-redhat-linuxThread model: posix gcc version 3.4.6 20060404 (Red Hat 3.4.6-3)

* ์ปดํŒŒ์ผ๋Ÿฌ์— ๋Œ€ํ•œ ๊ธฐ๋ณธํ™˜๊ฒฝ ๊ตฌ์„ฑ ๋ฐ ํ™˜๊ฒฝ๋ณ€์ˆ˜ ํ•ญ๋ชฉ๋“ค์ด ์ •์ƒ์ ์œผ๋กœ ํ™•์ธ๋˜์—ˆ์œผ๋ฉด ์•„๋ž˜์™€ ๊ฐ™์ด

MPICH ๋ณ‘๋ ฌ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์— ๋Œ€ํ•œ ๊ธฐ๋ณธํ™˜๊ฒฝ ๋ณ€์ˆ˜๋ฅผ ๊ตฌ์„ฑํ•ด ์ค€๋‹ค

# vi ~/.bashrc ...... # .bashrc # User specific aliases and functions alias rm='rm -i' alias cp='cp -i' alias mv='mv -i' # Source global definitions if [ -f /etc/bashrc ]; then . /etc/bashrc fi export MPICH="/usr/local/mpich/1.2.7p1/ip/x86_64/smp/gnu64/ssh" export MPICH_PATH="${MPICH}/bin" export MPICH_LIB="${MPICH}/lib" export PATH="${MPICH_PATH}:${PATH}" export LD_LIBRARY_PATH="${MPICH_LIB}:${LD_LIBRARY_PATH}" MPICH ๋ณ‘๋ ฌ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์— ๋Œ€ํ•œ ๊ธฐ๋ณธํ™˜๊ฒฝ ๊ตฌ์„ฑ * xCAT ๊ธฐ๋ณธ์„ค์น˜ ๋””๋ ‰ํ† ๋ฆฌ์ธ /opt/xcat/build/mpi์— mpich tar๋ฅผ ํŒŒ์ผ์„ ๋ณต์‚ฌํ•˜์—ฌ ์••์ถ•์„ ํ•ด์ œ (ex: NFS๋กœ ๊ณต์œ ๋œ /opt/xcat/build/mpi์— mpimaker ๋ช…๋ น์–ด๋ฅผ ์ด์šฉ, ํ™˜๊ฒฝ๋ณ€์ˆ˜ ์ž๋™๊ตฌ์„ฑ ๊ฐ€๋Šฅ) # source ~/.bash_profile [ํ™˜๊ฒฝ ํ”„๋กœํŒŒ์ผ ์ ์šฉ]

๋‹ค์šด๋กœ๋“œ ๋ฐ›์€ mpich์†Œ์Šค์˜ ์••์ถ•์„ ํ’€๊ณ , ํ•ด๋‹น ์†Œ์Šค์˜ ๋ฃจํŠธ๋กœ ์ด๋™ํ•ฉ๋‹ˆ๋‹ค. ์•„๋ž˜์„œ ์‹ฌ๋ณผ๋ฆญ ๋งํฌ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ์ด์œ ๋Š”, ํ˜น ๋‹ค๋ฅธ ๋ฒ„์ „์˜ mpich๋กœ ๋ณ€๊ฒฝํ•˜๋”๋ผ๋„ ๋งํฌ๋งŒ ๋ณ€๊ฒฝํ•ด์ฃผ๋ฉด, ์‚ฌ์šฉ์ž๋“ค์˜ ํ™˜๊ฒฝ๋ณ€์ˆ˜๋ฅผ ๋ณ€๊ฒฝํ•  ํ•„์š”์—†์ด ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•ด์„œ ์ž…๋‹ˆ๋‹ค. # wget http://wwwโ€unix.mcs.anl.gov/mpi/mpich/downloads/mpich.tar.gz

์„ค์น˜

# cd /usr/local

# tar xvfz mpich.tar.gz

# cd mpich

# ./configure --with-device=ch_p4 -prefix=/usr/local/mpich-1.2.7p1 -c++=pgCC โ‚ฉ

-cc=pgcc -fc=pgf77 -clinker=pgcc -flinker=pgf77 -c++linker=pgCC โ‚ฉ

-f90=pgf90 -f90inc=/usr/pgi/linux86/6.0/include -f90linker=pgf90 โ‚ฉ

-f90libpath=/usr/pgi/linux86/6.0/lib -opt="-fast" -rsh=ssh

# make

# make install

# ln -s /usr/local/mpich-1.2.7p1 /usr/local/mpich

mpich/util/machines/machines์— hostname์„ ๋„ฃ์–ด์ค€๋‹ค. ๋˜๋Š” mpirun โ€“machinefile filename๋กœ ๊ฐ€๋Šฅ.

MPICH ํ…Œ์ŠคํŠธ

# cd mpich/examples/basic

# make cpi

# mpich/bin/mpirun โ€“np 4 cpi

(e.g. mpicc โ€“g โ€“o matrix matrix.c)

Page 32: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 32 -

path๋Š” /etc/profile.d/mpi.sh ์— ๋‹ค์Œ๊ณผ ๊ฐ™์ด ์ž…๋ ฅํ•ด๋‘๋ฉด ๋ณ„๋„์˜ ํ™˜๊ฒฝ๋ณ€์ˆ˜ ์…‹ํŒ…์—†์ด ์‚ฌ์šฉ์ด ๊ฐ€๋Šฅํ•˜๋‹ค.

export MPICH=/usr/local/mpich

export PATH=$PATH:$MPICH/bin

mpimaker ๋ฅผ ์ด์šฉํ•œ MPICH ์ปดํŒŒ์ผ ๋ฐฉ๋ฒ•

./mpimaker mpich-1.2.7p1.tar.gz smp gnu64 ssh

* mmpimaker๋Š” tar์••์ถ•ํ•ด์ œ ๋ฐ Configure, make, make install๊นŒ์ง€ ์ž๋™์œผ๋กœ ์‹คํ–‰์‹œ์ผœ์ค€๋‹ค

์„ค์น˜ ํ™•์ธ

# mpicc -V mpicc for 1.2.7 (release) of : 2005/11/04 11:54:51 Reading specs from /usr/lib/gcc/x86_64-redhat-linux/3.4.6/specs Configured with: ../configure --prefix=/usr --mandir=/usr/share/man --infodir=/usr/share/info --enable-shared --enable-threads=posix --disable-checking --with-system-zlib --enable-__cxa_atexit --disable-libunwind-exceptions --enable-java-awt=gtk --host=x86_64-redhat-linuxThread model: posix gcc version 3.4.6 20060404 (Red Hat 3.4.6-3) /usr/libexec/gcc/x86_64-redhat-linux/3.4.6/collect2 --eh-frame-hdr -m elf_x86_64 -dynamic-linker /lib64/ld-linux-x86-64.so.2 /usr/lib/gcc/x86_64-redhat-linux/3.4.6/../../../../lib64/crt1.o /usr/lib/gcc/x86_64-redhat-linux/3.4.6/../../../../lib64/crti.o /usr/lib/gcc/x86_64-redhat-linux/3.4.6/crtbegin.o -L/usr/local/mpich/1.2.7p1/ip/x86_64/smp/gnu64/ssh/lib -L/usr/lib/gcc/x86_64-redhat-linux/3.4.6 -L/usr/lib/gcc/x86_64-redhat-linux/3.4.6 -L/usr/lib/gcc/x86_64-redhat-linux/3.4.6/../../../../lib64 -L/usr/lib/gcc/x86_64-redhat-linux/3.4.6/../../.. -L/lib/../lib64 -L/usr/lib/../lib64 -lmpich -lpthread -lrt -lgcc --as-needed -lgcc_s --no-as-needed -lc -lgcc --as-needed -lgcc_s --no-as-needed /usr/lib/gcc/x86_64-redhat-linux/3.4.6/crtend.o /usr/lib/gcc/x86_64-redhat-linux/3.4.6/../../../../lib64/crtn.o /usr/lib/gcc/x86_64-redhat-linux/3.4.6/../../../../lib64/crt1.o(.text+0x21): In function `_start': : undefined reference to `main'

collect2: ld returned 1 exit status

๊ฐ ๊ณ„์‚ฐ ๋…ธ๋“œ์—๋„ ๊ฐ™์€ ์œ„์น˜์— ์ธ์Šคํ†จ ํ•˜๊ฑฐ๋‚˜, NFS๋“ฑ์„ ์ด์šฉํ•˜์—ฌ ๊ฐ™์€ ์œ„์น˜์—

mpich๋ฐ PGI๊ฐ€ ์œ„์น˜ํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.

์˜ˆ์ œ ์‹คํ–‰

# cd /usr/local/mpich/examples # make cpi # ./cpi Process 0 on master pi is approximately 3.1416009869231254, Error is 0.0000083333333323

wall clock time = 0.000105

mpirun์„ ํ†ตํ•˜์—ฌ ์‹คํ–‰

๋จผ์ € ๋…ธ๋“œ๋ฅผ ์ง€์ •ํ•˜๊ธฐ ์œ„ํ•ด, ๋จธ์‹ ํŒŒ์ผ์„ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ๋จธ์‹ ํŒŒ์ผ์˜ ์ง€์ •์€ ๋‹ค์Œ๊ณผ ๊ฐ™์œผ๋ฉฐ CPU์˜

์ง€์ •ํƒ€์ž…์€ hostname:n (n์€ ๋ฌผ๋ฆฌ์ ์ธ CPU์˜ ๊ฐฏ์ˆ˜๋ฅผ ์ง€์ •ํ•œ๋‹ค)

# vi /usr/local/mpich/1.2.7p1/ip/x86_64/smp/gnu64/ssh/share/machines.LINUX -------------------------------------------------------------------

Page 33: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 33 -

node001:2 ๊ฐ ๋…ธ๋“œ๋“ค์— ๋Œ€ํ•œ ๋ฌผ๋ฆฌ์ ์ธ CPU์˜ ๊ฐœ์ˆ˜๋ฅผ ์ง€์ •ํ•ด ํ•œ๋‹ค. node002:2 node003:2 node004:2

๊ฐ ๋…ธ๋“œ๋‹น cpu๊ฐœ์ˆ˜ 2๊ฐœ, ์ด4๊ฐœ์˜ CPU๋ฅผ ๋จธ์‹ ํŒŒ์ผ ๋งˆ์Šคํ„ฐ๋…ธ๋“œ์—์„œ๋Š” ์‹คํ–‰ํ•˜์ง€ ์•Š๋Š” ์˜ต์…˜์œผ๋กœ

mpirun์„ ์‹คํ–‰ํ•ฉ๋‹ˆ๋‹ค.

# mpirun -np 4 -machinefile ./share/machines.LINUX -nolocal ./cpi Process 0 on node001 Process 1 on node002 Process 2 on node003 pi is approximately 3.1416009869231241, Error is 0.0000083333333309

wall clock time = 0.000546

Page 34: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 34 -

3.6.3 ATLAS Installation

ATLAS ( Auto Tuning Linear Algebra Subprogram)๋Š” ์ˆ˜์น˜ ์—ฐ์‚ฐ์— ์žˆ์–ด ์‚ฌ์šฉ๋˜์–ด์ง€๊ณ  ์žˆ๋Š” BLAS

(Basic Linear Algebra Subroutine )์„ ๊ทธ ํ•ด๋‹น Computer ์— ๋งž๊ฒŒ ์ž๋™์œผ๋กœ Tuning ํ•˜์—ฌ BLAS

Library ๋ฅผ ๋งŒ๋“ค์–ด ์ฃผ๋Š” Program ์ด๋‹ค. ์ด๊ฒƒ์œผ๋กœ ๋งŒ๋“ค์–ด์ง„ BLAS Library ๋Š” Standard BLAS

Library ๋ณด๋‹ค ์„ฑ๋Šฅ์ด ์›”๋“ฑํ•˜๊ธฐ ๋•Œ๋ฌธ์— Linpack Benchmarking Test ๋ฅผ ํ•  ๊ฒฝ์šฐ์—๋„ ์ด ๊ฒƒ์„ Link ํ•˜์—ฌ

์‚ฌ์šฉํ•œ๋‹ค. ATLAS ๋ฅผ Install ํ•˜๊ธฐ ์ „์— ์šฐ์„ ์ ์œผ๋กœ ๊ณ ๋ คํ•ด์•ผ ํ•  ๊ฒƒ์ด Compiler ์˜ ์ข…๋ฅ˜์ด๋‹ค. ์–ด๋–ค

Compiler ๋กœ Library ๋ฅผ ๋งŒ๋“ค๊ธฐ ์‰ฌ์šด ๋ฐ˜๋ฉด์—, ์–ด๋–ค Compiler ์˜ ๊ฒฝ์šฐ์—์„œ๋Š” Compile ์ด ์‰ฝ๊ฒŒ ๋˜์ง€ ์•Š๋Š”

๊ฒฝ์šฐ๊ฐ€ ์žˆ๋‹ค. ๋”ฐ๋ผ์„œ ๋˜๋„๋ก์ด๋ฉด, gcc-2.95 version ์„ ์‚ฌ์šฉํ•  ๊ฒƒ์„ ๊ถŒ์žฅํ•œ๋‹ค.

BLAS Library ๋ฅผ ๋งŒ๋“ค๊ธฐ ์œ„ํ•œ ์ผ๋ จ์˜

๊ณผ์ •์„ ์•„๋ž˜์— ๋‚˜ํƒ€๋‚ด์—ˆ๋‹ค.

๊ฐ ์งˆ๋ฌธ๋“ค์˜ ์ž์„ธํ•œ ๋‚ด์šฉ์€

www.netlib.org ๋ฅผ ์ฐธ๊ณ ํ•˜๊ธฐ ๋ฐ”๋ž€๋‹ค.

3.6.4 Cache Size

ATLAS ์˜ ์„ฑ๋Šฅ์„ ๋†’์ด๊ธฐ ์œ„ํ•œ ํ•˜๋‚˜์˜

๋ฐฉ๋ฒ•์œผ๋กœ๋Š” Cache Edge Size ๋ฅผ

๋ณ€๊ฒฝํ•˜๋Š” ๊ฒƒ์ด์žˆ๋‹ค. ๋ณดํ†ต Pentium 4 ์˜

๊ฒฝ์šฐ Cache Edge Size ๊ฐ€ 209K ๋กœ

๋˜์–ด์ ธ ์žˆ๋Š”๋ฐ, ์ด๊ฒƒ์˜ ํฌ๊ธฐ๋ฅผ ๋ณ€๊ฒฝํ•˜๋ฉด

Cluster ์˜ ์„ฑ๋Šฅ์„ ๋†’์ผ ์ˆ˜ ์žˆ๋‹ค.

~ /ATLAS/inc lude/<arch>/a tlas_cacheedge .h F ile ์ˆ˜ ์ •

# ifndef ATLAS_CACHEED G E_H #define ATLAS_CACHEEDG E_H #define CacheEdge 210944 -> 393216 (384K )# end if

~ /ATLAS/b in /<arch>/ : m ake xs l3b lasts t, xd l3b lastst, xc l3b lastst, xz l3b lasts t ์‹ค ํ–‰

~ /ATLAS/b in /l3b last.c F ile ์ˆ˜ ์ •

* * By defau lt the m ono- th readed ATLAS rou tines a re tested. To test the * m u lti- th readed ATLAS rou tines, define the fo llow ing m acro : * USE_L3_PTHREAD S : m u lti- th readed ATLAS im p lem enta tion . *///# define USE _F77_BLAS# define USE_L3_REFERENCE // L ine 79

Page 35: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 35 -

3.6.5 GotoBLAS Installation

GotoBLAS ์ฝ”๋“œ๋Š” ํ˜„์žฌ ์„ ํ˜•๋Œ€์ˆ˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ์„œ๋ธŒ๋ฃจํ‹ด์„ ๊ฐ€์žฅ ๋น ๋ฅด๊ฒŒ ๊ตฌํ˜„ํ•˜๊ณ  ๋‹ค์–‘ํ•œ ์•„ํ‚คํ…์ณ๋ฅผ ์ง€

์›ํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์†Œ์Šค ์ฝ”๋“œ์ด๋‹ค.

down load http://www.tacc.utexas.edu/resources/software/

GotoBLAS-1.10.tar.gz ๋‹ค์šด๋กœ๋“œํ›„ /opt/ํด๋”์— ์••์ถ•ํ•ด์ œ

[root@mgt01 opt]# tar xzvf GotoBLAS-1.10.tar.gz [root@mgt01 opt]# cd GotoBLAS [root@mgt01 opt]# vi Makefile.rule ใ€€ GotoBLAS ์„ ํ˜•์•Œ๊ณ ๋ฆฌ์ฆ˜ ์•„ํ‚คํ…์ณ ๋ฃฐ์…‹์ •์˜ Configuration File ์ˆ˜์ • # # Beginning of user configuration # # This library's version REVISION = -r1.06 # Which do you prefer to use for C compiler? Default is gcc. # I recommend you to use GCC because inline assembler is required. C_COMPILER = GNU ## C compiler ์ •์˜ ๋ฆฌ๋ˆ…์Šค GNU ์ปดํŒŒ์ผ๋Ÿฌ ์ •์˜ 64๋น„ํŠธ # C_COMPILER = INTEL # Which do you prefer to use for Fortran compiler? Default is GNU G77. F_COMPILER = G77 ## FORTRAN compiler ์ •์˜ # F_COMPILER = G95 # F_COMPILER = GFORTRAN # F_COMPILER = INTEL # F_COMPILER = PGI # F_COMPILER = PATHSCALE # F_COMPILER = IBM # F_COMPILER = COMPAQ # F_COMPILER = SUN # F_COMPILER = F2C # If you want to build threaded version. # You can specify number of threads by environment value # "OMP_NUM_THREADS", otherwise, it's automatically detected. # SMP = 1 โ€“> ์‹œ์Šคํ…œํ™˜๊ฒฝ์ด SMP ํ™˜๊ฒฝ์ด๊ฑฐ๋‚˜ Thread Library๋ฅผ ์š”๊ตฌํ•  ๋•Œ ์˜ต์…˜ ์„ค์ • # You may specify Maximum number of threads. It should be minimum. # MAX_THREADS = 4 Maximum threads ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์˜ ๊ฐœ์ˆ˜๋ฅผ ์ •์˜ํ•จ # If you need 64bit binary; some architecture can accept both 32bit and # 64bit binary(EM64T, Opteron, SPARC and Power/PowerPC). # BINARY64 = 1 # If you want to drive whole 64bit region by BLAS. Not all Fortran # compiler supports this. It's safe to keep comment it out if you # are not sure. INTERFACE64 = 1 ## 64bit binary only # If you need Special memory management; # Using HugeTLB file system(Linux / AIX / Solaris) # CCOMMON_OPT += -DALLOC_HUGETLB # Using static allocation instead of dynamic allocation

Page 36: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 36 -

# You can't use it with ALLOC_HUGETLB together # CCOMMON_OPT += -DALLOC_STATIC # If you want to use CPU affinity # CCOMMON_OPT += -DUSE_CPU_AFFINITY # If you want to use memory affinity (for NUMA) # You can't use it with ALLOC_STATIC together # NUMA_AFFINITY = 1 # If you want to use pure thread server model. # Default is only OMP_NUM_THREADS - 1 threads are spawned to reduce # thread overhead. This is not implemented yet. # CCOMMON_OPT += -DALL_THREADED # Use busy wait instead of pthread_lock implemenation # BLAS perforamnce should be better, but total performance would be # worse due to extra usage of CPU # CCOMMON_OPT += -DUSE_BUSYWAIT # If you have special compiler to run script to determine architecture. GETARCH_CC = gcc GETARCH_FLAGS = # End of user configuration [root@mgt01 GotoBLAS]# make # generating normal library [root@mgt01 GotoBLAS]# make prof # generating profile enabled library (linux only) [root@mgt01 GotoBLAS]# make clean [root@mgt01 GotoBLAS]# cd exports; make so # Shared library ์ƒ์„ฑ [root@mgt01 GotoBLAS]# ls libgoto.a, libgoto_core2-r1.06.a, libgoto_core2-r1.06.so ํŒŒ์ผ ์ƒ์„ฑ ํ™•์ธ

Sample Installation Script

#!/bin/bash export CC=gcc export FC=g77 rm -rf *.log rm -rf *.so tar xfz GotoBLAS-1.14.tar.gz echo "Complete .. Unarchive" cp Makefile.rule GotoBLAS/ cd GotoBLAS make -j4 1>../make.log 2>../make_err.log echo "Complete .. Make" make prof -j4 1>../prof.log 2>../prof_err.log echo "Complete .. Profile" cd exports make so 1>../../so.log 2>../../so_err.log echo "Complete .. Build Dynamic Library" cd .. cp *.so ../ cd .. #rm -rf GotoBLAS echo "Complete .. Build GotoBLAS"

Page 37: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 37 -

3.6.7 HPL Installation HPL์€ ๋งŽ์€ ์‚ฌ์šฉ์ž์šฉ option์ด ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ์ด option์— ๋”ฐ๋ผ์„œ๋„ ์„ฑ๋Šฅ์˜ ์ฐจ์ด๊ฐ€ ๋งŽ์ด ๋‚  ์ˆ˜๋„ ์žˆ๋‹ค. ๋˜ํ•œ Network์˜ ์„ฑ๋Šฅ์ฐจ์ด ๋˜ํ•œ Cluster Computer์˜ ์„ฑ๋Šฅ์— ์ฃผ์š”ํ•œ ์š”์ธ์ด ๋œ๋‹ค. HPL์„ Installํ•œ๋‹ค๋Š” ์ด์•ผ๊ธฐ๋Š” ๊ณณ Compileํ•˜๋‹ค๋Š” ์ด์•ผ๊ธฐ์™€ ๊ฐ™๋‹ค. ์ฆ‰ hpl์„ Compileํ•˜์—ฌ execution file์„ ๋งŒ๋“ค๊ณ  ์ด๊ฒƒ์˜ input file์ธ HPL.dat file ์„ ์ˆ˜์ •ํ•˜์—ฌ Cluster Super Computer์˜ ์„ฑ๋Šฅ์„ ํ‰๊ฐ€ ํ•  ์ˆ˜ ์žˆ๋‹ค. hpl Compile์€ ATLAS์˜ Compile๊ณผ ์œ ์‚ฌํ•˜๊ฒŒ ~/hpl/setup Directory์— ์žˆ๋Š” ์ ๋‹นํ•œ Make.<arch> file์„ ์„ ํƒํ•˜์—ฌ, ์ˆ˜์ •ํ•œ ํ›„์—, make arch=<arch> ๋ช…๋ น์„ ์‹คํ–‰ ์‹œํ‚ค๋ฉด ~/hpl/bin/<arch> Directory์— HPL.dat file ๊ณผ execution file (xhpl)์ด ์ƒ๊ธด๋‹ค. ๋‹ค์Œ์€ Make.<arch>file์˜ ์˜ˆ์ด๋‹ค.

MPI PATH & Compile OPTION

BLAS LIBRARY PATH & Link

Page 38: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 38 -

HPL์˜ ์„ค์น˜์™€ ๊ด€๋ จ๋œ ํ•ญ๋ชฉ๋“ค์€ ์ƒ์œ„๋‹จ์ธ ๋ณ‘๋ ฌ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์ธ MPICH๊ฐ€ ์ •์ƒ์ ์œผ๋กœ ์„ค์น˜๋˜์—ˆ์Œ์„ ๊ฐ€์ •ํ•˜

๊ณ  ์ง„ํ–‰ํ•œ๋‹ค.

Step 1.

HPL์˜ ์†Œ์Šค์ฝ”๋“œ๋ฅผ ๋‹ค์šด๋กœ๋“œ ๋ฐ›์•„ ์„ค์น˜ํ•˜๊ณ ์ž ํ•˜๋Š” Path์— ์œ„์น˜ํ•˜์—ฌ ์••์ถ•ํ™”์ผ์„ ํ•ด์ œํ•ด ์ค€๋‹ค

http://www.netlib.org/benchmark/hpl/hpl.tgz

์ผ๋ฐ˜์ ์ธ HPL์˜ ์†Œ์Šค์ฝ”๋“œ์˜ ์••์ถ•ํ•ด์ œ์˜ ์œ„์น˜๋Š” /opt์•ˆ์— ์„ค์น˜ํ•œ๋‹ค.

Step 2.

์••์ถ•ํ•ด์ œ๋œ HPL ๋””๋ ‰ํ† ๋ฆฌ ์ด๋™ํ•˜์—ฌ Machine config ํŒŒ์ผ์„ ๋ณต์‚ฌํ•˜์—ฌ ์ƒ์œ„๋””๋ ‰ํ† ๋ฆฌ์— ์œ„์น˜ํ›„์—

Config๋ฅผ ์„ค์ •ํ•œ๋‹ค. /opt/hpl/setup ๋””๋ ‰ํ† ๋ฆฌ์— Make.Linux_PII_FB_LAS๋ฅผ Make.Linux๋กœ Rename

ํ•˜์—ฌ ์ƒ์œ„๋””๋ ‰ํ† ๋ฆฌ์— ๋ณต์‚ฌํ•ด ๋„ฃ์€๋‹ค.

[root@hpl/setup]cp Make.Linux_PII_FBLAS ./make.Linux

Step 3.

๋ณต์‚ฌํ•œ Make.Linux config ํŒŒ์ผ์„ ์‹œ์Šคํ…œ์˜ ์„ค์ •์— ๋งž๊ฒŒ ์žฌ๊ตฌ์„ฑ ํ•œ๋‹ค. HPL์˜ ์„ค์ •์€ GotoBLAS

๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๊ฐ€ ์ •์ƒ์ ์œผ๋กœ ์„ค์น˜๊ฐ€ ์™„๋ฃŒ๋˜์—ˆ๋‹ค๋Š” ๊ฐ€์ •ํ•˜์— ์ง„ํ–‰ํ•œ๋‹ค

* HPL ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ์ •์˜ ---------------------------------------------------------------------- # - HPL Directory Structure / HPL library # ---------------------------------------------------------------------- # TOPdir = /opt/hpl hpl์ด ์„ค์น˜๋œ ๋””๋ ‰ํ† ๋ฆฌ์œ„์น˜ ์ •์˜ INCdir = $(TOPdir)/include BINdir = $(TOPdir)/bin/$(ARCH) LIBdir = $(TOPdir)/lib/$(ARCH) # HPLlib = $(LIBdir)/libhpl.a /opt/hpl/lib ๋””๋ ‰ํ† ๋ฆฌ๋ฅผ ํ™•์ธํ•˜์—ฌ ํŒŒ์ผ์กด์žฌ ํ™•์ธ * MPICH ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ์ •์˜ # ---------------------------------------------------------------------- # - Message Passing library (MPI) # ---------------------------------------------------------------------- # MPinc tells the C compiler where to find the Message Passing library # header files, MPlib is defined to be the name of the library to be # used. The variable MPdir is only used for defining MPinc and MPlib. # MPdir = /usr/local/mpich/1.2.7p1/ip/x86_64/smp/gnu64/ssh MPICH ๋””๋ ‰ํ† ๋ฆฌ ์ •์˜ MPinc = -I$(MPdir)/include MPlib = $(MPdir)/lib/libmpich.a # *Goto ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ์ •์˜ ----------------------------------------------------------------------

Page 39: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 39 -

# - Linear Algebra library (BLAS or VSIPL) # ---------------------------------------------------------------------- # LAinc tells the C compiler where to find the Linear Algebra library # header files, LAlib is defined to be the name of the library to be # used. The variable LAdir is only used for defining LAinc and LAlib. # LAdir = /home/Goto/GotoBLAS GotoBLAS ๋””๋ ‰ํ† ๋ฆฌ ์ •์˜ LAinc = LAlib = -L$(LAdir)/libgoto.a $(LAdir)/libgoto.a *GNU Compiler ์ •์˜ ---------------------------------------------------------------------- # - Compilers / linkers - Optimization flags --------------------------- # ---------------------------------------------------------------------- # CC = /usr/local/mpich/1.2.7p1/ip/x86_64/smp/gnu64/ssh/bin/mpicc

mpicc ์ปดํŒŒ์ผ๋Ÿฌ์— ๋Œ€ํ•œ ์ •์˜ CCNOOPT = $(HPL_DEFS) CCFLAGS = $(HPL_DEFS) -fomit-frame-pointer -O3 -funroll-loops # # On some platforms, it is necessary to use the Fortran linker to find # the Fortran internals used in the BLAS library. # LINKER = /usr/local/mpich/1.2.7p1/ip/x86_64/smp/gnu64/ssh/bin/mpif77

mpi Fortran77์— ๋Œ€ํ•œ ์ •์˜๋ฅผ ์ง€์ • LINKFLAGS = $(CCFLAGS) -lm # ARCHIVER = ar ARFLAGS = r RANLIB = echo # # ---------------------------------------------------------------------- # - Platform identifier ------------------------------------------------ # ---------------------------------------------------------------------- # ARCH = Linux ๋ฆฌ๋ˆ…์Šค ํ”Œ๋žซํผ์— ๋Œ€ํ•œ ์ •์˜๋ฅผ ์ง€์ •ํ•˜์—ฌ ์ค€๋‹ค.

Step4.

๋ชจ๋“  ์„ค์ •์ด ์™„๋ฃŒ๋œ ํ›„์— make arch=Linux ๋ช…๋ น์„ ์ด์šฉํ•˜์—ฌ ์ง€์ •๋œ GotoBLAS ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์™€

HPL ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์— ๋Œ€ํ•œ ์ปดํŒŒ์ผ์„ ์ง„ํ–‰ํ•œ๋‹ค.

Page 40: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 40 -

3.7 Run HPL

3.7.1 Compile Linpack Benchmarking ์€ ์œ„์—์„œ๋„ ์„ค๋ช…ํ–ˆ๋˜ ๊ฒƒ๊ณผ ๊ฐ™์ด Cluster์˜ ์„ฑ๋Šฅ์„ ํ‰๊ฐ€ ํ•  ์ˆ˜ ์žˆ๋‹ค. MPICH๋ฅผ ์ด์šฉํ•˜์—ฌ MPI Job์„ ์‹คํ–‰์‹œํ‚ค๊ธฐ ์œ„ํ•ด์„œ๋Š” ๋จผ์ € Compile์„ mpich์˜ mpicc Command๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ์•ผ ํ•œ๋‹ค. ๊ทธ ์ด์™ธ์˜ Compile ๊ณผ์ •์€ ๋™์ผํ•˜๋‹ค. 3.7.2 Send the Execution file Linpack Benchmarking์—์„œ ๊ฐ Node์—์„œ ํ•„์š”๋กœ ํ•˜๋Š” execution file์€ xhpl๋ฟ์ด๋‹ค. ๋”ฐ๋ผ์„œ scp๋‚˜ psh๋ช…๋ น์–ด๋ฅผ ์ด์šฉํ•˜์—ฌ ๊ฐ Node์˜ ๋™์ผ Directory๋กœ ๋ณต์‚ฌ๋ฅผ ํ•œ๋‹ค. 3.7.3 Preparation for MPI Job Linpack Benchmarking test๋ฅผ ํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” HPL์˜ input file์ธ HPL.dat file์˜ ๊ฐ parameter๋“ค์— ๋Œ€ํ•ด์„œ ์•Œ์•„์•ผ ํ•˜๋Š”๋ฐ, ์ž์„ธํ•œ ๋‚ด์šฉ์€ hpl Directory๋‚ด์˜ TUNING file์— ์ž˜ ์ ํ˜€์ ธ ์žˆ์œผ๋ฏ€๋กœ ์—ฌ๊ธฐ์„œ๋Š” Linpack Benchmarking test์˜ ์„ฑ๋Šฅ์— ์ฃผ์š”ํ•˜๊ฒŒ ์˜ํ–ฅ์„ ์ฃผ๋Š” ๋ช‡ ๊ฐ€์ง€ ์š”์†Œ์— ๋Œ€ํ•˜์—ฌ๋งŒ ์„ค๋ช… ํ•˜๋„๋ก ํ•˜๊ฒ ๋‹ค. ๋‹ค์Œ์€ HPL.dat ์˜ ์˜ˆ์ด๋‹ค.

HPL.dat file์˜ parameter ๋“ค์€ Linpack Benchmarking์˜ ์„ฑ๋Šฅ์— ์ง€๋Œ€ํ•œ ์˜ํ–ฅ์„ ์ค€๋‹ค. ๋”ฐ๋ผ์„œ User๊ฐ€ ๊ฐ€์ง€๊ณ  ์žˆ๋Š” Cluster์˜ ํ™˜๊ฒฝ์— ๋งž๊ฒŒ ์ž˜ ์ˆ˜์ •ํ•ด ์ฃผ์–ด์•ผ๋งŒ ์ข‹์€ ์„ฑ๋Šฅ์„ ๋‚ผ ์ˆ˜ ์žˆ๋‹ค. MPICH๊ฐ€ LAM/MPI์™€ ๋‹ค๋ฅธ ์  ์ค‘์˜ ํ•˜๋‚˜๋Š” lamd๋ฅผ ์‹คํ–‰์‹œํ‚ค๊ธฐ ์œ„ํ•ด lamboot๋ฅผ ์‹คํ–‰ ์‹œํ‚ฌ ํ•„์š”๊ฐ€ ์—†์œผ๋ฉฐ, ์‚ฌ์šฉ๋  Node์˜ ์ •๋ณด๋Š” lamhosts file์„ ์ด์šฉํ•˜๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ mpi Directory/share/machines.LINUX file์„ ์ด์šฉํ•œ๋‹ค. ์•„๋ž˜๋Š” ๊ทธ file์˜ ์˜ˆ์ด๋‹ค.

Matrix Size์— ๊ด€ํ•ด์„œ test ํ•  ํšŸ์ˆ˜ : ์ด ๊ฒฝ์šฐ 10000์— ๋Œ€ํ•ด์„œ 1๋ฒˆ

๊ฐ Node๋ฅผ ์œ„ํ•ด Partition ๋˜์–ด์ง„ Sub Matrix Block์˜ Size : ์ด ๊ฒฝ์šฐ 104 ์— ๋Œ€ํ•ด์„œ 1๋ฒˆ test

Network์™€ ๊ด€๋ จ๋œ parameter๋กœ์„œ Broad Casting ๋ฐฉ๋ฒ•์„ ์„ ํƒํ•œ๋‹ค.: ์ด ๊ฒฝ์šฐ์— 0๋ฒˆ ๋ฐฉ๋ฒ•์„ ์จ์„œ 1๋ฒˆ test

์‚ฌ์šฉ๋˜์–ด์งˆ CPU์˜ ๊ฐœ์ˆ˜ (P*Q=CPU ๊ฐœ์ˆ˜): ์ด ๊ฒฝ์šฐ 2*2 ์— ๋Œ€ํ•ด 1๋ฒˆ test ( cpu 4๊ฐœ ์‚ฌ์šฉ)

Page 41: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 41 -

3.7.4 xhpl ์‹คํ–‰

# mpirun -np 7 xhpl ==================================================================== HPLinpack 1.0 -- High-Performance Linpack benchmark -- September 27, 2000 Written by A. Petitet and R. Clint Whaley, Innovative Computing Labs., UTK ==================================================================== An explanation of the input/output parameters follows: T/V : Wall time / encoded variant. N : The order of the coefficient matrix A. NB : The partitioning blocking factor. P : The number of process rows. Q : The number of process columns. Time : Time in seconds to solve the linear system. Gflops : Rate of execution for solving the linear system. The following parameter values will be used: N : 10000 NB : 85 P : 1 Q : 7 PFACT : Crout NBMIN : 4 NDIV : 2 RFACT : Right BCAST : 1ringM DEPTH : 1 SWAP : Mix (threshold = 64) L1 : transposed form U : transposed form EQUIL : yes ALIGN : 8 double precision words ============================================================================ T/V N NB P Q Time Gflops ---------------------------------------------------------------------------- W11R2C4 10000 85 1 7 70.49 9.460e+00 ---------------------------------------------------------------------------- ||Ax-b||_oo / ( eps * ||A||_1 * N ) = 0.0646673 ...... PASSED ||Ax-b||_oo / ( eps * ||A||_1 * ||x||_1 ) = 0.0153022 ...... PASSED ||Ax-b||_oo / ( eps * ||A||_oo * ||x||_oo ) = 0.0034203 ...... PASSED ============================================================================ ์œ„์˜ ๊ฒฐ๊ณผ๋Š” N = 10000 , NB = 85 ์ผ๋•Œ 9.46Gflops ๊ฐ€ ๋‚˜์™”๋‹ค. LINPACK ๊ณผ ๋งˆ์ฐฌ๊ฐ€์ง€๋กœ ์—ฌ๋Ÿฌ๋ถ„์˜ ์‹œ์Šคํ…œ ํ™˜๊ฒฝ์— ๋งž๊ฒŒ problem size ์™€ NB ๋ฅผ ์ ์ ˆํžˆ ์ˆ˜์ •ํ•ด ๊ฐ€๋ฉด์„œ ์‹œ์Šคํ…œ์ด ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋Š” ์ตœ๊ณ ์„ฑ๋Šฅ์„

Page 42: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 42 -

์ด๋Œ์–ด ๋‚ด๋ณด์ž. HPL.dat ํŒŒ์ผ์„ ์ˆ˜์ • ํ•œ ๋‹ค์Œ ์ปดํŒŒ์ผ์„ ๋‹ค์‹œ ํ•œ๋‹ค. # rm -f ./xhpl # cd ../../ # make clean # make all

Page 43: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 43 -

4. GPFS (General Parallel File System)

4.1 GPFS ๊ฐœ์š”

GPFS(General Parallel File System)๋ž€?

General Parallel File System(GPFS)์€ ํด๋Ÿฌ์Šคํ„ฐ ๋‚ด์˜ ๋ชจ๋“  ๋…ธ๋“œ๋กœ ๋ถ€ํ„ฐ ๋ฐ์ดํ„ฐ ์ ‘๊ทผ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๋Š”

๊ณ ์„ฑ๋Šฅ ๊ณต์œ  ๋””์Šคํฌ ํŒŒ์ผ ์‹œ์Šคํ…œ์ด๋‹ค. ๋ณ‘๋ ฌ ๋˜๋Š” ์ง๋ ฌ ๋“ฑ ์–ด๋–ค ์œ ํ˜•์˜ ์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜์„ ์‚ฌ์šฉํ•˜๋”๋ผ๋„ ํ‘œ

์ค€ ์œ ๋‹‰์Šค ํŒŒ์ผ ์‹œ์Šคํ…œ ์ธํ„ฐํŽ˜์ด์Šค๋ฅผ ํ†ตํ•ด ๊ณต์œ  ํŒŒ์ผ

์— ์‰ฝ๊ฒŒ ์ ‘๊ทผํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ๋ณต์ˆ˜ ๋…ธ๋“œ ์—์„œ๋„ ๊ฐ™์€ ํŒŒ

์ผ์— ๋™์‹œ์— ์ ‘๊ทผํ•  ์ˆ˜ ์žˆ๋‹ค. GPFS๋Š” ๋กœ๊น…(logging)๊ณผ

๋ณต์ œ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ, failover ๊ตฌ์„ฑ์„ ํ†ตํ•ด ๋””์Šคํฌ ๋˜๋Š”

์„œ๋ฒ„ ์˜ค์ž‘๋™ ์‹œ์—๋„ ๊ณ ๊ฐ€์šฉ์„ฑ์„ ๋ณด์žฅํ•œ๋‹ค.

๊ณต์œ  ํŒŒ์ผ ์‹œ์Šคํ…œ ์†”๋ฃจ์…˜

๋ณต์ˆ˜๊ฐœ์˜ ๋…ธ๋“œ๊ฐ€ ํŒŒ์ผ์‹œ์Šคํ…œ์˜ ํŒŒ์ผ์„ ๊ณต์œ ํ•  ์ˆ˜ ์žˆ๋„

๋ก ์ง€์›

Shared SAN Filesystem

Shared SAN์„ ํ†ตํ•ด ๊ณต์œ  ๋…ธ๋“œ๋“ค์ด ๋ฐ์ดํƒ€์™€ ๋ฌผ๋ฆฌ์ 

์œผ๋กœ ์ง์ ‘ ์—ฐ๊ฒฐ ํ•จ์œผ๋กœ์จ ๋†’์€ ์„ฑ๋Šฅ๊ณผ ์•ˆ์ •์„ฑ ์ œ๊ณต ๊ฐ€

๋Šฅ

ํ‘œ์ค€ UNIXยฎ ํŒŒ์ผ ์‹œ์Šคํ…œ ์ธํ„ฐํŽ˜์ด์Šค ์ œ๊ณต

๊ธฐ์กด ์œ ๋‹‰์Šค ํŒŒ์ผ์‹œ์Šคํ…œ๊ณผ ๋™์ผํ•˜๊ฒŒ ํŒŒ์ผ์„ ์ƒ์„ฑ/๋ณ€๊ฒฝ

/์‚ญ์ œ. GPFS๋‚ด์˜ ํŒŒ์ผ์„ ๊ด€๋ฆฌํ•˜๊ธฐ ์œ„ํ•ด ๋ณ„๋„ ๋ฐฉ๋ฒ•์„

์Šต๋“ํ•  ํ•„์š” ์—†์Œ.

4.2 GPFS ํ™˜๊ฒฝ

GPFS short history & evolution

1996๋…„ Tiger Shark File system โ€“ ๋ฉ€ํ‹ฐ๋ฏธ๋””์–ด video server๋กœ ๊ตฌํ˜„๋จ

1998๋…„ GPFS 1.1 ์ถœ์‹œ โ€“ IBM SP switch์ด์šฉํ•œ ๊ณ ์† ์Šค์œ„์น˜ ๋„คํŠธ์›Œํฌ์— ๊ตฌํ˜„

2001๋…„ ์ตœ๋Œ€ 32 ๋…ธ๋“œ ์ง€์›, ๋ฆฌ๋ˆ…์Šค ์ง€์›ํ•˜๋Š” Linux์šฉ GPFS 1.1 ์ถœ์‹œ

2002๋…„ GPFS 2.1 ์ถœ์‹œ , GPFS 1.3 for Linux ์ถœ์‹œ

2003๋…„ 12์›” GPFS 2.2 ์ถœ์‹œ - xSeries Linux, pSeries Linux, pSeries AIX

2004๋…„ 12์›” GPFS 2.2 ๊ธฐ๋Šฅํ–ฅ์ƒ โ€“ AIX-Linux interoperability ๊ธฐ๋Šฅ ์ถ”๊ฐ€

2004๋…„ ์ƒˆ๋กœ์šด ํ•˜๋“œ์›จ์–ด ์ง€์›๊ณผ ๊ธฐ๋Šฅ ํ–ฅ์ƒ๋œ GPFS 2.3 ์ถœ์‹œ

2006๋…„ GPFS V3.1 ์ถœ์‹œ

Page 44: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 44 -

AIX์™€ Linux system์˜ Concurrent Sharing ์ง€์›

GPFS architecture๋Š” AIX์™€ Linux ์„œ๋ฒ„๊ฐ€ SAN based storage์— ๋™์‹œ ์—ฐ๊ฒฐํ•˜์—ฌ ํŒŒ์ผ ๊ณต์œ ํ•˜๋„๋ก ์ง€์›

ํ•˜๋“œ์›จ์–ด์ ์œผ๋กœ Linux์™€ AIX์—์„œ ๋™์‹œ ์ ‘์†/๊ณต์œ ๊ฐ€ ๊ฐ€๋Šฅํ•œ ๊ฒฝ์šฐ ์ ์šฉ ๊ฐ€๋Šฅ

GPFS for AIX

- H/W

IBM system p5 server, IBM eServer JS20 BladeCenter server JS21 BladeCenter server.

- S/W

GPFS V3.1.0.1 : AIX V5.3 or later

GPFS V2.3.0.12 : AIX V5.2 or later

GPFS for Linux

- H/W

IBM x86 xSeriesยฎ server, IBM BladeCenter server, IBM for AMD processor-based

Server Linux on POWERโ„ข product ( System p5 servers, BladeCenter server, OpenPowerโ„ข server)

- S/W

๊ตฌ์ถ•ํ•˜๊ณ ์ž ํ•˜๋Š” system type (POWER, x86_64, IA_32)์— ๋”ฐ๋ผ ์ง€์› ๊ฐ€๋Šฅ ์ปค๋„ ๋ ˆ๋ฒจ ๊ฒฐ์ •๋จ.

Red Hat EL 4.0 (Update 3), Red Hat EL 3.0 (Update 7)

SUSE LINUX ES 9.0 (SP3), SUSE LINUX ES 8.0 (SP4)

4.3 GPFS ํŠน์„ฑ

High Scalable File system ์ˆ˜๋ฐฑ๊ฐœ์˜ ๋…ธ๋“œ ๋™์‹œ ์ ‘์†๊ณผ 1000๊ฐœ ์ด์ƒ์˜ ๋””์Šคํฌ๋กœ ๊ตฌ์„ฑ๋œ ์ˆ˜๋ฐฑ TB

์šฉ๋Ÿ‰์„ ์ง€์› ์ด๋ก ์  ์ตœ๋Œ€ ์šฉ๋Ÿ‰ 2^99 bytes๊นŒ์ง€ ์ง€์› ๊ฐ€๋Šฅํ•˜๋ฉฐ ํ˜„์žฌ 200

Terabytes๊นŒ์ง€ ๊ฒ€์ฆ Filesystem๋‹น ์ตœ๋Œ€ 2,147,483,648๊ฐœ ํŒŒ์ผ๊นŒ์ง€ ์ƒ์„ฑ

๊ฐ€๋Šฅ(๊ตฌ์„ฑ์— ๋”ฐ๋ผ ์ƒ์ด)

High availability ์ผ๋ถ€ ๋…ธ๋“œ ์žฅ์• ์‹œ์—๋„ ๋‹ค์šดํƒ€์ž„ ์—†์ด ํŒŒ์ผ์‹œ์Šคํ…œ ๊ณต์œ ๋กœ ์ง€์†์ ์œผ๋กœ ์„œ๋น„

์Šค ๊ฐ€๋Šฅ

High performance Shared SAN์„ ํ†ตํ•ด ๊ณต์œ  ๋…ธ๋“œ๋“ค์ด ๋ฐ์ดํƒ€์™€ ๋ฌผ๋ฆฌ์ ์œผ๋กœ ์ง์ ‘ ์—ฐ๊ฒฐ ํ•จ์œผ

๋กœ์จ ๋†’์€ ์„ฑ๋Šฅ๊ณผ ์•ˆ์ •์„ฑ ์ œ๊ณต ๊ฐ€๋Šฅ

High Flexibility NSD (Network Shared Disk)๊ธฐ๋ฐ˜์˜ ๋””์Šคํฌ ์ œ๊ณต์œผ๋กœ, ์ง์ ‘์ ์ธ SAN ์—ฐ

๊ฒฐ ์—†์ด๋„ GPFS ๊ธฐ๋ฐ˜ํ•˜์˜ ํŒŒ์ผ์‹œ์Šคํ…œ ๊ณต์œ  ๊ฐ€๋Šฅ. ๋”ฐ๋ผ์„œ ์„œ๋ฒ„ ํ™•์žฅ์ด ์šฉ

์ดํ•˜๋ฉฐ, ์šฉ๋„๋ณ„, I/O ์šฉ๋Ÿ‰๋ณ„ ์œ ์—ฐํ•œ ๊ตฌ์„ฑ ๊ฐ€๋Šฅ

High stability HACMP ํ•„์š”์—†์ด GPFS ์†”๋ฃจ์…˜ ๋งŒ์œผ๋กœ ํŒŒ์ผ์‹œ์Šคํ…œ ๊ณต์œ ๊ฐ€ ๊ฐ€๋Šฅํ•ด ์ง์œผ

๋กœ์จ, ๊ตฌ์„ฑ์˜ ๋‹จ์ˆœํ™”, ๊ด€๋ฆฌ์˜ ๋‹จ์ˆœํ™”๋กœ ์ธํ•œ ์•ˆ์ •์„ฑ ํ–ฅ์ƒ (GPFS v2.3์ด

์ƒ)

Automatic

Journaling(Logging)

of Metadata

ํŒŒ์ผ์‹œ์Šคํ…œ ๋ฉ”ํƒ€๋ฐ์ดํƒ€๋ฅผ ๊ฐ ๊ณต์œ  ๋””์Šคํฌ์— ์ €์žฅํ•˜๊ณ , ์žฅ์• ์‹œ ์‹ ์†ํ•œ ๋ณต

๊ตฌ๊ฐ€ ๊ฐ€๋Šฅํ•˜๋„๋ก ์ง€์›

Concurrent File Access ๋ฐ ์ •๊ตํ•œ Lock ๊ด€๋ฆฌ๋กœ ๋ฐ์ดํƒ€ ์ ‘๊ทผ ์ถฉ๋Œ/์ƒ์ถฉ์„ ์˜ˆ๋ฐฉํ•˜๊ณ , ํ•˜๋‚˜์˜ ํŒŒ์ผ ์—ฌ

Page 45: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 45 -

Block Level Locking ์ œ๊ณต ๋Ÿฌ ํด๋ผ์ด์–ธํŠธ๊ฐ€ ๋™์‹œ์— ์ ‘๊ทผํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•จ. Byte range Lock๊ด€๋ฆฌ๋กœ,

ํ•œ ํŒŒ์ผ๋‚ด ๊ฒน์น˜์ง€ ์•Š๋Š” ๋ถ€๋ถ„์— ๋Œ€ํ•œ ๋™์‹œ ์—…๋ฐ์ดํŠธ ๊ฐ€๋Šฅ. ๊ฒน์น˜๋Š” ์˜์—ญ์—

๋Œ€ํ•œ write/update์š”์ฒญ์‹œ ์š”์ฒญ ์ˆœ์„œ์— ๋”ฐ๋ผ serializing ํ•˜์—ฌ Lock ๊ถŒํ•œ

์ œ๊ณต GPFS๋Š” ์ตœ๋Œ€ 300,000๊ฐœ์˜ ํ† ํฐ์„ ๋™์‹œ์— ๊ด€๋ฆฌํ•  ์ˆ˜ ์žˆ์Œ.

Data Protection GPFS ํŒŒ์ผ์‹œ์Šคํ…œ๊ณผ metadata์— ๋Œ€ํ•œ ๋ฏธ๋Ÿฌ๋ง ๊ธฐ๋Šฅ ์ œ๊ณต์œผ๋กœ ์•ˆ์ •์„ฑ ํ™•๋ณด

Application์— ์ตœ์ ํ™”๋œ

block-size ์ง€์ • ๊ฐ€๋Šฅ

์šด์˜ ์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜์˜ ํŠน์„ฑ์— ๋งž๋„๋ก 16K ~ 1024K๊นŒ์ง€ ๋‹ค์–‘ํ•œ block

size๋ฅผ ์ง€์ •ํ•˜์—ฌ ์ตœ์ ํ™” ํ•  ์ˆ˜ ์žˆ์Œ

Block Level striping ํŒŒ์ผ์‹œ์Šคํ…œ ์ƒ์„ฑ์‹œ ์ง€์ •ํ•œ block size๋‹จ์œ„๋กœ ์ž๋™ Stripingํ•จ์œผ๋กœ์จ I/O

์„ฑ๋Šฅ ์ตœ๋Œ€ํ™” ์ œ๊ณต

Online ์ƒ์—์„œ์˜ Disk add /

remove ๊ฐ€๋Šฅ

์‹œ์Šคํ…œ ์šด์˜์ค‘์— ๋””์Šคํฌ ์ถ”๊ฐ€/์‚ญ์ œ๊ฐ€ ๊ฐ€๋Šฅํ•˜๊ณ , ์ถ”๊ฐ€/์‚ญ์ œ์‹œ ๋™์ ์œผ๋กœ ๋ฐ

์ดํƒ€๋ฅผ ์žฌ๋ถ„๋ฐฐ ํ•ด ์คŒ์œผ๋กœ์จ, I/O balancing์„ ์œ ์ง€

ํ‘œ์ค€ UNIXยฎ ํŒŒ์ผ ์‹œ์Šคํ…œ

์ธํ„ฐํŽ˜์ด์Šค ์ œ๊ณต

๊ธฐ์กด ์œ ๋‹‰์Šค ํŒŒ์ผ์‹œ์Šคํ…œ๊ณผ ๋™์ผํ•˜๊ฒŒ ํŒŒ์ผ์„ ์ƒ์„ฑ/๋ณ€๊ฒฝ/์‚ญ์ œ. GPFS๋‚ด์˜ ํŒŒ

์ผ์„ ๊ด€๋ฆฌํ•˜๊ธฐ ์œ„ํ•ด ๋ณ„๋„ ๋ฐฉ๋ฒ•์„ ์Šต๋“ํ•  ํ•„์š” ์—†์Œ

4.4 GPFS ์„ค์น˜

4.4.1 GPFS ์„ค์น˜ ์ „ ์ค€๋น„์‚ฌํ•ญ

GPFS์— ํฌํ•จ๋  ๋ชจ๋“  nodes ์— Linux OS ์„ค์น˜

๊ฐ node๋“ค /etc/hostsํŒŒ์ผ ์ž‘์„ฑ

root ์‚ฌ์šฉ์ž๊ฐ€ ๊ฐ nodes์—๊ฒŒ password prompt ์—†์ด ์ ‘์† ๊ฐ€๋Šฅํ•˜๊ฒŒ SSH ๊ตฌ์„ฑ

GPFS nodes ์‹œ๊ฐ„ ๋™๊ธฐํ™”๋ฅผ ์œ„ํ•ด NTP์„œ๋ฒ„ ๊ตฌ์„ฑ ๊ถŒ์žฅ

GPFS nodes์— ์™ธ์žฅ ์Šคํ† ๋ฆฌ์ง€ ๋ณผ๋ฅจ ์ธ์‹

4.4.2 GPFS ์„ค์น˜ ๋ฐ Portability layer ๋นŒ๋“œ

GPFS ์„ค์น˜ ๋””๋ ‰ํ† ๋ฆฌ ์ƒ์„ฑ

mkdir /gpfslpp

CD-ROM์œผ๋กœ๋ถ€ํ„ฐ GPFS ์ œํ’ˆ ์ด๋ฏธ์ง€ ๋ณต์‚ฌ gpfs_install-3.1*

[root@blade1 gpfslpp]# ./gpfs_install-3.1.* [root@blade1 gpfslpp]# ls gpfs.base-3.1*.rpm gpfs.gpl-3.1*.noarch.rpm gpfs.msg.en_US-3.1*.noarch.rpm gpfs.docs-3.1*.noarch.rpm

[root@blade1 gpfslpp]# rpm โ€“ivh gpfs*

Page 46: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 46 -

node profile ์— ํ™˜๊ฒฝ ๋ณ€์ˆ˜ ์ถ”๊ฐ€

PATH=$PATH:$HOME/bin:/usr/lpp/mmfs/bin

MANPATH=$MANPATH:/usr/lpp/mmfs/messages

Portability layer Build ํ•˜๊ธฐ

cd /usr/lpp/mmfs/src

export SHARKCLONEROOT=/usr/lpp/mmfs/src

cd /usr/lpp/mmfs/src

make Autoconfig

make World

make InstallImages

[root@blade1 ~]# make InstallImages cd gpl-linux; /usr/bin/make InstallImages; make[1]: Entering directory `/usr/lpp/mmfs/src/gpl-linux' mmfslinux mmfs26 lxtrace dumpconv tracedev make[1]: Leaving directory `/usr/lpp/mmfs/src/gpl-linux'

๊ฐ GPFS nodes์— ์ƒ์„ฑ๋œ ํŒŒ์ผ ๋ณต์‚ฌ

[root@blade1 ~]# cd /usr/lpp/mmfs/bin [root@blade1 bin]# scp mmfslinux mmfs26 lxtrace dumpconv tracedev gpfs2:/usr/lpp/mmfs/bin /* Copy to all the other nodes */

4.5 GPFS ๊ตฌ์„ฑ

4.5.1 GPFS Test ์‹œ์Šคํ…œ ๊ตฌ์„ฑ๋„

Page 47: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 47 -

4.5.2 GPFS๊ตฌ์„ฑ ์‹œ ๊ณ ๋ ค์‚ฌํ•ญ

GPFS node ์ •์˜

NodeName:NodeDesignations:AdminNodeName

optional, โ€ณ-โ€ณ node roles ๊ตฌ๋ถ„

manager | client โ€“ GPFS node ๋ฅผ ์ •์˜ ํ•  ๋•Œ manager node ํ˜น์€ client node์— ๋Œ€ํ•œ node pool์„

์ •์˜ํ•œ๋‹ค.

quorum | nonquorum โ€“ Node pool์—์„œ node๊ฐ€ quorum ์•„๋‹ˆ๋ฉด non-quorum node๋กœ ๋™์ž‘ ํ•  ๊ฒƒ์ธ์ง€

์— ๋Œ€ํ•œ node role ์ •์˜. ์ •์˜ ํ•˜์ง€ ์•Š์œผ๋ฉด non-quorum node๋กœ ๋™์ž‘

GPFS Cluster primary์™€ secondary node๋Š” ๋ฐ˜๋“œ์‹œ quorum nodes ๊ตฌ์„ฑ ํ•  ๊ฒƒ์„ ๊ถŒ์žฅํ•œ๋‹ค.

GPFS NSD ์ƒ์„ฑ ์˜ต์…˜ ํ™•์ธ

DiskName:PrimaryServer:BackupServer:DiskUsage:FailureGroup:DesiredName:StoragePool

DiskName โ€“ NSD๋กœ ์ •์˜๋  block device ์ด๋ฆ„ ex) /dev/sdb, /dev/hdisk1

PrimaryServer โ€“ Primary NSD ์„œ๋ฒ„ ์ด๋ฆ„

BackupServer โ€“ Backup NSD ์„œ๋ฒ„ ์ด๋ฆ„

DiskUsage โ€“ Disk Usage ์ •์˜

- dataAndMetadata โ€“ disk๊ฐ€ data์™€ metadata ๋‘˜ ๋‹ค ๊ฐ€์ง„๋‹ค. default๊ฐ’

- dataOnly โ€“ disk๊ฐ€ data๋งŒ ๊ฐ€์ง„๋‹ค.

- metadataOnly โ€“ disk๊ฐ€ metadata๋งŒ ๊ฐ€์ง„๋‹ค

- descOnly โ€“ disk๊ฐ€ data ๋ฐ metadata ๋ชจ๋‘๋ฅผ ๊ฐ€์ง€์ง€ ์•Š๊ณ  ๋‹จ์ง€ file system descriptor์˜ ๋ณต์‚ฌ๋ณธ๋งŒ

๊ฐ€์ง„๋‹ค. DR ๊ตฌ์„ฑ์‹œ์— third failure group์œผ๋กœ ์‚ฌ์šฉ๋จ

FailureGroup โ€“ disk replication์„ ์œ„ํ•œ failureGroup number ์ง€์ •

DesiredName โ€“ NSD disk ์ด๋ฆ„ ์ง€์ • ์ง€์ •ํ•˜์ง€ ์•Š์œผ๋ฉด gpfsNNnsdํ˜•์‹์œผ๋กœ ์ž๋™์ƒ์„ฑ

StoragePool โ€“ NSD๊ฐ€ ํ• ๋‹น๋œ Storage Pool ์ด๋ฆ„ ์ง€์ •

ex) /dev/sdb:gpfs_primary:gpfs_secondary:dataAndMetadata:1::stgpool01

GPFS file system ์ƒ์„ฑ ์˜ต์…˜ ํ™•์ธ

File System ์ƒ์„ฑ options Default value -F DiskDesc ํŒŒ์ผ ์‹œ์Šคํ…œ ๋””์Šคํฌ์ •์˜ none -A {yes | no | automount} yes โ€“ gpfs ๋ฐ๋ชฌ์ด ์‹œ์ž‘๋  ๋•Œ ์ž๋™ ๋งˆ์šดํŠธ no โ€“ ์ˆ˜๋™ ๋งˆ์šดํŠธ automount โ€“ ํŒŒ์ผ์‹œ์Šคํ…œ ์ฒ˜์Œ์œผ๋กœ access๋  ๋•Œ ๋งˆ์šดํŠธ

yes

-B BlockSize 16K,64K,256K,512K,1024K or 1M data block size

256K

-E {yes | no} mtime ๊ฐ’ ๊ธฐ๋ก yes -m DefaultMetadataReplicas 1

Page 48: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 48 -

metadata replication ๊ฐ’ {1 | 2} 1 โ€“ metadata replication ํ•˜์ง€ ์•Š์Œ 2 โ€“ metadata replication -M MaxMetadataReplicas metadata replication ๊ฐ’ {1 | 2}

1 โ€“ metadata replication ํ•˜์ง€ ์•Š์Œ 2 โ€“ metadata replication

1

-n NumNodes ๋งˆ์šดํŠธ ๋˜๋Š” nodes ์ˆซ์ž 32 -o MountOptions none -Q {yes | no} to active quota no -r DefaultDataReplicas data replication ๊ฐ’ {1 | 2}

1 โ€“ data replication ํ•˜์ง€ ์•Š์Œ 2 โ€“ data replication

yes

-R MaxDataReplicas data replication ๊ฐ’ {1 | 2}

1 โ€“ metadata replication ํ•˜์ง€ ์•Š์Œ 2 โ€“ metadata replication

yes

-v {yes | no} disk usage ๊ฒ€์ฆ yes

4.5.3 GPFS Cluster ๊ตฌ์„ฑ

GPFS Cluster ์ƒ์„ฑ ๋ฐ ํ™•์ธ

[root@blade1 gpfs]# mmcrcluster โ€“n nodefile โ€“p blade1 โ€“s blade2 โ€“r /usr/bin/ssh โ€“R /usr/bin/scp โ€“C gpfs_test ! -p primary server โ€“s secondary server nodefile ์˜ ๋‚ด์šฉ > blade1:manager-quorum blade2:manager-quorum blade3:manager-quorum

blade4:client [root@gpfs1 gpfslpp]# mmlscluster GPFS cluster information ======================== GPFS cluster name: test.gpfs GPFS cluster id: 13882422822655908522 GPFS UID domain: test.gpfs Remote shell command: /usr/bin/ssh Remote file copy command: /usr/bin/scp GPFS cluster configuration servers: ------------------------------------------- Primary server: gpfs1 Secondary server: blade2.cluster.net Node Daemon node name IP address Admin node name Designation ----------------------------------------------------------------------------------------------------------------------- 1 blade1 192.168.70.1 blade1 quorum-manager 2 blade2.cluster.net 192.168.70.2 blade2.cluster.net quorum-manager 3 blade3.cluster.net 192.168.70.3 blade3.cluster.net quorum-manager 4 blade4.cluster.net 192.168.70.4 blade4.cluster.net

Page 49: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 49 -

NSD Disk ์ƒ์„ฑ ๋ฐ ํ™•์ธ

[root@blade1 gpfslpp]# mmcrnsd -F diskfile -v no diskfile ์˜ ๋‚ด์šฉ > /dev/sdb:blade1:blade2:dataAndMetadata:1:: /dev/sdc:blade1:blade2:dataAndMetadata:1:: /dev/sdd:blade1:blade2:dataAndMetadata:1:: /dev/sde:blade1:blade2:dataAndMetadata:1:: /dev/sdf:blade2:blade1:dataAndMetadata:2:: /dev/sdg:blade2:blade1:dataAndMetadata:2:: /dev/sdh:blade2:blade1:dataAndMetadata:2:: /dev/sdi:blade2:blade1:dataAndMetadata:2:: [root@blade1 gpfslpp]# mmlsnsd File system Disk name Primary node Backup node --------------------------------------------------------------------------- (free disk) gpfs1nsd blade1 blade2.cluster.net (free disk) gpfs2nsd blade1 blade2.cluster.net (free disk) gpfs3nsd blade1 blade2.cluster.net (free disk) gpfs4nsd blade1 blade2.cluster.net (free disk) gpfs5nsd blade2.cluster.net blade1 (free disk) gpfs6nsd blade2.cluster.net blade1 (free disk) gpfs7nsd blade2.cluster.net blade1 (free disk) gpfs8nsd blade2.cluster.net blade1 * ์•„์ง filesystem ์ด ๋งŒ๋“ค์–ด ์ง€์ง€ ์•Š์•„ File system๋ถ€๋ถ„์ด free disk๋กœ ๋‚˜ํƒ€๋‚œ๋‹ค

File System ์ƒ์„ฑ ๋ฐ mount

file system์„ ์ƒ์„ฑ ํ•˜๊ธฐ ์ „์— ๋จผ์ € mmstartup โ€“a ๋ช…๋ น์œผ๋กœ gpfs ๋ฐ๋ชฌ์„ ์‹œ์ž‘ํ•ด์•ผ ํ•œ๋‹ค.

nsd ๋””์Šคํฌ ์ƒ์„ฑ ์ดํ›„ diskfile ์˜ ๋‚ด์šฉ > # /dev/sdb:blade1:blade2:dataAndMetadata:1:: gpfs1nsd:::dataAndMetadata:1:: # /dev/sdc:blade1:blade2:dataAndMetadata:1:: gpfs2nsd:::dataAndMetadata:1:: # /dev/sdd:blade1:blade2:dataAndMetadata:1:: gpfs3nsd:::dataAndMetadata:1:: # /dev/sde:blade1:blade2:dataAndMetadata:1:: gpfs4nsd:::dataAndMetadata:1:: # /dev/sdf:blade2:blade1:dataAndMetadata:2:: gpfs5nsd:::dataAndMetadata:2:: # /dev/sdg:blade2:blade1:dataAndMetadata:2:: gpfs6nsd:::dataAndMetadata:2:: # /dev/sdh:blade2:blade1:dataAndMetadata:2:: gpfs7nsd:::dataAndMetadata:2:: # /dev/sdi:blade2:blade1:dataAndMetadata:2:: # mmcrnsd๋กœ nsd์ƒ์„ฑ ํ›„ diskfile ๋‚ด์šฉ์ด ํŒŒ์ผ์‹œ์Šคํ…œ ์ƒ์„ฑ์— ํ•„์š” ๊ฐ’์œผ๋กœ ๋ฐ”๋€๋‹ค. [root@blade1 gpfslpp]# mmcrfs /gpfs1 gpfs1 -F diskfile -B 1024K -m1 -M1 -r1 -R1 # replication ๊ธฐ๋Šฅ์„ ์‚ฌ์šฉ ํ•˜๊ธฐ์œ„ํ•ด์„œ๋Š” -m2 โ€“M2 โ€“r2 โ€“R2 ๋กœ ์˜ต์…˜์„ ๋ณ€๊ฒฝํ•ด์•ผ ํ•œ๋‹ค. [root@blade1 gpfslpp]# mmmount gpfs1 โ€“a [root@blade1 gpfslpp]# mmlsmount all -L File system gpfs1 is mounted on 4 nodes: 192.168.70.1 blade1 192.168.70.2 blade2.cluster.net 192.168.70.3 blade3.cluster.net 192.168.70.4 blade4.cluster.net

Page 50: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 50 -

File system gpfs2 is mounted on 4 nodes: 192.168.70.2 blade2.cluster.net 192.168.70.3 blade3.cluster.net 192.168.70.1 blade1

192.168.70.4 blade4.cluster.net

[[root@blade1 gpfslpp]# mmlsnsd File system Disk name Primary node Backup node --------------------------------------------------------------------------- gpfs1 gpfs1nsd blade1 blade2.cluster.net gpfs1 gpfs2nsd blade1 blade2.cluster.net gpfs1 gpfs3nsd blade1 blade2.cluster.net gpfs1 gpfs4nsd blade1 blade2.cluster.net (free disk) gpfs5nsd blade2.cluster.net blade1 (free disk) gpfs6nsd blade2.cluster.net blade1 (free disk) gpfs7nsd blade2.cluster.net blade1 (free disk) gpfs8nsd blade2.cluster.net blade1

4.6 GPFS (GENERAL PARALLEL FILE SYSTEM) ์šด์˜ ๋ฐ ๊ด€๋ฆฌ

4.6.1 GPFS ํŒŒ์ผ์‹œ์Šคํ…œ ํ™•์ธ ๋ฐ ๊ด€๋ฆฌ

ํŒŒ์ผ์‹œ์Šคํ…œ ์ƒ์„ฑ ํ™•์ธ ๋ช…๋ น mmlsdisk, mmlsfs

[root@blade1 gpfslpp]# mmlsdisk gpfs1 disk driver sector failure holds holds storage name type size group metadata data status availability pool ------------ -------- ------ ------- -------- ----- ------------- ------------ ------------ gpfs1nsd nsd 512 1 yes yes ready up system gpfs2nsd nsd 512 1 yes yes ready up system gpfs3nsd nsd 512 1 yes yes ready up system gpfs4nsd nsd 512 1 yes yes ready up system [root@blade1 gpfslpp]# mmlsfs gpfs1 flag value description ---- -------------- ----------------------------------------------------- -s roundRobin Stripe method -f 32768 Minimum fragment size in bytes -i 1024 Inode size in bytes -I 32768 Indirect block size in bytes -m 1 Default number of metadata replicas -M 1 Maximum number of metadata replicas -r 1 Default number of data replicas -R 1 Maximum number of data replicas -j cluster Block allocation type -D posix File locking semantics in effect -k posix ACL semantics in effect -a 1048576 Estimated average file size -n 32 Estimated number of nodes that will mount file system -B 1048576 Block size -Q none Quotas enforced none Default quotas enabled -F 417792 Maximum number of inodes -V 9.03 File system version. Highest supported version: 9.03 -u yes Support for large LUNs? -z no Is DMAPI enabled?

Page 51: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 51 -

-E yes Exact mtime mount option -S no Suppress atime mount option -K whenpossible Strict replica allocation option -P system Disk storage pools in file system -d gpfs8nsd;gpfs9nsd;gpfs10nsd;gpfs11nsd Disks in file system -A yes Automatic mount option -o none Additional mount options -T /gpfs1 Default mount point

File System ์‚ฌ์šฉ๊ณต๊ฐ„ ํ™•์ธํ•˜๊ธฐ

mmdf ๋ช…๋ น์–ด๋ฅผ ์ด์šฉํ•˜์—ฌ ๋ชจ๋“  GPFS ๋””์Šคํฌ๋“ค์˜ ์šฉ๋Ÿ‰ ํ™•์ธ

ex) mmdf gpfs1

file system mount ๋ฐ umount

GPFS ํŒŒ์ผ์‹œ์Šคํ…œ์„ ์ƒ์„ฑํ•  ๋•Œ ์ž๋™์œผ๋กœ ๋งˆ์šดํŠธ๋˜๋Š” ์˜ต์…˜(๋””ํดํŠธ๊ฐ’)์œผ๋กœ ์ƒ์„ฑํ•˜๋ฉด GPFS ๋ฐ๋ชฌ์ด ์‹œ์ž‘๋ 

๋•Œ ์ž๋™์œผ๋กœ ๋งˆ์šดํŠธ๋œ๋‹ค.

mmmount gpfs1 โ€“a (์ „ nodes์— mount ํ•  ๋•Œ -a์˜ต์…˜ ์‚ฌ์šฉ)

mmumount gpfs1 -a

File System ์ œ๊ฑฐํ•˜๊ธฐ

GPFS ํŒŒ์ผ์‹œ์Šคํ…œ์„ ์ œ๊ฑฐํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๋จผ์ € ํŒŒ์ผ์‹œ์Šคํ…œ์„ unmountํ•ด์•ผ ํ•œ๋‹ค.

GPFS ํŒŒ์ผ์‹œ์Šคํ…œ์€ mmdelfs ๋ช…๋ น์–ด๋กœ ์ œ๊ฑฐํ•  ์ˆ˜ ์žˆ๋‹ค.

mmdelfs ๋ช…๋ น์–ด

mmdelfs gpfs1 โ€“p (์—ฌ๊ธฐ์„œ p ์˜ต์…˜์€ ํŒŒ์ผ์‹œ์Šคํ…œ์ •๋ณด๊ฐ€ ์ž˜๋ชป๋˜์—ˆ๊ฑฐ๋‚˜ ๋””์Šคํฌ๊ฐ€ availableํ•œ ์ƒํƒœ๊ฐ€ ์•„

๋‹ˆ๋ผ๋„ ๊ฐ•์ œ๋กœ ์‚ญ์ œํ•  ์ˆ˜ ์žˆ๋‹ค.)

File System ์˜ค๋ฅ˜ ์ฒดํฌ ๋ฐ ๋ณต๊ตฌํ•˜๊ธฐ

gpfs์˜ mmfsck ๋ช…๋ น์€ online, offline ๋‘๊ฐ€์ง€ ๋ชจ๋“œ์—์„œ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.

ํŒŒ์ผ์‹œ์Šคํ…œ์ด ๋งˆ์šดํŠธ๋˜์–ด ์žˆ๋Š” online์ƒํƒœ์—์„œ๋Š” โ€“o ์˜ต์…˜์„ ์‚ฌ์šฉํ•˜์—ฌ ํŒŒ์ผ์‹œ์Šคํ…œ์„ ์ฒดํฌํ•  ์ˆ˜ ์žˆ๋‹ค. ๋ฐ˜

๋Œ€๋กœ ํŒŒ์ผ์‹œ์Šคํ…œ์ด unmount๋œ ์ƒํƒœ์—์„œ๋Š” offline์œผ๋กœ ํŒŒ์ผ์‹œ์Šคํ…œ์„ ์ฒดํฌํ•  ์ˆ˜ ์žˆ๋‹ค.

Online ๋ชจ๋“œ๋Š” ํŒŒ์ผ์‹œ์Šคํ…œ์ด ๋งˆ์šดํŠธ๋œ ์ƒํƒœ์—์„œ ๋ฐฐ์น˜๋˜์ง€ ๋ชปํ•œ ๋ธ”๋Ÿญ๋“ค์„ ์ฒดํฌํ•˜๊ณ  ๋ณต๊ตฌํ•œ๋‹ค. ์ด์™ธ์˜

๋ฌธ์ œ๋Š” ์ฒดํฌํ•˜์—ฌ ๋ณด๊ณ ๋งŒ ํ•˜๊ณ  ๋ณต๊ตฌ๋Š” ํ•˜์ง€ ์•Š๋Š”๋‹ค.

์œ„์™€ ๋‹ค๋ฅธ ์˜ค๋ฅ˜๋“ค์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ํŒŒ์ผ์‹œ์Šคํ…œ์ด unmount๋œ offline๋ชจ๋“œ์—์„œ mmfsck๋ช…๋ น์„ ์‹คํ–‰

ํ•˜์—ฌ์•ผ ํ•œ๋‹ค.

Disk์ƒํƒœ๊ฐ€ down์ƒํƒœ๋ผ๋ฉด mmfsck ๋ช…๋ น์„ ์‚ฌ์šฉํ•  ์ˆ˜ ์—†๋‹ค. ๋จผ์ € mmlsdisk๋ช…๋ น์–ด๋กœ ๋””์Šคํฌ ์ƒํƒœ๋ฅผ ํ™•

์ธํ•˜๊ณ  down๋œ ๋””์Šคํฌ๊ฐ€ ์žˆ์œผ๋ฉด mmchdisk๋ช…๋ น์œผ๋กœ unrecovered ๋‚˜ up์ƒํƒœ๋กœ ๋งŒ๋“ ๋‹ค.

mmfsck ๋ช…๋ น์–ด

ex) mmfsck gpfs1 โ€“n (n์˜ต์…˜์€ ํŒŒ์ผ์‹œ์Šคํ…œ ๋ณต๊ตฌ๋ฅผ ์‹œ๋„ํ•˜์ง€ ์•Š๊ณ  ์ฒดํฌ๋งŒํ•œ๋‹ค.)

File System ์†์„ฑ ๋ณ€๊ฒฝํ•˜๊ธฐ

mmchfs ๋ช…๋ น์–ด๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ GPFS ํŒŒ์ผ์‹œ์Šคํ…œ์˜ ์†์„ฑ์„ ๋ฐ”๊ฟ€ ์ˆ˜ ์žˆ๋‹ค.

Usage:

Page 52: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 52 -

mmchfs Device [-A {yes | no | automount}] [-D {posix | nfs4}] [-E {yes | no}]

[-F MaxNumInodes] [-k {posix | nfs4 | all}]

[-K {no | whenpossible | always}]

[-m DefaultMetadataReplicas] [-o MountOptions]

[-Q {yes | no}] [-r DefaultDataReplicas] [-S {yes | no}]

[-T Mountpoint] [-V] [-z {yes | no}]

or

mmchfs Device [-W NewDeviceName]

File System Restriping ํ•˜๊ธฐ

GPFS๋Š” ํŒŒ์ผ์„ writeํ•˜๋ฉด์„œ striping์„ ํ•œ๋‹ค. ๊ทธ๋Ÿฐ๋ฐ ๋””์Šคํฌ๋ฅผ ์ถ”๊ฐ€ํ•œ ํ›„์— performance๋ฅผ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ

์œ„ํ•ด์„œ๋Š” mmrestripefs ๋ช…๋ น์–ด๋ฅผ ์ด์šฉํ•˜์—ฌ ํŒŒ์ผ์„ restripeํ•  ์ˆ˜ ์žˆ๋‹ค. ๋””์Šคํฌ ๋ณต์ œ(replication)๋ฅผ ํ•˜

์ง€ ์•Š์€ ์ƒํƒœ๋ผ๋ฉด โ€“m ์˜ต์…˜์„ ์‚ฌ์šฉํ•˜์—ฌ fail๋œ ๋””์Šคํฌ๋กœ๋ถ€ํ„ฐ ๋ฐ์ดํ„ฐ๋ฅผ ์•ˆ์ „ํ•˜๊ฒŒ ๋ณต๊ตฌํ•  ์ˆ˜ ์žˆ๋‹ค. ๋งˆ์ฐฌ๊ฐ€

์ง€๋กœ โ€“r ์˜ต์…˜๋„ ๊ฐ™์€ ํšจ๊ณผ๋ฅผ ๋‚ผ ์ˆ˜ ์žˆ๋‹ค. ๋””์Šคํฌ ๋ณต์ œ(replication)๋ฅผ ํ•œ ์ƒํƒœ๋ผ๋ฉด โ€“b(rebalance) ์˜ต์…˜์œผ

๋กœ replication data๋„ restripeํ•  ์ˆ˜ ์žˆ๋‹ค.

mmrestripefs ๋ช…๋ น์–ด

ex) mmrestripefs gpfs1 โ€“r

File System ๋‹จํŽธํ™” ์ •๋ณด ๋ณด๊ธฐ ๋ฐ ์ตœ์ ํ™”

ํŒŒ์ผ์‹œ์Šคํ…œ์— ๋ฐ์ดํ„ฐ๋ฅผ ๊ธฐ๋กํ•  ๋•Œ fragmented block์œผ๋กœ ์กฐ๊ฐ๋‚˜๋Š” ๊ฒƒ์€ ํ”ผํ•  ์ˆ˜ ์—†๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ์ด๋Ÿฐ ์กฐ

๊ฐ๋“ค์ค‘์— ๋ฐ์ดํ„ฐ๋ฅผ ๊ฐ€๋“์ฑ„์šฐ์ง€ ๋ชปํ•˜๊ณ  block์˜ ์ผ๋ถ€๋งŒ ์‚ฌ์šฉํ•˜๋Š” ๋‹จํŽธ๋“ค๋„ ์ƒ๊ธฐ๊ฒŒ ๋œ๋‹ค. ์ด๋Ÿฐ ๋‹จํŽธ๋“ค์€

๋””์Šคํฌ ๊ณต๊ฐ„์„ ๋‚ญ๋น„ํ•˜๊ฒŒ ๋˜๋Š” ์š”์ธ์ค‘์˜ ํ•˜๋‚˜๊ฐ€ ๋œ๋‹ค. ์ด๋ ‡๊ฒŒ ๋””์Šคํฌ ๊ณต๊ฐ„์„ ๋‚ญ๋น„ํ•˜๋Š” fragmented block

์— ์กฐ๊ฐ๋ชจ์Œ์„ ํ•˜์—ฌ ๋””์Šคํฌ ๊ณต๊ฐ„์˜ ํšจ์œจ์„ ๋†’์ด๊ธฐ ์œ„ํ•ด mmdefragfs ๋ช…๋ น์–ด๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค.

ํŒŒ์ผ์‹œ์Šคํ…œ์˜ ๋‹จํŽธํ™” ์ •๋„๋ฅผ ์•Œ๊ธฐ ์œ„ํ•ด์„œ๋Š” mmdefragfs ๋ช…๋ น์–ด์— โ€“i ์˜ต์…˜์„ ์‚ฌ์šฉํ•œ๋‹ค.

File System ์ตœ์ ํ™”

Block์˜ ์‚ฌ์šฉ์œจ์„ ๋†’์ด๊ธฐ ์œ„ํ•œ ์กฐ๊ฐ๋ชจ์Œ์„ ํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” mmdefragfs ๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•œ๋‹ค.

-u ์˜ต์…˜์œผ๋กœ ๋ชฉํ‘œ๋กœ ํ•˜๋Š” block ์‚ฌ์šฉ์œจ (Block utilization )์„ ์ง€์ •ํ•  ์ˆ˜ ์žˆ๋‹ค.

mmdefragfs ๋ช…๋ น์–ด

ex) mmdefragfs gpfs1

File System ๋””์Šคํฌ ์ถ”๊ฐ€

ํŒŒ์ผ์‹œ์Šคํ…œ์— ํŒŒ์ผ๋“ค์ด ์ฆ๊ฐ€ํ•˜์—ฌ ํŒŒ์ผ์‹œ์Šคํ…œ ๊ณต๊ฐ„์„ ๋Š˜๋ ค์•ผ ํ•  ๋•Œ GPFS๋Š” ๋น ๋ฅด๊ฒŒ ๋””์Šคํฌ๋ฅผ ์ถ”๊ฐ€ํ•˜์—ฌ

ํŒŒ์ผ์‹œ์Šคํ…œ ๊ณต๊ฐ„์„ ๋Š˜๋ฆด ์ˆ˜ ์žˆ๋‹ค. ์ด๋•Œ mmadddisk ๋ช…๋ น์–ด๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค.

root@blade1 gpfslpp]# mmlsnsd -F File system Disk name Primary node Backup node --------------------------------------------------------------------------- (free disk) gpfs5nsd blade2.cluster.net blade1 (free disk) gpfs6nsd blade2.cluster.net blade1 (free disk) gpfs7nsd blade2.cluster.net blade1 (free disk) gpfs8nsd blade2.cluster.net blade1

Page 53: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 53 -

* free disk ๋กœ ๋‚จ์•„์žˆ๋Š” gpfs5nsd ๋””์Šคํฌ๋ฅผ gpfs1 ํŒŒ์ผ ์‹œ์Šคํ…œ์— ์ถ”๊ฐ€ ํ•  ์˜ˆ์ • [root@blade1 gpfslpp]# mmadddisk gpfs1 gpfs5nsd:::dataAndMetadata:1:: -r -v no * ์—ฌ๊ธฐ์„œ r ์˜ต์…˜์€ ๋””์Šคํฌ๋ฅผ ์ถ”๊ฐ€ํ•œ ๋‹ค์Œ rebalance ํ•˜๋ผ๋Š” ์˜ต์…˜์ž„ [root@blade1 gpfslpp]# mmlsnsd File system Disk name Primary node Backup node --------------------------------------------------------------------------- gpfs1 gpfs1nsd blade1 blade2.cluster.net gpfs1 gpfs2nsd blade1 blade2.cluster.net gpfs1 gpfs3nsd blade1 blade2.cluster.net gpfs1 gpfs4nsd blade1 blade2.cluster.net gpfs1 gpfs5nsd blade2.cluster.net blade1 (free disk) gpfs6nsd blade2.cluster.net blade1 (free disk) gpfs7nsd blade2.cluster.net blade1 (free disk) gpfs8nsd blade2.cluster.net blade1 * mmlsnd ๋ช…๋ น์œผ๋กœ gpfs1 ํŒŒ์ผ ์‹œ์Šคํ…œ์— ๋””์Šคํฌ๊ฐ€ ์ถ”๊ฐ€ ๋œ ๊ฒƒ์„ ํ™•์ธ ํ•  ์ˆ˜ ์žˆ๋‹ค.

File System์—์„œ ๋””์Šคํฌ ์ œ๊ฑฐ

๋””์Šคํฌ๋ฅผ ์ œ๊ฑฐํ•˜๊ธฐ ์ „์— mmdf ๋ช…๋ น์–ด๋กœ ๋””์Šคํฌ๋ฅผ ์‚ญ์ œํ•ด๋„ ํŒŒ์ผ์‹œ์Šคํ…œ์— ์—ฌ์œ ๊ณต๊ฐ„์ด ๋‚จ๋Š”์ง€ ํŒŒ์ผ์‹œ์Šค

ํ…œ์˜ ๋‚จ์•„์žˆ๋Š” ์šฉ๋Ÿ‰์„ ํ™•์ธํ•ด์•ผ ํ•œ๋‹ค. ๋””์Šคํฌ๋ฅผ ์‚ญ์ œํ•˜๋ ค๋ฉด ๋‚จ์•„์žˆ๋Š” ์šฉ๋Ÿ‰์ด ์‚ญ์ œํ•  ๋””์Šคํฌ ์šฉ๋Ÿ‰์˜

150%์ด์ƒ์ด์–ด์•ผ ํ•œ๋‹ค.

* ์•ž์—์„œ ์ถ”๊ฐ€ํ•œ gpfs5nsd๋””์Šคํฌ๋ฅผ gpfs1 ํŒŒ์ผ ์‹œ์Šคํ…œ์—์„œ ์ œ๊ฑฐ [root@blade1 gpfslpp]# mmdeldisk gpfs1 gpfs5nsd โ€“r [root@blade1 gpfslpp]# mmlsnsd File system Disk name Primary node Backup node --------------------------------------------------------------------------- gpfs1 gpfs1nsd blade1 blade2.cluster.net gpfs1 gpfs2nsd blade1 blade2.cluster.net gpfs1 gpfs3nsd blade1 blade2.cluster.net gpfs1 gpfs4nsd blade1 blade2.cluster.net (free disk) gpfs5nsd blade2.cluster.net blade1 (free disk) gpfs6nsd blade2.cluster.net blade1 (free disk) gpfs7nsd blade2.cluster.net blade1 (free disk) gpfs8nsd blade2.cluster.net blade1 * gpfs5nsd ๋””์Šคํฌ๊ฐ€ free disk๋กœ ๋ณ€๊ฒฝ ๋œ ๊ฒƒ ์„ ํ™•์ธ ํ•  ์ˆ˜ ์žˆ๋‹ค.

4.6.2 GPFS Node ๊ด€๋ฆฌ

GPFS Cluster์— nodes ์ถ”๊ฐ€ ๋ฐ ์ œ๊ฑฐ

mmaddnoe๋ช…๋ น์œผ๋กœ gpfs cluster์— ์ƒˆ๋กœ์šด node๋ฅผ ์ถ”๊ฐ€ ์‹œํ‚ฌ ์ˆ˜๋„ ์žˆ๊ณ  mmdelnode ๋ช…๋ น์œผ๋กœ ์‚ญ์ œ ํ• 

์ˆ˜ ๋„ ์žˆ๋‹ค.

ex) mmaddnode blade4

mmdelnode blade4

mmlsnode โ€“a ๋ช…๋ น์œผ๋กœ cluster nodes ํ™•์ธ

File System ๊ด€๋ฆฌ nodes ๋ณ€๊ฒฝ ๋ฐ ํ™•์ธ

ํŒŒ์ผ์‹œ์Šคํ…œ์„ ๊ด€๋ฆฌํ•˜๋Š” ๋…ธ๋“œ ์ •๋ณด๋Š” mmlsmgr ๋ช…๋ น์–ด๋กœ ๋ณผ ์ˆ˜ ์žˆ๋‹ค.

mmlsmgr

mmlsmgr gpfs1

Page 54: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 54 -

์œ„ ๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•˜๋ฉด ์•„๋ž˜์™€ ๊ฐ™์ด ํŒŒ์ผ์‹œ์Šคํ…œ์˜ device name๊ณผ ํ•ด๋‹น ํŒŒ์ผ์‹œ์Šคํ…œ์„ ๊ด€๋ฆฌํ•˜๋Š” ๋…ธ๋“œ๊ฐ€

์–ด๋–ค ๋…ธ๋“œ์ธ์ง€๋ฅผ ๋ณด์—ฌ์ค€๋‹ค.

๋˜ํ•œ ํŒŒ์ผ์‹œ์Šคํ…œ์„ ๊ด€๋ฆฌํ•˜๋Š” ๋…ธ๋“œ๋ฅผ mmchmgr ๋ช…๋ น์–ด๋กœ ์›ํ•˜๋Š” ๋…ธ๋“œ๋กœ ๋ฐ”๊ฟ€ ์ˆ˜ ์žˆ๋‹ค.

mmchmgr

mmchmgr gpfs1 blade3

์œ„ ๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•˜๋ฉด ์•„๋ž˜์™€ ํ•ด๋‹น ํŒŒ์ผ์‹œ์Šคํ…œ์„ ๊ด€๋ฆฌํ•˜๋Š” ๋…ธ๋“œ๊ฐ€ blade1์—์„œ blade2๋กœ ๋ฐ”๋€ ๊ฒƒ์„ ์•Œ

์ˆ˜ ์žˆ๋‹ค.

GPFS Daemon ์‹œ์ž‘ ๋ฐ ์ค‘์ง€

GPFS๋Š” mmstartup ๋ช…๋ น์–ด๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์‹œ์ž‘ํ•  ์ˆ˜ ์žˆ๋‹ค.

mmstartup โ€“a

์œ„ ๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•˜๋ฉด cluster์— ์žˆ๋Š” ๋ชจ๋“  ๋…ธ๋“œ์—์„œ GPFS ๋ฐ๋ชฌ์ด ์‹œ์ž‘๋œ๋‹ค.

mmstartup โ€“N NodeName

์œ„ ๋ช…๋ น์ด ์‹คํ–‰๋˜๋ฉด ๋ช…๋ น์„ ์‹คํ–‰ํ•œ ๋…ธ๋“œ์˜ GPFS ๋ฐ๋ชฌ๋งŒ ์˜ฌ๋ผ์˜จ๋‹ค.

GPFS๋Š” mmshutdown ๋ช…๋ น์–ด๋กœ ์ค‘์ง€์‹œํ‚ฌ ์ˆ˜ ์žˆ๋‹ค.

mmshutdown โ€“a

์œ„ ๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•˜๋ฉด ๋ชจ๋“  ๋…ธ๋“œ์—์„œ GPFS ๋ฐ๋ชฌ์ด ์ค‘์ง€๋œ๋‹ค.

mmshutdown โ€“N NodeName

์œ„ ๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•˜๋ฉด ํ•ด๋‹น ๋…ธ๋“œ์˜ GPFS ๋ฐ๋ชฌ๋งŒ ์ค‘์ง€๋œ๋‹ค.

mmgetstate ๋ช…๋ น์œผ๋กœ gpfs ์„œ๋น„์Šค๋ฅผ ํ™•์ธ ํ•  ์ˆ˜ ์žˆ๋‹ค.

mmgetstate โ€“a ๋ช…๋ น์œผ๋กœ ์ „์ฒด nodes ์„œ๋น„์Šค ํ™•์ธ

mmgetstate โ€“aL ๋ช…๋ น์œผ๋กœ quorum nodeํ™•์ธ

4.6.3 GPFS Config ๋ณ€๊ฒฝ

GPFS Cluster configuration์„ ๋ณ€๊ฒฝ ํ•  ๋•Œ๋Š” gpfs ๋ฐ๋ชฌ์„ ๋‚ด๋ฆฌ๊ณ  ์ž‘์—…ํ•ด์•ผ ํ•œ๋‹ค.

GPFS Cluster configuration data์˜ ๋ณ€๊ฒฝ

mmchcluster

usage : mmchcluster -p ngstfe1_gpfs ( primary server๋ฅผ ๋ฐ”๊ฟ€๋•Œ ์‚ฌ์šฉ )

mmchcluster -s ngdtfe1_gpfs ( secondary server๋ฅผ ๋ฐ”๊ฟ€๋•Œ ์‚ฌ์šฉ )

mmchcluster -C cluster_name ( cluster name์„ ๋ฐ”๊ฟ€๋Œ€ ์‚ฌ์šฉ )

mmchcluster -p LATEST ( primary server config ์ •๋ณด๋ฅผ ๋™๊ธฐํ™” ํ•  ๋•Œ)

GPFS cluster configuration parameter ์˜ ๋ณ€๊ฒฝ

mmchconfig

usage : mmchconfig attribute=value -i ( ์‹คํ–‰ํ›„ ์ฆ‰์‹œ ๋ฐ˜์˜๋˜๊ณ  restartํ›„์—๋„ ์ง€์†๋จ )

mmchconfig attribute=value -I ( ์‹คํ–‰ํ›„ ์ฆ‰์‹œ cluster์— ๋ฐ˜์˜๋จ restartํ›„ ์ ์šฉ ์•ˆ๋จ )

Page 55: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 55 -

Attribute

autoload - cluster node๊ฐ€ rebooting ์‹œ ์ž๋™์œผ๋กœ GPFS start

maxFilesToCache - ์ตœ๊ทผ์— ์‚ฌ์šฉํ•œ inode ๊ฐœ์ˆ˜ default ( 1000 )

maxMBpS - Data ์ „์†ก์†๋„ default ( 150MB/s )

maxStatCache - stat cache์— ์œ ์ง€ํ•  inode๊ฐœ์ˆ˜

pagepool - cache on each node default ( 64MB ) > 500M

tiebreakerDisks - node quorum๊ณผ ๊ฐ™์ด ์‚ฌ์šฉ

umountOnDiskFail - disk์— ์žฅ์• ๊ฐ€ ๋ฐœ์ƒํ–ˆ์„ ๋•Œ ๊ฐ•์ œ๋กœ file system์„ umount ์‹œํ‚ด ( yes or no )

* GPFS test ์„œ๋ฒ„์˜ Configuration ๊ฐ’ ํ™•์ธ [root@blade1 gpfslpp]# mmlsconfig Configuration data for cluster gpfstest.blade1: ----------------------------------------------- clusterName gpfstest.blade1 clusterId 13882422822655914773 clusterType lc autoload no useDiskLease yes maxFeatureLevelAllowed 906 maxMBpS 512 maxblocksize 2048k pagepool 256M [blade1] takeOverSdrServ yes File systems in cluster gpfstest.blade1: ---------------------------------------- /dev/gpfs1

4.7 GPFS(GENERAL PARALLEL FILE SYSTEM) ๊ตฌ์„ฑ๋ณ€๊ฒฝ ๋ฐ ์žฅ์• ๋ณต๊ตฌ

4.7.1 GPFS cluster IP ๋ฐ host name ๋ณ€๊ฒฝ

GPFS IP address์™€ hostnames์„ ๋ณ€๊ฒฝํ•˜๋ ค๋ฉด ์•„๋ž˜์˜ ๋‹จ๊ณ„๋ฅผ ๋”ฐ๋ผ์•ผ ํ•œ๋‹ค.

์ „์ฒด nodes IP addresses ์™€ hostnames ๋ณ€๊ฒฝ

IP addresses ์™€ hostnames ๋ณ€๊ฒฝ ์ „์— 1.primary GPFS cluster ์„œ๋ฒ„์—์„œ /var/mmfs/gen/mmsdrfs file๋ฐฑ์—… 2.configuration parameter์„ค์ •์„ ๊ธฐ๋กํ•œ๋‹ค. 3.mmshutdown โ€“a ๋ช…๋ น์œผ๋กœ ์ „ nodes gpfs ๋ฐ๋ชฌ ์ค‘์ง€ ex) [root@blade1 gpfslpp]# mmexportfs all โ€“o gpfs_backup Usage: mmexportfs {Device | all} -o ExportfsFile 4.mmexportfs all ๋ช…๋ น์œผ๋กœ file system ์ •๋ณด๋ฅผ outfile๋กœ ๋งŒ๋“ค์–ด ๋‚ด๋ณด๋‚ธ๋‹ค. ! mmexportfs ๋ช…๋ น์„ ์‹คํ–‰ํ•˜๋ฉด ๊ธฐ์กด cluster์—์„œ filesystem ๋ฐ disk์ •๋ณด๊ฐ€ ์‚ฌ๋ผ์ง 5.IP addresses ์™€ hostnames ๋ณ€๊ฒฝ 6.mmcrcluster ๋ช…๋ น์œผ๋กœ cluster ์žฌ์ƒ์„ฑ 7.mmchconfig ๋ช…๋ น์œผ๋กœ configuration parameter ๋ณต๊ตฌ 8.mmimportfs all ๋ช…๋ น์œผ๋กœ file system ๋ฐ disk ์ •๋ณด๋ฅผ ๋ณต๊ตฌ ex) [root@gpfs1 gpfslpp]# mmimportfs all โ€“i gpfs_backup Usage: mmimportfs {Device | all} -i ImportfsFile [-S ChangeSpecFile]

Client nodes IP addresses ์™€ hostnames ๋ณ€๊ฒฝ

IP addresses ์™€ hostnames ๋ณ€๊ฒฝ ์ „์—

Page 56: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 56 -

1.primary GPFS cluster ์„œ๋ฒ„์—์„œ /var/mmfs/gen/mmsdrfs file๋ฐฑ์—… 2.mmshutdown โ€“a ๋ช…๋ น์œผ๋กœ ์ „ nodes gpfs ๋ฐ๋ชฌ ์ค‘์ง€ 3.mmdelnode ๋ช…๋ น์œผ๋กœ ๋ณ€๊ฒฝ๋  node ์ œ๊ฑฐ 4.mmaddnode ๋ช…๋ น์œผ๋กœ ๊ธฐ์กด cluster์— ํฌํ•จ ์‹œํ‚จ๋‹ค primary or secondary configuration์„œ๋ฒ„ IP addresses ์™€ hostnames ๋ณ€๊ฒฝ 5.mmchcluster ๋ช…๋ น์œผ๋กœ primary or secondary ์„œ๋ฒ„ role ๋ณ€๊ฒฝ mmchcluster โ€“p PrimaryNode primary๋กœ ๋ณ€๊ฒฝ ๋  node๋ช… mmchcluster โ€“s SecondaryNode secondary๋กœ ๋ณ€๊ฒฝ ๋  node๋ช… ip addresses ๋ฐ hostnames ๋ณ€๊ฒฝ ํ›„ configuration ์„œ๋ฒ„ role ๋ณ€๊ฒฝ

primary or backup NSD ์„œ๋ฒ„ IP addresses ์™€ hostnames ๋ณ€๊ฒฝ

1.mmchnsd ๋ช…๋ น์œผ๋กœ primary or backup NSD ์„œ๋ฒ„ ์—†์ด directly-attached NSD๋กœ ๋ณ€๊ฒฝ [root@blade1 /]# mmchnsd โ€œgpfs1nsd;gpfs2nsd;gpfs3nsd;gpfs4nsdโ€ 2.ip addresses ๋ฐ hostnames ๋ณ€๊ฒฝ ํ›„ mmchnsd๋ณ€๊ฒฝ์œผ๋กœ ๋ณต๊ตฌ [root@gpfs1 /]# mmchnsd "gpfs1nsd:blade1:blade2:::" 3.mmlsnsd ๋ช…๋ น์œผ๋กœ primary or backup NSD ์„œ๋ฒ„ ํ™•์ธ [root@gpfs1 /]# mmlsnsd -d "gpfs1nsd" File system Disk name Primary node Backup node ---------------------------------------------------------------------------

gpfs1 gpfs1nsd blade1 blade2

4.7.2 GPFS cluster configuration data ํŒŒ์ผ ๋ณต๊ตฌ

GPFS cluster configuration data ํŒŒ์ผ์€ /var/mmfs/gen/mmsdrfs ํŒŒ์ผ์— ์ €์žฅ๋œ๋‹ค. ์‹œ์Šคํ…œ ์žฅ์• ๋กœ ์ธ

ํ•œ ์„œ๋ฒ„ ์žฌ ์„ค์น˜๋‚˜, ์‚ฌ์šฉ์ž ์‹ค์ˆ˜๋กœ ์ธํ•˜์—ฌ mmsdrfsํŒŒ์ผ์ด ์‚ญ์ œ๋˜๋Š” ๊ฒฝ์šฐ ๋‹ค๋ฅธ ์„œ๋ฒ„์— ์žˆ๋Š” mmsdrfsํŒŒ

์ผ๋กœ ๊ตฌ์„ฑ์ •๋ณด๋ฅผ ๋ณต๊ตฌ ํ•  ์ˆ˜ ์žˆ๋‹ค. ํ•˜์ง€๋งŒ ์ตœ์‹  level๋กœ ํŒŒ์ผ์„ ๋™๊ธฐํ™” ํ•˜๊ธฐ ์œ„ํ•ด์„œ primary or

secondary ํŒŒ์ผ๋กœ ๋ณต๊ตฌ ํ•  ๊ฒƒ์„ ๊ถŒ์žฅ ํ•œ๋‹ค. GPFS cluster ๊ตฌ์„ฑ ํ›„ ๋ชจ๋“  nodes mmsdrfs ํŒŒ์ผ์€ ๋ฐฑ์—…

๋ฐ›๋Š” ๊ฒƒ์„ ๊ถŒ์žฅ ํ•œ๋‹ค.

1.mmshutdown โ€“a ๋ช…๋ น์œผ๋กœ ๋ชจ๋“  nodes์—์„œ gpfs ๋ฐ๋ชฌ ์ค‘์ง€ 2.mmsdrfs ํŒŒ์ผ์ด ์‚ญ์ œ๋œ node์—์„œ mmsdrrestore ๋ช…๋ น ์‹คํ–‰ [root@blade2 /]# mmsdrrestore โ€“p blade1 โ€“F /var/mmfs/gen/mmsdrfs โ€“R /usr/bin/scp Usage: mmsdrrestore โ€“p remoteNode โ€“F remoteFile โ€“R RemoteFileCopyCommand 3.mmchcluster โ€“p LATEST๋ช…๋ น์œผ๋กœ ์ตœ์‹  level๋กœ ๋™๊ธฐํ™” [root@blade2 /]# mmchcluster โ€“p LATEST 4.mmstartup โ€“a ๋ช…๋ น์œผ๋กœ ์žฅ์•  ์„œ๋ฒ„๊ฐ€ ์ •์ƒ์ ์œผ๋กœ ๋™์ž‘ํ•˜๋Š”์ง€ ํ™•์ธ

Page 57: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 57 -

5. XEN

5.1 Xen

์„œ๋ฒ„์˜ ์ž์›์„ ๊ฐ€์ƒํ™” ํ•˜์—ฌ, ํ•˜๋‚˜์˜ ๋ฌผ๋ฆฌ์ ์ธ ์„œ๋ฒ„์— ์—ฌ๋Ÿฌ ๊ฐ€์ง€ OS๋ฅผ ์„ค์น˜ํ•  ์ˆ˜ ์žˆ๋Š” ๊ฐ€์ƒ๋จธ์‹ ์˜ ํ™˜

๊ฒฝ์„ ๋งŒ๋“ค์–ด ์ฃผ๋Š” ์†”๋ฃจ์…˜์œผ๋กœ, Intel Server ์— ์‚ฌ์šฉํ•˜๋Š” Vmware ์™€, IBM System p ์— ์‚ฌ์šฉํ•˜๋Š”

LPAR ์™€ ์œ ์‚ฌํ•˜๋‹ค.

5.2 Xen ์—์„œ ์ง€์›ํ•˜๋Š” ๊ฐ€์ƒํ™” ํ˜•ํƒœ

A) Para Virtualization

System์˜ ๋ชจ๋“  ์ž์›์„ ๊ฐ€์ƒํ™” ํ•˜๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ, ์„ค์น˜๋˜๋Š” ๊ฐ€์ƒ๋จธ์‹ ์ด Xen Kernel ๊ณผ

Communicationํ•˜๊ธฐ ์œ„ํ•œ API๊ฐ€ ์žˆ์–ด์•ผ ํ•˜๋Š”๋ฐ ์ด ๊ฒฝ์šฐ์—๋Š” Virtual Machine ์— ์„ค์น˜๋˜๋Š” OS ์˜

Kernel ์ด ์ˆ˜์ •๋˜์–ด์•ผ ํ•œ๋‹ค. ๊ทธ

๋Ÿฌ๋‚˜ Full Virtualization ์— ๋น„ํ•ด

์„ฑ๋Šฅ์„ ์ข‹์€ ํŽธ์ด๋ฉฐ, Memory

Ballooning ์„ ์ง€์›ํ•˜์—ฌ ์‹œ์Šคํ…œ

์˜ Memory๋ฅผ ์ข€๋” ํšจ์œจ์ ์œผ๋กœ

๊ด€๋ฆฌ ๊ฐ€๋Šฅํ•˜๋‹ค.

B) Full Virtualization

Intel ๋ฐ AMD ์—์„œ ์ œ๊ณตํ•˜๋Š”

Virtualization ๊ธฐ๋Šฅ์ด ์žˆ๊ณ ,

BIOS ์—์„œ ๋ณธ ๊ธฐ๋Šฅ์„ Enableํ•ด

์•ผ๋งŒ ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•˜๋‹ค.

5.3 Xen ์˜ ๊ธฐ๋ณธ ๊ตฌ์กฐ ๋ฐ Memory Ballooning ์ด๋ž€?

์œ„์˜ ๋‚ด์šฉ๊ณผ ๊ฐ™์ด Xen ์˜ ๊ธฐ๋ณธ ๊ตฌ์กฐ๋ฅผ ์‚ดํŽด ๋ณด๋ฉด, HW ์œ„์— Hypervisor ๊ฐ€ ์˜ฌ๋ผ๊ฐ€ ์žˆ์œผ๋ฉฐ, ์‹ค์ œ

Virtual Domain ์„ ๊ด€๋ฆฌ ํ•˜๊ธฐ ์œ„ํ•œ Domain0 ๊ฐ€ ์žˆ๊ณ , Domain 1,2,โ€ฆ ์ด๋Ÿฐ ๊ฐ€์ƒ ๋จธ์‹ ์€ Hypervisor ๋ฅผ

ํ†ตํ•˜์—ฌ ์‹ค์ œ ํ•˜๋“œ์›จ์–ด๋ฅผ Access ํ•˜๊ฒŒ ๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค. ๋˜ํ•œ Full Virtualization ์—์„œ๋Š” ๊ฐ€์ƒ ๋จธ์‹ ์ด

Operation ์ค‘์— ์‹œ์Šคํ…œ Memory Size๋ฅผ ํ™•์žฅ ํ•  ์ˆ˜ ์—†์ง€๋งŒ, Para-Virtualization ํ™˜๊ฒฝ์—์„œ๋Š” ์œ„์˜

Memory Ballooning ์˜ ๊ทธ๋ฆผ์ฒ˜๋Ÿผ, ์‹œ์Šคํ…œ OS ๋ฅผ Operation ์ค‘์— ์‹œ์Šคํ…œ ๋ฉ”๋ชจ๋ฆฌ์˜ ์˜์—ญ์˜ ํ™•์žฅ์ด ๊ฐ€๋Šฅ

ํ•˜๋‹ค.

Page 58: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 58 -

5.4 Xen ๊ตฌ์„ฑ ์‹œ ํ™•์ธ ์‚ฌํ•ญ

Para-virtualized guest installํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š”

CPU๊ฐ€ PAE๋ฅผ ์ง€์›ํ•ด์•ผ ํ•จ. x86_64, ia64๋Š”

PAE๋ฅผ ์ง€์›ํ•˜์ง€๋งŒ i386์˜ ๊ฒฝ์šฐ๋Š” ์•„๋ž˜์™€ ๊ฐ™์ด

PAE๋ฅผ ์ง€์›ํ•˜๋Š”์ง€ ํ™•์ธํ•ด์•ผ ํ•œ๋‹ค.

# grep pae /proc/cpuinfo flags : fpu tsc msr pae mce cx8 apic mtrr mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm constant_tsc pni monitor ds_cpl vmx est tm2 cx16 xtpr lahf_lm flags : fpu tsc msr pae mce cx8 apic mtrr mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm constant_tsc pni monitor ds_cpl vmx est tm2 cx16 xtpr lahf_lm flags : fpu tsc msr pae mce cx8 apic mtrr mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm constant_tsc pni monitor ds_cpl vmx est tm2 cx16 xtpr lahf_lm flags : fpu tsc msr pae mce cx8 apic mtrr mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm constant_tsc pni monitor ds_cpl vmx est tm2 cx16 xtpr lahf_lm

Full-virtualized guest installํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” CPU๊ฐ€ Intel-VT ๋˜๋Š” AMD-V๋ฅผ ์ง€์›ํ•ด์•ผ ํ•œ๋‹ค.

ํ™•์ธํ•˜๋Š” ๋ฐฉ๋ฒ•์€ ์•„๋ž˜์™€ ๊ฐ™๋‹ค.

# egrep -e 'vmx|svm' /proc/cpuinfo flags : fpu tsc msr pae mce cx8 apic mtrr mca cmov pat clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe constant_tsc pni monitor vmx est tm2 xtpr

5.5 virt-install ๋ช…๋ น์œผ๋กœ Xen para-virtualized guest install

1. virt-install ๋ช…๋ น ์‹คํ–‰ํ›„์— full virtualized guest๋กœ ์ธ์Šคํ†จํ•  ๊ฒƒ์ธ๊ฐ€๋ฅผ ๋ฌผ์œผ๋ฉด no๋ฅผ ์ž…๋ ฅ. ๋งŒ์•ฝ

Full-virtualized guest๋กœ install ํ•˜๋ ค๋ฉด yes๋ฅผ ์ž…๋ ฅ

2. virtual machine name๋ฅผ ์ž…๋ ฅ(์ด๋ฆ„์€ ์ž„์˜๋กœ) : rhel5b_pv1

3. virtual machine์ด ์‚ฌ์šฉํ•  ๋ฉ”๋ชจ๋ฆฌ์–‘์„ ์ž…๋ ฅ(megabyte) : 500

4. guest๊ฐ€ ํŒŒ์ผ์‹œ์Šคํ…œ์œผ๋กœ ์‚ฌ์šฉํ•  guest image ์ด๋ฆ„์„ ์ž…๋ ฅ(์ ˆ๋Œ€๊ฒฝ๋กœ ํฌํ•จ) :

/xen/rhel5b_pv1.img

5. guest๊ฐ€ ์‚ฌ์šฉํ•  ๋””์Šคํฌ ์šฉ๋Ÿ‰์„ ์ž…๋ ฅ(gigabyte): 6

6. guest OS install source๋ฅผ ์ž…๋ ฅ : nfs:192.168.12.57:/install ๋˜๋Š” http://192.168.12.57/install

7. Para-virtualization์€ network intall๋งŒ ์ง€์›ํ•จ. ๋”ฐ๋ผ์„œ NFS๋‚˜ http๋กœ OS source๋ฅผ ๋ฏธ๋ฆฌ

exportํ•ด ๋†“์•„์•ผ ํ•จ.

8. installation์ด ์ง„ํ–‰๋จ.

Page 59: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 59 -

xwindow์ƒ์˜ virt manager๋ฅผ ํ†ตํ•ด install์„ ๊ณ„์† ์ง„ํ–‰ํ•  ์ˆ˜ ์žˆ์Œ.(graphics support๋ฅผ yes๋กœ ํ• 

๊ฒฝ์šฐ, no๋กœ ํ–ˆ์„ ์‹œ๋Š” test mod๋กœ install์ด ์ง„ํ–‰๋จ.)

Virtual machine ์‹คํ–‰ guest domain running ํ™•์ธ

Page 60: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 60 -

Guest domain์„ ์„ ํƒํ•˜์—ฌ open์„ ํ•˜๋ฉด ํ•ด๋‹น guest domain์˜ ์ฝ˜์†”์ฐฝ์ด ๋œธ.

๊ณ„์† ์„ค์น˜ ์ง„ํ–‰

Page 61: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 61 -

5.6 virt-manager๋ช…๋ น์œผ๋กœ Xen para-virtualized guest install

A) xwindows ์ฝ˜์†”์ฐฝ์—์„œ virt-manager ๋ช…๋ น์„ ์ž…๋ ฅํ•˜๊ฑฐ๋‚˜ ์•„๋ž˜์™€ ๊ฐ™์ด X์œˆ๋„์šฐ application ๋ฉ”๋‰ด์—์„œ

์„ ํƒ

B) Virtual Machine Manager ๊ธฐ๋ณธ์ฐฝ์ด ํŒ์—…๋œ๋‹ค.

Page 62: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 62 -

C) ์ƒ๋‹จ ๋ฉ”๋‰ด์—์„œ ์•„๋ž˜์™€ ๊ฐ™์ด new machine์„ ์„ ํƒ

D) ์•„๋ž˜์™€ ๊ฐ™์€ ์ฐฝ์ด ํŒ์—…๋˜๋ฉด Forward ์„ ํƒ

Page 63: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 63 -

E) virtual system ์ด๋ฆ„์„ ์ •ํ•˜์—ฌ ์ž…๋ ฅ

F) paravirtualized๋ฅผ ํ•  ์ง€ Fully virtualized๋ฅผ ํ• ์ง€ ์„ ํƒ ํ›„ Forward ์„ ํƒ

Page 64: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 64 -

G) paravirtualized๋ฅผ ์„ ํƒํ–ˆ์„ ๊ฒฝ์šฐ๋Š” ์•„๋ž˜์™€ ๊ฐ™์ด network install source ์ฃผ์†Œ๋ฅผ ์ž…๋ ฅํ•˜๋Š” ๋ฉ”๋‰ด๋กœ ๋„˜

์–ด๊ฐ„๋‹ค. ๋ฏธ๋ฆฌ ๊ตฌ์„ฑํ•œ NFS, HTTP ์„œ๋ฒ„ ์ฃผ์†Œ๋ฅผ ์ž…๋ ฅํ•˜๊ณ  kickstart ํŒŒ์ผ์ด ์ค€๋น„๋˜์–ด ์žˆ์œผ๋ฉด ๊ทธ ์ฃผ์†Œ๋ฅผ

์ž…๋ ฅํ•œ๋‹ค.

H) virtual system์„ ์„ค์น˜ํ•  ์Šคํ† ๋ฆฌ์ง€ ์˜์—ญ์„ ์„ค์ •ํ•œ๋‹ค. ๋ฌผ๋ฆฌ์  ํŒŒํ‹ฐ์…˜์ด ์ค€๋น„๋˜์–ด ์žˆ์œผ๋ฉด ์ฒซ๋ฒˆ์งธ

Normal Disk Partition์— ํ•ด๋‹น ํŒŒํ‹ฐ์…˜์˜ ๋””๋ฐ”์ด์Šค๋ช…์„ ์ž…๋ ฅํ•˜๊ณ  ์ด๋ฏธ์ง€ํŒŒ์ผ์— ์„ค์น˜๋ฅผ ํ•˜๋ ค๋ฉด Simple

file์„ ์ ˆ๋Œ€๊ฒฝ๋กœ/name.img ํ˜•์‹์œผ๋กœ ์ž…๋ ฅ์„ ํ•ด์ค€๋‹ค.

Page 65: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 65 -

I) ๋‹ค์Œ ํ™”๋ฉด์—์„œ vitual system์— ํ• ๋‹นํ•  ๋ฉ”๋ชจ๋ฆฌ์™€ CPU ๊ฐœ์ˆ˜์™€ ์ตœ๋Œ€ ํ—ˆ์šฉ๋Ÿ‰์„ ์„ ํƒ

J) ๋‹ค์Œ ํ™”๋ฉด์—์„œ ์ตœ์ข… ์„ค์ • ์ •๋ณด๋“ค์„ ํ™•์ธํ•œ ํ›„์— Finish๋ฅผ ์„ ํƒ

Page 66: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 66 -

K) ์„ค์ •์ž‘์—…์„ ๋๋‚ด๋ฉด virtual machine ์ƒ์„ฑ์ž‘์—…์ด ์‹คํ–‰๋œ๋‹ค

L) OS ์„ค์น˜ํ™”๋ฉด์ด ํŒ์—…๋˜๋ฉด ์ผ๋ฐ˜ OS์™€ ๋™์ผํ•˜๊ฒŒ install์„ ์ง„ํ–‰ํ•œ๋‹ค

Page 67: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 67 -

5.7 Virtual system Management

A) Virtual Machine Manager ๊ธฐ๋ณธ ํ™”๋ฉด

B) Virtual Machine Manager ๊ธฐ๋ณธ ํ™”๋ฉด์—์„œ ํ•ด๋‹น virtual system์„ ์„ ํƒํ•œ ํ›„์— ์•„๋ž˜ Details๋ฅผ ํด๋ฆญํ•˜

๋ฉด ์•„๋ž˜์™€ ๊ฐ™์€ ๊ด€๋ฆฌ์ฐฝ์ด ํŒ์—…๋œ๋‹ค. ์ƒ๋‹จ ํƒญ์—์„œ๋Š” virtual system์„ shutdown/Pause ํ•  ์ˆ˜ ์žˆ๋Š” ํƒญ์ด

์žˆ๋‹ค.

Page 68: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 68 -

C) Hardware ํƒญ์„ ์„ ํƒํ•˜๋ฉด virtual system์— ํ• ๋‹น๋œ CPU, memory, disk, network ์ •๋ณด ํ™•์ธ ๋ฐ ์„ค์ •

๋ณ€๊ฒฝ์„ ํ•  ์ˆ˜ ์žˆ๋‹ค.

D) ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ ํƒญ

Page 69: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 69 -

E) ๋””์Šคํฌ ๊ด€๋ฆฌ ํƒญ

F) Network ๊ด€๋ฆฌํƒญ

Page 70: Linux Advanced Managementblog.syszone.co.kr/attachment/2867923805.pdf6. Hardware Control Remote POST/BIOS console through the IBM Management Processor Network and with terminal servers

IBM System x Enterprise Architecture Linux Advanced Management

- 70 -

5.8 Command ๊ธฐ๋ฐ˜ Management

A. xm create [virtual system name] : ์ด๋ฏธ ์ƒ์„ฑ์ด ๋˜์–ด ์žˆ๊ฑฐ๋‚˜ ์ƒ์„ฑํ•  virtual system์„ ์‹œ์ž‘ํ•œ๋‹ค.

B. xm console [virtual system name] : virtual system์˜ ์ฝ˜์†”์„ ๋„์šด๋‹ค.(para-virtualized์˜ ๊ฒฝ์šฐ ์œ ์šฉ

ํ•จ)

C. xm destory [virtual system name] : virtual system์„ ์ œ๊ฑฐํ•œ๋‹ค.

D. xm shutdown [virtual system name] : virtual system์„ shutdown(full-virtualized์˜ ๊ฒฝ์šฐ๋Š” hard

poweroff๋ฅผ ํ•œ๋‹ค)

E. xm top [virtual system name] : top๊ณผ ๋น„์Šทํ•œ ์‹œ์Šคํ…œ ๋ชจ๋‹ˆํ„ฐ๋ง์„ ํ•  ์ˆ˜ ์žˆ๋‹ค.

F. xm mem-set [virtual system name] [value] : para-virtualized guest์˜ ๋ฉ”๋ชจ๋ฆฌ ์‚ฌ์ด์ฆˆ๋ฅผ ์กฐ์ ˆํ•œ๋‹ค.

G. xm vcpu-set [virtual system name] [value] : : para-virtualized guest์˜ CPU ๊ฐœ์ˆ˜๋ฅผ ์กฐ์ ˆํ•œ๋‹ค.