intel cluster checker module reference · intel r cluster checker module reference contents 1...

151
Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk 14 7 base_libs 15 8 bash 16 9 binutils_2_15_92 17 10 clean_ipc 18 11 clock_granularity 19 12 clock_sync 20 13 clomp 21 14 cluster_size 22 15 copy_exactly 23 16 core_count 25 17 core_frequency 26 18 cpuinfo 27 19 cron 28 20 csh 29 21 dat_conf 30 1

Upload: others

Post on 04-Apr-2020

64 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster CheckerModule Reference

Contents

1 1GiB_memory 7

2 65GiB_storage_head 8

3 X11_clients 9

4 X11_libs 11

5 arch 13

6 available_disk 14

7 base_libs 15

8 bash 16

9 binutils_2_15_92 17

10 clean_ipc 18

11 clock_granularity 19

12 clock_sync 20

13 clomp 21

14 cluster_size 22

15 copy_exactly 23

16 core_count 25

17 core_frequency 26

18 cpuinfo 27

19 cron 28

20 csh 29

21 dat_conf 30

1

Page 2: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 CONTENTS

22 dmidecode 31

23 e1000 32

24 e1000e 33

25 environment 34

26 etc_hosts 35

27 file_permissions 36

28 file_tree 38

29 gcc 39

30 gcc_3_4_6 40

31 gdb_6_3 41

32 generic_correctness 42

33 generic_uniformity 44

34 genuine_intel 45

35 gige 46

36 glibc_2_3_4 47

37 gmake_3_80 49

38 hdparm 50

39 home 52

40 host_conf 53

41 hostname 54

42 hpcc 55

43 ib_firmware 58

44 ibadm 59

45 icr_version_1_0 60

46 icr_version_1_1 61

47 igb 62

48 imb_collective_intel_mpi 63

49 imb_message_integrity_intel_mpi 65

2

Page 3: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 CONTENTS

50 imb_pingpong_intel_mpi 67

51 intel_cc 70

52 intel_cc_rtl_9_1 72

53 intel_cce_rtl 74

54 intel_cce_rtl_9_1 75

55 intel_cmkl_rtl_9_0 77

56 intel_devtools_1_0 79

57 intel_fc 81

58 intel_fc_rtl_9_1 82

59 intel_fce_rtl 84

60 intel_fce_rtl_9_1 85

61 intel_mpi 87

62 intel_mpi_internode 89

63 intel_mpi_rt 91

64 intel_mpi_rt_3_0_033 93

65 intel_mpi_rt_internode 96

66 intel_mpi_testsuite 98

67 intel_tbb_rtl_1_0 100

68 ip_alltoall 101

69 ipoib 102

70 java_1_4_2 103

71 jdk_1_4_2 104

72 kernel 105

73 kernel_2_6_17 106

74 kernel_modules 107

75 kernel_parameters 108

76 ksh 109

77 lib32_counterpart_lib64 110

3

Page 4: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 CONTENTS

78 loopback 111

79 memory_bandwidth_stream 112

80 mflops_intel_mkl 114

81 mount_proc 116

82 mpi_consistency 117

83 network_consistency 118

84 nfs_mounts 119

85 nisdomain 120

86 nismaps 121

87 nsswitch 122

88 openib 123

89 openssh_3_9 125

90 packages 126

91 pci 127

92 perl 128

93 perl_5_6_1 129

94 ping 130

95 ping_alltoall 131

96 portal 132

97 process_check 133

98 python 135

99 python_2_3_4 136

100rpm 137

101sh 138

102shm_mount 139

103single_authentication 140

104speedstep 141

105ssh 142

4

Page 5: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 CONTENTS

106ssh_alltoall 143

107ssh_version 144

108stray_uids 145

109subnet_manager 146

110system_memory 147

111tcl_8_4_7 148

112tcsh 149

113tmp 150

114uid_sync 151

5

Page 6: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 CONTENTS

Disclaimer and Legal Information

INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL(R) PRODUCTS. NO LI-CENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTYRIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CON-DITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND IN-TEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTELPRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PUR-POSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLEC-TUAL PROPERTY RIGHT.Intel products are not intended for use in medical, life saving, life sustaining, critical control or safety systems, or innuclear facility applications.Intel may make changes to specifications and product descriptions at any time, without notice. Designers must notrely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reservesthese for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising fromfuture changes to them. The information here is subject to change without notice. Do not finalize a design with thisinformation.The products described in this document may contain design defects or errors known as errata which may cause theproduct to deviate from published specifications. Current characterized errata are available on request.Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing yourproduct order.Copies of documents which have an order number and are referenced in this document, or other Intel literature, maybe obtained by calling 1-800-548-4725, or by visiting Intel’s Web Site.* Other names and brands may be claimed as the property of others.Copyright (C) 2006-2009, Intel Corporation. All rights reserved.

6

Page 7: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 1GiB_memory

1 1GiB_memory

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the amount of physical memory in each node meets requirements.

METHOD

Get the amount of physical memory from /proc/meminfo.

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

grep

/proc/cpuinfo

/proc/meminfo

7

Page 8: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 65GiB_storage_head

2 65GiB_storage_head

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the amount of storage available to the head node meets requirements.

METHOD

Get the disk size from df(1).

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

df

8

Page 9: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 X11_clients

3 X11_clients

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the X11 clients meet requirements on the head node.

METHOD

Confirm that the following files are available on the head node:

glxinfo

listres

mkhtmlindex

showfont

viewres

x11perf

x11perfcomp

xauth

xclipboard

xev

xfd

xfontsel

xkill

xload

xmessage

xterm

xvinfo

xwininfo

CONFIGURATION

None

MODULE CLASS

unit

9

Page 10: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 X11_clients

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

which

10

Page 11: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 X11_libs

4 X11_libs

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the X11 runtime libraries meet requirements.

METHOD

Confirm that the following libraries (or later versions) are in the dynamic linker cache on all nodes. x86-64 versionsneed to be present.

libFS.so.6

libGLw.so.1

libI810XvMC.so.1

libICE.so.6

libMrm.so.3

libOSMesa.so.4

libUil.so.3

libX11.so.6

libXcomposite.so.1

libXcursor.so.1

libXdamage.so.1

libXevie.so.1

libXext.so.6

libXfixes.so.3

libXfont.so.1

libXft.so.2

libXinerama.so.1

libXi.so.6

libXm.so.3

libXmu.so.6

libXmuu.so.1

libXp.so.6

libXrandr.so.2

libXrender.so.1

11

Page 12: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 X11_libs

libXRes.so.1

libXss.so.1

libXTrap.so.6

libXt.so.6

libXtst.so.6

libXvMC.so.1

libXv.so.1

libXxf86dga.so.1

libXxf86misc.so.1

libXxf86vm.so.1

libfontconfig.so.1

libfreetype.so.6

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

/sbin/ldconfig

12

Page 13: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 arch

5 arch

Check the uniformity of the system architecture on all nodes

DESCRIPTION

arch is an Intel(R) Cluster Checker module to verify the uniformity of the system architecture among nodes. Themodule examines the output of ‘uname -p‘ to compare against the other nodes.

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

uname

13

Page 14: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 available_disk

6 available_disk

Check the space available on a local filesystem

DESCRIPTION

available_disk is an Intel(R) Cluster Checker module to verify that a minimum amount of free space is available on alocal filesystem. Only local filesytems can be checked.

CONFIGURATION

filesystem

A container that groups the other options by filesystem

available The minimum amount of free disk space, in KB, that is required. If not specified, the free space is notchecked.

mountpoint The mountpoint of the filesystem.

Example

<available_disk><filesystem>

<available>10485760</available><mountpoint>/</mountpoint>

</filesystem><filesystem>

<available>102400</available><mountpoint>/boot</mountpoint>

</filesystem></available_disk>

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

df

14

Page 15: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 base_libs

7 base_libs

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the base libraries are present and meet requirements.

METHOD

Confirm that the following libraries (or later versions) are in the dynamic linker cache on all nodes. Both 32-bit andx86-64 versions need to be present.

libacl.so.1

libattr.so.1

libbz2.so.1

libcap.so.1

libelf.so.0

libgdbm.so.2

libncurses.so.5

libpam.so.0

libtermcap.so.2

libz.so.1

Libraries checked by theglibc_2_3_4 module are not re-examined in this module.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

glibc_2_3_4

ssh

genuine_intel

EXTERNAL DEPENDENCIES

/sbin/ldconfig

15

Page 16: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 bash

8 bash

Check the Bourne Again Shell

DESCRIPTION

bash is an Intel(R) Cluster Checker module to verify the Bourne Again Shell. The module tests if/bin/bash existsand runs a ’Hello World’ script.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

/bin/bash

test

16

Page 17: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 binutils_2_15_92

9 binutils_2_15_92

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the GNU (*) binutils are present and meet requirements.

METHOD

Compare the versions of the binutils to 2.15.92:

addr2line

ar

as

gprof

ld

nm

objcopy

objdump

ranlib

readelf

size

strings

strip

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

GNU binutils

17

Page 18: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 clean_ipc

10 clean_ipc

Check that no SysV IPC facilities are open

DESCRIPTION

clean_ipc is an Intel(R) Cluster Checker module to verify that the ipc subsystem is clean. The module executesipcs-a to get a list of Shared Memory Segments, Semaphore Arrays, and Message Queues. If there are any entries, it willflag them and fail, giving a count of the entries in each list.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ipcs

18

Page 19: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 clock_granularity

11 clock_granularity

Check the minimum granularity of gettimeofday()

DESCRIPTION

clock_granularity is an Intel(R) Cluster Checker module to verify thegettimeofday() system clock granular-ity. The module executes a C loop and counts the number of times through the loop before the value returned bygettimeofday() changes. As long as the number of times through the loop is greater than 1, this is an accurate, ifsimplistic, way to compute the clock granularity.

CONFIGURATION

granularity

The acceptable clock granularity threshold in microseconds.Default: 2 us

Example

<clock_granularity><granularity>2</granularity>

</clock_granularity>

MODULE CLASS

unit

DEPENDENCIES

gcc

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

None

19

Page 20: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 clock_sync

12 clock_sync

Check the cluster clock synchronization

DESCRIPTION

clock_sync is an Intel(R) Cluster Checker module to verify the system clocks on each node are reasonably synchro-nized. The module calculates the difference between the node time and the cluster median time and compares it to athreshold value.

CONFIGURATION

deviation

The maximum deviation (in seconds) of the clock on any node from the cluster median.Default: 300 seconds

Example

<clock_sync><deviation>300</deviation>

</clock_sync>

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

date

NOTES

Since the node clocks are not sampled at exactly the same instance, specifying too small of a threshold is not recom-mended.

20

Page 21: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 clomp

13 clomp

Check the Intel(R) C++ Compiler Cluster OpenMP runtime library

DESCRIPTION

clomp is an Intel(R) Cluster Checker module to verify the Intel(R) C++ Compiler Cluster OpenMP runtime. Themodule runs a Hello World binary on the compute nodes using Cluster OpenMP and checks that the kernel parameterrandomize_va_space is not set.Note: this module does not build the Hello World binary using the Intel(R) C++ Compiler.

CONFIGURATION

cc-path

The base path to the Intel(R) C++ Compiler installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

Example

<clomp><cc-path>/opt/intel/cc/9.1</cc-path>

</clomp>

MODULE CLASS

vector

DEPENDENCIES

arch

intel_cce_rtl

sh

ssh

ssh_alltoall

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) C++ Compiler 9.1 or later runtime

/sbin/sysctl

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH) inherited from the user running Intel(R) ClusterChecker is setup correctly.Assumes that the nodes are x86_64 architecture.

21

Page 22: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 cluster_size

14 cluster_size

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the number of nodes meets requirements.

METHOD

Count the number of nodes being checked.

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

genuine_intel

EXTERNAL DEPENDENCIES

None

22

Page 23: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 copy_exactly

15 copy_exactly

Check that a node image is an exact copy of a reference image

DESCRIPTION

copy_exactly is an Intel(R) Cluster Checker module to verify that a node is an exact copy of a reference system. Themodule uses the output of thenode_checksum script run on the reference system as the basis of the comparison.

CONFIGURATION

compute_node

The path to the compute node reference file.Default: none

exclude

File to exclude from the test based on a regular expression match. May be specified multiple times.Default: none

head_node

The path to the head node reference file.Default: none

Example

<copy_exactly><compute_node>file1</compute_node><exclude>^/etc/sysconfig/</exclude><head_node>file2</head_node>

</copy_exactly>

MODULE CLASS

unit

DEPENDENCIES

sh

ssh

tmp

genuine_intel

23

Page 24: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 copy_exactly

EXTERNAL DEPENDENCIES

find

md5sum

prelink

xargs

NOTES

This module may take a long time (several minutes to over an hour, depending on the cluster size) to complete if thenumber of files to check is very large.

24

Page 25: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 core_count

16 core_count

Check the number and type of cores and processors

DESCRIPTION

core_count is an Intel(R) Cluster Checker module to verify the number of processors, physical cores, and logical coresper node. If the number of cores or processors is not specified, the uniformity of the count is checked for all nodes.Additionally, the test module verifies the status of Intel(R) Hyper-Threading Technology.

CONFIGURATION

hyper-threading

Flag to designate that Intel(R) Hyper-Threading Technology should be enabled.Default: Hyper-Threading Technology should not be enabled.

logical-cores

The number of logical cores per node. Logical cores include physical and SMT/HT cores.

physical-cores

The number of physical cores per node. Physical cores exclude SMT/HT cores.

processors

The number of processors per node.

Example

<core_count><hyper-threading/><logical-cores>8</logical-cores><physical-cores>4</physical-cores><processors>2</processors>

</core_count>

MODULE CLASS

vector

DEPENDENCIES

gcc

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

None

25

Page 26: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 core_frequency

17 core_frequency

Check the frequency of all processor cores

DESCRIPTION

core_frequency is an Intel(R) Cluster Checker module to verify that the frequency of each processor core in the clusteris within a specified threshold of a specified frequency.

CONFIGURATION

frequency

Expected frequency of a core in MHz.Default: median of the collected frequencies

threshold

Maximum absolute deviation from the expected frequency that is allowable, in MHz.Default: 5

Example

<core_frequency><frequency>3056</frequency><threshold>50</threshold>

</core_frequency>

MODULE CLASS

vector

DEPENDENCIES

ssh

mount_proc

genuine_intel

EXTERNAL DEPENDENCIES

grep

26

Page 27: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 cpuinfo

18 cpuinfo

Check the uniformity of /proc/cpuinfo

DESCRIPTION

cpuinfo is an Intel(R) Cluster Checker module to verify the uniformity of /proc/cpuinfo on all nodes. Besides checkingthat the core count is the same across the cluster, it tries to verify that all fields are uniform unless explicitly excluded.The following fields are excluded by default:BogoMIPS, bogomips , cpu MHz, itc MHz , physical id ,processor , runqueue andapicid . The core frequency is checked in thecore_frequency test module.

CONFIGURATION

exclude

The name of the field to exclude from the check. May be specified multiple times to exclude more than one field.Default: none

Example

<cpuinfo><exclude>stepping</exclude>

</cpuinfo>

MODULE CLASS

vector

DEPENDENCIES

ssh

mount_proc

genuine_intel

EXTERNAL DEPENDENCIES

cat

27

Page 28: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 cron

19 cron

Check that cron is not running

DESCRIPTION

cron is an Intel(R) Cluster Checker module to verify that the process list does not include any crond processes.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ps

grep

28

Page 29: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 csh

20 csh

Check the C Shell

DESCRIPTION

csh is an Intel(R) Cluster Checker module to verify the C Shell. The module tests if/bin/csh exists and runs a’Hello World’ script.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

/bin/csh

test

29

Page 30: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 dat_conf

21 dat_conf

Check that all entries in /etc/dat.conf are valid

DESCRIPTION

dat_conf is an Intel(R) Cluster Checker module to verify that all entries in /etc/dat.conf have corresponding networkdevices.

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

cat

ifconfig

sh

30

Page 31: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 dmidecode

22 dmidecode

Check the uniformity of the SMBIOS/DMI information

DESCRIPTION

dmidecode is an Intel(R) Cluster Checker module to verify the uniformity of the SMBIOS/DMI information returnedby the dmidecode utility.

CONFIGURATION

exclude

dmidecode field to exclude from the check based on an exact match. May be specified multiple times to exclude morethan one field.

Example

<dmidecode><exclude>Memory Device (0x002F): Part Number</exclude><exclude>System Slot Information (0x001D): Length</exclude>

</dmidecode>

MODULE CLASS

vector

DEPENDENCIES

gcc

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

dmidecode (http://www.nongnu.org/dmidecode/)

make

NOTES

This check may only be run by a privileged user.

31

Page 32: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 e1000

23 e1000

Check the Intel(R) PRO/1000 Network Driver (e1000)

DESCRIPTION

e1000 is an Intel(R) Cluster Checker module to verify the e1000 kernel module. The module checks that the kernelmodule is loaded, the version and module file are uniform across the cluster, and the Interrupt throttling options areset.

CONFIGURATION

options

The string that is compared to the e1000 driver options.Default: options e1000 InterruptThrottleRate=0,0 TxIntDelay=0,64 RxAbsIntDelay=0,128 TxAbsIntDelay=0,64

version

The string that is compared to the version string in the e1000 kernel module.Default: noneIf versionis not specified in the configuration file, then the e1000 kernel module version is not checked.

Example

<e1000><options>options e1000 InterruptThrottleRate=0,0</options><version>5.2.52-k3</version>

</e1000>

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

genuine_intel

EXTERNAL DEPENDENCIES

grep

lsmod

md5sum

modprobe

strings

32

Page 33: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 e1000e

24 e1000e

Check the Intel(R) PRO/1000 Network Driver (e1000e)

DESCRIPTION

e1000e is an Intel(R) Cluster Checker module to verify the e1000e kernel module. The module checks that the kernelmodule is loaded, the version and module file are uniform across the cluster, and the Interrupt throttling options areset.

CONFIGURATION

options

The string that is compared to the e1000e driver options.Default: options e1000e InterruptThrottleRate=0,0 TxIntDelay=0,64 RxAbsIntDelay=0,128 TxAbsIntDelay=0,64

version

The string that is compared to the version string in the e1000e kernel module.Default: noneIf versionis not specified in the configuration file, then the e1000e kernel module version is not checked.

Example

<e1000e><options>options e1000e InterruptThrottleRate=0,0</options><version>0.2.9</version>

</e1000e>

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

genuine_intel

EXTERNAL DEPENDENCIES

grep

lsmod

md5sum

modprobe

strings

33

Page 34: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 environment

25 environment

Check the uniformity of environment variables

DESCRIPTION

environment is an Intel(R) Cluster Checker module to verify environment variables. It verifies the uniformity of theenvironment variables on the compute nodes. If the head node is also a compute node, the test also verifies thatthe environment variables on the compute nodes are the same on the head node (although the head node may haveadditional environment variables that are not set on the compute nodes.)

CONFIGURATION

exclude

The name of an environment variable to exclude from the check (case sensitive.) The variables HOST, HOSTNAME,SSH_CLIENT, SSH_CONNECTION, and SSH2_CLIENT are automatically excluded. This option may be specifiedmore than once to exclude multiple environment variables.Default: none

Example

<environment><exclude>LANG</exclude><exclude>PAGER</exclude>

</environment>

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

printenv

34

Page 35: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 etc_hosts

26 etc_hosts

Check that hostnames are associated to only one IP address.

DESCRIPTION

etc_hosts.pm is an Intel(R) Cluster Checker module to verify that each hostname is associated to only one IP addressin the /etc/hosts file.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

cat

35

Page 36: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 file_permissions

27 file_permissions

Check file existence, ownership, and permissions

DESCRIPTION

file_permissions is an Intel(R) Cluster Checker module to verify the existence, ownership, and permissions on filesand directories.

CONFIGURATION

object

A container that groups the other options by file / directory path

path The path to the file or directory to be tested. A path is required in each object container. The existence of thepath is always checked.

group The name of the group who should own the file / directory specified in the path. If not specified, the groupownership is not checked.

permissions The expected permissions, in octal, of the file / directory specified in the path. If not specified, thepermissions are not checked.

user The name of the user who should own the file / directory specified in the path. If not specified, the userownership is not checked.

Example

<file_permissions><object>

<group>root</group><path>/tmp</path><permissions>1777</permissions><user>root</user>

</object><object>

<path>/shared</path><permissions>0755</permissions>

</object><object>

<path>/home</path></object>

</file_permissions>

MODULE CLASS

unit

36

Page 37: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 file_permissions

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

stat

37

Page 38: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 file_tree

28 file_tree

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the file trees meet requirements.

METHOD

Compute the md5sum of files in the following directories on the head node: /bin, /boot, /dev, /etc, /lib, /lib64, /media,/mnt, /opt, /sbin, /usr, and /var/lib/alternatives.The compute nodes must be identical to each other. If the head node is also a compute node, the head node must be asuperset of the compute nodes.Some files / paths are explicitly excluded from the test. For a list of excluded files please see the Intel(R) ClusterReady Specification.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

find

md5sum

prelink

xargs

NOTES

For a typical OS installation, this module verifies the checksum of hundreds of thousands of files on each node. As aconsequence, this module may take a long time (several minutes to over an hour) to complete.

38

Page 39: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 gcc

29 gcc

Check the functionality and uniformity of the GNU (*) C/C++ compilers

DESCRIPTION

gcc is an Intel(R) Cluster Checker module to verify the GNU C/C++ compilers. The module examines the compilerversion and builds / executes C and C++ ’Hello World’ programs.

CONFIGURATION

gcc-path

The base path to the GNU C/C++ compilers.Default: /usr/bin

version

The string that is compared to the GNU C/C++ compiler version.Default: noneIf versionis not specified in the Intel(R) Cluster Checker configuration file, then the specific compiler version will notbe checked.The uniformity of the version string is compared across the cluster regardless.

Example

<gcc><gcc-path>/usr/bin</gcc-path><version>3.2.3</version>

</gcc>

MODULE CLASS

vector

DEPENDENCIES

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

gcc

g++

test

NOTES

Compiler warnings result in a failure. For some warnings, this is the correct behavior, but for ’harmless’ warnings,this produces false positives.

39

Page 40: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 gcc_3_4_6

30 gcc_3_4_6

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the GNU (*) compilers are present and meet requirements.

METHOD

Compare the gcc and g++ versions to 3.4.6

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

gcc

g++

40

Page 41: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 gdb_6_3

31 gdb_6_3

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the GNU (*) debugger is present and meets requirements.

METHOD

Compare the gdb version to 6.3.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

gdb

41

Page 42: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 generic_correctness

32 generic_correctness

User-defined correctness check

DESCRIPTION

generic_correctness is an Intel(R) Cluster Checker module that executes a specified command and compares the outputto specified output.The intended purpose of this module is to implement site specific and/or temporary checks; cross-site and/or permanent checks should be implemented in dedicated modules.Note: by default, this module willnot be run as root due to the potential security issues and/or inadvertent configurationchanges that are inherent in running an arbitrary command on every node of a cluster as the root user. Set the overrideconfiguration option to run this module as root.

CONFIGURATION

item

The container for a command / result set.

command The command to be executed.

result The exact output of the command that should be considered correct. Case and white space sensitive.

override

Override the check that will not allow this module to run as root.Default: false

Example

<generic_correctness><item>

<command>uname -r</command><result>2.4.21-20.EL</result>

</item><item>

<command>/sbin/lsmod | grep e1000</command><result>e1000 171104 1</result>

</item></generic_correctness>

MODULE CLASS

unit

DEPENDENCIES

sh

ssh

genuine_intel

42

Page 43: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 generic_correctness

EXTERNAL DEPENDENCIES

None

43

Page 44: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 generic_uniformity

33 generic_uniformity

User-defined node uniformity check

DESCRIPTION

generic_uniformity.pm is an Intel(R) Cluster Checker module that executes a specified command and compares theoutput between nodes. The intended purpose of this module is to implement site specific and/or temporary checks;cross-site and/or permanent checks should be implemented in dedicated modules.Note: by default, this module will not be run as root due to the potential security issues and/or inadvertent configurationchanges that are inherent in running an arbitrary command on every node of a cluster as the root user. Set the overrideconfiguration option to run this module as root.

CONFIGURATION

command

The command to be executed.

override

Override the check that will not allow this module to run as root.Default: false

Example

<generic_uniformity><command>uname -r</command><command>/sbin/lsmod | grep e1000</command>

</generic_uniformity>

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

genuine_intel

EXTERNAL DEPENDENCIES

None

44

Page 45: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 genuine_intel

34 genuine_intel

Check that the nodes contain GenuineIntel processors

DESCRIPTION

genuine_intel is an Intel(R) Cluster Checker module to verify that the cluster is built using GenuineIntel processors.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

mount_proc

EXTERNAL DEPENDENCIES

grep

45

Page 46: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 gige

35 gige

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the Ethernet interface meets requirements.

METHOD

Read the interface speed usingethtool for all Ethernet interfaces. The speed of at least one interface must be greaterthan or equal to 1000 Mb/s.Note: ethtool can only be run by root.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ifconfig

ethtool

46

Page 47: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 glibc_2_3_4

36 glibc_2_3_4

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the glibc runtime meets requirements.

METHOD

Compare the libc.so.6 version to 2.3.4.Confirm that the following libraries (or later versions) are in the dynamic linker cache on all nodes. Both 32-bit andx86-64 versions need to be present.

ld-linux.so.2

ld-linux-x86-64.so.2

libBrokenLocale.so.1

libSegFault.so

libanl.so.1

libc.so.6

libcidn.so.1

libcrypt.so.1

libdl.so.2

libgcc_s.so.1

libm.so.6

libnsl.so.1

libnss_compat.so.2

libnss_dns.so.2

libnss_files.so.2

libnss_hesiod.so.2

libnss_nis.so.2

libnss_nisplus.so.2

libpthread.so.0

libresolv.so.2

librt.so.1

libstdc++.so.6

libthread_db.so.1

libutil.so.1

47

Page 48: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 glibc_2_3_4

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

/sbin/ldconfig

48

Page 49: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 gmake_3_80

37 gmake_3_80

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that GNU (*) make is present and meets requirements.

METHOD

Compare the gdb version to 3.80.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

gmake

49

Page 50: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 hdparm

38 hdparm

Check the disk performance of a node

DESCRIPTION

hdparm is an Intel(R) Cluster Checker module to verify the disk read rate of cached and raw device operations andtheir deviation over the cluster nodes. The module executes thehdparm utility to measure the disk performance.Note: this test will only be run as root since hdparm requires direct access to the disk device.

CONFIGURATION

cache-read

The minimum acceptable read rate, in MB/s, from disk buffer cache. This corresponds to the ’-T’ hdparm option.

cache-deviation

The factor of allowed standard deviations from median, used to search for outlier values. The allowed range is (median-/+ deviation * stddev).Default: 3

device

The disk device to be measured.Default: the disk device corresponding to the ’/’ partition.

device-read

The minimum acceptable read rate, in MB/s, from the disk device, reading through cache. This corresponds to the ’-t’hdparm option.

device-deviation

The factor of allowed standard deviations from median, used to search for outlier values. The allowed range is (median-/+ deviation * stddev).Default: 3

Example

<hdparm><cache-deviation>3</cache-deviation><cache-read>2400</cache-read><device>/dev/sda1</device><device-deviation>3</device-deviation><device-read>60</device-read>

</hdparm>

MODULE CLASS

unit

50

Page 51: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 hdparm

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

hdparm

mount

NOTES

This check is not appropriate for diskless nodes and should be excluded.

51

Page 52: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 home

39 home

Check Intel(R) Cluster Ready Compliance

DESCRIPTION

Check that /home meets requirements.

METHOD

Compare the inode number of a directory in /home on all nodes.

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

stat

perl

NOTES

If running as a privileged user and /home is managed by automount, at least one user account should be created priorto running this check.

52

Page 53: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 host_conf

40 host_conf

Check the configuration of /etc/host.conf

DESCRIPTION

host_conf is an Intel(R) Cluster Checker module to verify that host.conf has set the resolution order to hosts, nis, bind.If host.conf is missing or the order line is missing from host.conf, the module returns indeterminate.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

cat

/etc/host.conf

53

Page 54: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 hostname

41 hostname

Check the hostname of each node

DESCRIPTION

hostname is an Intel(R) Cluster Checker module to verify that the node hostname is correct. The hostname of the nodedetermined byhostname is compared to the hostname from a configuration file or to the one received by DHCP.The module attempts to read the hostname configuration setting from /etc/sysconfig/network, /etc/HOSTNAME,/etc/hostname, and /etc/sysconfig/system (in that order). Also the module will check the DHCP leases to see if thehostname has been received by DHCP. If only the short hostnames match, the check is considered successful, althougha different status message is displayed. If no match if found the test will fail and will display the available hostnamesagainst which the active hostname was compared.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

cat

hostname

/etc/sysconfig/network

/etc/HOSTNAME

/etc/hostname

/etc/sysconfig/system

54

Page 55: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 hpcc

42 hpcc

Run the HPC Challenge Benchmarks

DESCRIPTION

hpcc is an Intel(R) Cluster Checker module that runs the HPC Challenge benchmark suite. The HPCC benchmarksuite includes 7 benchmarks. Please seehttp://icl.cs.utk.edu/hpcc/ for details. HPCC was built with the Intel(R)C++ Compiler using the ’-O3’ and ’-openmp’ flags.The HPCC input parameters are read from a template input file (see thehpccinf parameter.) The P, Q, and Nparameters are modified from the values in the file. N is set to the file value multiplied by the square root of thenumber of nodes. For example, if the value in the file is 8,000 and the test is run on 8 nodes, the value of N is 22,627.P and Q are set so that P is maximized subject to the following constraints:

P <= Q

P*Q is equal to the total number of processes(not threads)

P and Q are integers

CONFIGURATION

build

Build HPCC from source rather than using the prebuilt binary (external/hpcc .) If true, the Intel(R) C Compiler,Intel(R) MPI Library, and Intel(R) Math Kernel Library must be available; theintel_cc andintel_mpi_internodemodules should be added as dependencies and theintel_cce_rtl andintel_mpi_rt_internode modulesshould be removed as dependencies.Default: false

cc-path

The base path to the Intel(R) C++ Compiler directory. Setting this parameter will automatically setup the environment.Default: none (inherit environment)

fabric

A container for the network interconnect fabric to evaluate. The<fabric> block may be repeated to test multipleinterconnects.

bandwidth The minimum acceptable network bandwidth in GB/s.If a bandwidth value is not specified, then thebandwidth check will be indeterminate.Default: none

device The Intel(R) MPI Library device string, i.e., I_MPI_DEVICEDefault: rdssm

dgemm The minimum acceptable DGEMM performance in GFLOPS.If a dgemm value is not specified, then thedgemm check will be indeterminate.

fft The minimum acceptable FFT performance in GFLOPS.If a fft value is not specified, then the fft check will beindeterminate.

55

Page 56: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 hpcc

hpl The minimum acceptable HP Linpack performance in TFLOPS.If a hpl value is not specified, then the hpl checkwill be indeterminate.Default: none

latency The maximum acceptable network latency in microseconds.If a latency value is not specified, then thelatency check will be indeterminate.Default: none

ptrans The minimum acceptable PTRANS performance in GB/s.If a ptrans value is not specified, then the ptranscheck will be indeterminate.

randomaccess The minimum acceptable RandomAccess performance in GUPs/s.If a randomaccess value is notspecified, then the randomaccess check will be indeterminate.

stream The minimum acceptable STREAM Triad performance in GB/s.If a stream value is not specified, then thestream check will be indeterminate.

hpccinf

The path to the template input file to use.Default: none (useexternal/hpccinf.txt )

mkl-path

The base path to the Intel(R) Math Kernel Library installation directory. Setting this parameter will automaticallysetup the environment.Default: none (inherit environment)

mpi-path

The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

process-number

The number of MPI processes to start on each nodeDefault: 1

thread-number

The number of OpenMP threads to start on each node. This setting corresponds to the OMP_NUM_THREADSenvironment variable.Default: ALL

Example

<hpcc><cc-path>/opt/intel/cce/9.1</cc-path><fabric>

<bandwidth>0.110</bandwidth><device>sock</device>

56

Page 57: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 hpcc

<dgemm>12</dgemm><fft>1</fft><hpl>0.002</hpl><latency>20</latency><ptrans>0.2</ptrans><randomaccess>0.001</randomaccess><stream>1.7</stream>

</fabric><mkl-path>/opt/intel/cmkl/9.0</mkl-path><mpi-path>/opt/intel/mpi/3.0</mpi-path><process-number>1</process-number><thread-number>ALL</thread-number>

</hpcc>

MODULE CLASS

vector

DEPENDENCIES

arch

intel_cce_rtl

intel_mpi_rt_internode

perl

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) C++ Compiler 9.1 or later

Intel(R) Math Kernel Library 9.0 or later

Intel(R) MPI Library 3.0 or later

perl

sed

rm

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<cc-path>, <mkl-path>, and<mpi-path> tags to override.

57

Page 58: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ib_firmware

43 ib_firmware

Check that the InfiniBand firmware is uniform on all nodes

DESCRIPTION

ib_firmware is an Intel(R) Cluster Checker module to verify the firmware version on the InfiniBand cards on each node.The module can determine the firmware from drivers that use/sys/class/infiniband/mthcaN/fw_ver ,such as OFED (*).

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

mount_proc

genuine_intel

EXTERNAL DEPENDENCIES

grep

a supported IB driver

58

Page 59: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ibadm

44 ibadm

Check that Mellanox in-band monitor processes are not running

DESCRIPTION

ibadm is an Intel(R) Cluster Checker module to verify that the Mellanox Infiniband in-band monitor is not running.The module checks the process list for entries matching ’ibgd’, ’ibadm’, ’ibis’, and ’obbs-pci’.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ps

59

Page 60: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 icr_version_1_0

45 icr_version_1_0

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that /etc/intel/icr contains the POSIX shell environment variable CLUSTER_READY_VERSION and that it isset to the value 1.0.

METHOD

Check that the /etc/intel/icr file contains ’CLUSTER_READY_VERSION=1.0’.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

cat

60

Page 61: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 icr_version_1_1

46 icr_version_1_1

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that /etc/intel/icr contains the POSIX shell environment variable CLUSTER_READY_VERSION and that it isset to the value 1.1.

METHOD

Check that the /etc/intel/icr file contains ’CLUSTER_READY_VERSION=1.1’.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

cat

61

Page 62: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 igb

47 igb

Check the Intel(R) PRO/1000 Network Driver (igb)

DESCRIPTION

igb is an Intel(R) Cluster Checker module to verify the igb kernel module. The module checks that the kernel moduleis loaded, the version and module file are uniform across the cluster, and the Interrupt throttling options are set.

CONFIGURATION

options

The string that is compared to the igb driver options.Default: options igb InterruptThrottleRate=0,0

version

The string that is compared to the version string in the igb kernel module.Default: noneIf versionis not specified in the configuration file, then the igb kernel module version is not checked.

Example

<igb><options>options igb InterruptThrottleRate=0,3</options><version>1.2.30</version>

</igb>

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

genuine_intel

EXTERNAL DEPENDENCIES

grep

lsmod

md5sum

modprobe

strings

62

Page 63: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 imb_collective_intel_mpi

48 imb_collective_intel_mpi

Check MPI collectives over the whole cluster

DESCRIPTION

imb_collective_intel_mpi is an Intel(R) Cluster Checker module that runs the Intel(R) MPI Benchmarks for a specifiedset of collectives. While the test may verify performance at some point, it currently only verifies that the benchmarksuccessfully ran.

CONFIGURATION

benchmark

The name of the Intel(R) MPI Benchmark to run. The<benchmark> tag may be repeated to run multiple benchmarks.See the Intel(R) MPI Benchmark documentation for a list of available collective benchmarks (typically the name ofthe MPI collective operation with the ’MPI_’ prefix removed, e.g., bcast for MPI_Bcast.)Default: barrier

build

Build the Intel(R) MPI Benchmarks from source rather than using the prebuilt binary (external/IMB-MPI1 .) Iftrue, the Intel(R) MPI Library SDK must be available; theintel_mpi_internode module should also be addedas a dependency.Default: false

fabric

A container for the network interconnect fabric to evaluate. The<fabric> block may be repeated to test multipleinterconnects.

device The Intel(R) MPI Library device string, i.e., I_MPI_DEVICEDefault: rdssm

msglen

Override the default IMB message length sequence of 0 to 2ˆi, where i ranges from 0 to 22. Instead, use the messagelengths defined inexternal/IMB_msglen : 0, 1, 2, 4, and 4,194,304. Use the<msglen/> tag to indicate true.Note: this will reduce the time required to run this test but may not detect some failing cases.Default: false

mpi-path

The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

Example

<imb_collective_intel_mpi><benchmark>barrier</benchmark><benchmark>bcast</benchmark><fabric>

63

Page 64: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 imb_collective_intel_mpi

<device>sock</device></fabric><fabric>

<device>rdssm</device></fabric><mpi-path>/opt/intel/mpi-rt/3.0</mpi-path>

</imb_collective_intel_mpi>

MODULE CLASS

vector

DEPENDENCIES

gcc

intel_mpi_rt_internode

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) MPI Library 3.0 or later

mktemp

rm

tar

test

uname

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.

64

Page 65: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 imb_message_integrity_intel_mpi

49 imb_message_integrity_intel_mpi

Check MPI message integrity over the whole cluster.

DESCRIPTION

imb_message_integrity_intel_mpi is an Intel(R) Cluster Checker module that compiles and runs the Intel(R) MPIBenchmarks checking the integrity of the messages by using the Sendrcv test. The test checks every message passingresult against the expected outcome and reports the defects of communication.

CONFIGURATION

build

Build the Intel(R) MPI Benchmarks from source rather than using the prebuilt binary (external/IMB-MPI1-DCHECK .)If true, the Intel(R) MPI Library SDK must be available; theintel_mpi_internode module should also be addedas a dependency.Default: false

fabric

A container for the network interconnect fabric to evaluate. The<fabric> block may be repeated to test multipleinterconnects.

device The Intel(R) MPI Library device string, i.e., I_MPI_DEVICEDefault: rdssm

msglen

Override the default IMB message length sequence of 0 to 2ˆi, where i ranges from 0 to 22. Instead, use the messagelengths defined inexternal/IMB_msglen : 0, 1, 2, 4, and 4,194,304. Use the<msglen/> tag to indicate true.Note: this will reduce the time required to run this test but may not detect some failing cases.Default: false

mpi-path

The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

Example

<imb_message_integrity_intel_mpi><fabric>

<device>sock</device></fabric><fabric>

<device>rdssm</device></fabric><mpi-path>/opt/intel/mpi-rt/3.0</mpi-path>

</imb_message_integrity_intel_mpi>

65

Page 66: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 imb_message_integrity_intel_mpi

MODULE CLASS

vector

DEPENDENCIES

gcc

intel_mpi_internode

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) MPI Library 3.0 or later

mktemp

rm

tar

test

uname

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.

66

Page 67: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 imb_pingpong_intel_mpi

50 imb_pingpong_intel_mpi

Check network interconnect performance

DESCRIPTION

imb_pingpong_intel_mpi is an Intel(R) Cluster Checker module to verify the network interconnect performance andvariation over the cluster nodeswith thepingpongIntel(R) MPI Benchmark. The module measures the performancebetween each pair of nodes.

CONFIGURATION

build

Build the Intel(R) MPI Benchmarks from source rather than using the prebuilt binary (external/IMB-MPI1 .) Iftrue, the Intel(R) MPI Library SDK must be available; theintel_mpi_internode module should also be addedas a dependency.Default: false

fabric

A container for the network interconnect fabric to evaluate. The<fabric> block may be repeated to test multipleinterconnects.

device The Intel(R) MPI Library device string, i.e., I_MPI_DEVICEDefault: rdssm

latency The maximum acceptable latency in microseconds.If a latency value is not specified, then no latency checkis performed.Default: none

latency-deviation The factor of allowed standard deviations from median, used to search for outlier values. Theallowed range is (median -/+ deviation * stddev).Default: 3

bandwidth The minimum acceptable bandwidth in MB/sec.If a bandwidth is not specified, then no bandwidthcheck is performed.Default: none

bandwidth-deviation The factor of allowed standard deviations from median, used to search for outlier values. Theallowed range is (median -/+ deviation * stddev).Default: 3

msglen

Override the default IMB message length sequence of 0 to 2ˆi, where i ranges from 0 to 22. Instead, use the messagelengths defined inexternal/IMB_msglen : 0, 1, 2, 4, and 4,194,304. Use the<msglen/> tag to indicate true.Note: this will reduce the time required to run this test but may not detect some failing cases.Default: false

67

Page 68: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 imb_pingpong_intel_mpi

mpi-path

The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

Example

<imb_pingpong_intel_mpi><fabric>

<device>sock</device><bandwidth>110</bandwidth><bandwidth-deviation>3</bandwidth-deviation><latency>35</latency><latency-deviation>3</latency-deviation>

</fabric><fabric>

<device>rdssm</device><bandwidth>620</bandwidth><bandwidth-deviation>3</bandwidth-deviation><latency>10</latency><latency-deviation>3</latency-deviation>

</fabric><mpi-path>/opt/intel/mpi/3.0</mpi-path>

</imb_pingpong_intel_mpi>

MODULE CLASS

matrix

DEPENDENCIES

arch

gcc

intel_mpi_rt_internode

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) MPI Library 3.0 or later

mktemp

rm

tar

68

Page 69: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 imb_pingpong_intel_mpi

test

touch

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.

69

Page 70: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_cc

51 intel_cc

Check the Intel(R) C++ Compiler

DESCRIPTION

intel_cc is an Intel(R) Cluster Checker module to verify the Intel(R) C++ Compiler. The module checks the compilerversion and builds and executes C and C++ ’Hello World’ programs.Note: this module builds the Hello World binaries from source using the Intel(R) C++ Compiler. If you wish to checkthe functionality of the compiler runtime only, see theintel_cce_rtl module.

CONFIGURATION

cc-path

The base path to the Intel(R) C++ compiler. Setting this parameter will automatically setup the environment.Default: none (inherit environment)

version

The string that is compared to the Intel(R) C++ compiler build stamp.Default: noneIf versionis not specified in the configuration file, then the specific compiler version will not be checked.The samenessof the package ID is compared across the cluster regardless.

Example

<intel_cc><cc-path>/opt/intel/cce/9.1</cc-path>

</intel_cc>

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) C++ Compiler

test

which

70

Page 71: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_cc

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<cc-path> tag to override.

71

Page 72: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_cc_rtl_9_1

52 intel_cc_rtl_9_1

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the 32-bit Intel(R) C++ Compiler runtime meets requirements.

METHOD

Compare the 32-bit Intel(R) C++ Compiler runtime version to 9.1Confirm that the following files exist on all nodes:

/opt/intel/cc/<version>/lib/libcprts.so (9.x only)

/opt/intel/cc/<version>/lib/libcprts.so.5 (9.x only)

/opt/intel/cc/<version>/lib/libcxa.so (9.x only)

/opt/intel/cc/<version>/lib/libcxa.so.5 (9.x only)

/opt/intel/cc/<version>/lib/libcxaguard.so

/opt/intel/cc/<version>/lib/libcxaguard.so.5

/opt/intel/cc/<version>/lib/libguide.so

/opt/intel/cc/<version>/lib/libguide_stats.so

/opt/intel/cc/<version>/lib/libimf.so

/opt/intel/cc/<version>/lib/libintlc.so (10.x only)

/opt/intel/cc/<version>/lib/libintlc.so.5 (10.x only)

/opt/intel/cc/<version>/lib/libirc.so

/opt/intel/cc/<version>/lib/libsvml.so

/opt/intel/cc/<version>/lib/libunwind.so (9.x only)

/opt/intel/cc/<version>/lib/libunwind.so.5 (9.x only)

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

72

Page 73: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_cc_rtl_9_1

EXTERNAL DEPENDENCIES

perl

Intel(R) C++ Compiler 9.1 or later

73

Page 74: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_cce_rtl

53 intel_cce_rtl

Check the Intel(R) C++ Compiler runtime libraries

DESCRIPTION

intel_cce_rtl is an Intel(R) Cluster Checker module to verify the Intel(R) C++ Compiler runtime libraries. The moduleruns C and C++ Hello World binaries on the compute nodes.Note: this module does not build the Hello World binaries using the Intel(R) C++ Compiler. If you wish to check thefunctionality of the compiler itself, see theintel_cc module.

CONFIGURATION

cc-path

The base path to the Intel(R) C++ Compiler installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

Example

<intel_cce_rtl><cc-path>/opt/intel/cce/9.1</cc-path>

</intel_cce_rtl>

MODULE CLASS

unit

DEPENDENCIES

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) C++ Compiler 9.1 or later runtime

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH) inherited from the user running Intel(R) ClusterChecker is setup correctly.

74

Page 75: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_cce_rtl_9_1

54 intel_cce_rtl_9_1

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the 64-bit Intel C++ Compiler runtime meets requirements.

METHOD

Compare the 64-bit Intel(R) C++ Compiler runtime version to 9.1Confirm that the following files exist on all nodes:

/opt/intel/cce/<version>/lib/libclusterguide.so (9.x only)

/opt/intel/cce/<version>/lib/libclusterguide_stats.so (9.x only)

/opt/intel/cce/<version>/lib/libcprts.so (9.x only)

/opt/intel/cce/<version>/lib/libcprts.so.5 (9.x only)

/opt/intel/cce/<version>/lib/libcxa.so (9.x only)

/opt/intel/cce/<version>/lib/libcxa.so.5 (9.x only)

/opt/intel/cce/<version>/lib/libcxaguard.so

/opt/intel/cce/<version>/lib/libcxaguard.so.5

/opt/intel/cce/<version>/lib/libguide.so

/opt/intel/cce/<version>/lib/libguide_stats.so

/opt/intel/cce/<version>/lib/libimf.so

/opt/intel/cce/<version>/lib/libintlc.so (10.x only)

/opt/intel/cce/<version>/lib/libintlc.so.5 (10.x only)

/opt/intel/cce/<version>/lib/libirc.so

/opt/intel/cce/<version>/lib/libomp_db.so

/opt/intel/cce/<version>/lib/libompstub.so (10.x only)

/opt/intel/cce/<version>/lib/libsvml.so

/opt/intel/cce/<version>/lib/libunwind.so (9.x only)

/opt/intel/cce/<version>/lib/libunwind.so.5 (9.x only)

CONFIGURATION

None

MODULE CLASS

unit

75

Page 76: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_cce_rtl_9_1

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

perl

Intel(R) C++ Compiler 9.1 or later

76

Page 77: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_cmkl_rtl_9_0

55 intel_cmkl_rtl_9_0

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the Intel(R) Math Kernel Library Cluster Edition runtime meets requirements.

METHOD

Compare the Intel(R) Math Kernel Library Cluster Edition runtime version to 9.0.Confirm that the following files exist on all nodes:

/opt/intel/cmkl/<version>/lib/32/libguide.so

/opt/intel/cmkl/<version>/lib/32/libmkl.so

/opt/intel/cmkl/<version>/lib/32/libmkl_core.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/32/libmkl_def.so

/opt/intel/cmkl/<version>/lib/32/libmkl_gf.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/32/libmkl_ias.so (9.0 and 9.1 only)

/opt/intel/cmkl/<version>/lib/32/libmkl_intel.so (10.0 and later)

/opt/intel/mckl/<version>/lib/32/libmkl_intel_thread.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/32/libmkl_lapack.so (9.1 and later)

/opt/intel/cmkl/<version>/lib/32/libmkl_lapack32.so (9.0 only)

/opt/intel/cmkl/<version>/lib/32/libmkl_lapack64.so (9.0 only)

/opt/intel/cmkl/<version>/lib/32/libmkl_p3.so (up to 10.0)

/opt/intel/cmkl/<version>/lib/32/libmkl_p4.so

/opt/intel/cmkl/<version>/lib/32/libmkl_p4m.so

/opt/intel/cmkl/<version>/lib/32/libmkl_p4p.so

/opt/intel/cmkl/<version>/lib/32/libmkl_sequential.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/32/libmkl_vml_def.so

/opt/intel/cmkl/<version>/lib/32/libmkl_vml_ia.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p3.so (up to 10.0)

/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p4.so

/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p4m.so

/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p4m2.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p4p.so

/opt/intel/cmkl/<version>/lib/32/libvml.so (9.0 and 9.1 only)

77

Page 78: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_cmkl_rtl_9_0

/opt/intel/cmkl/<version>/lib/em64t/libguide.so

/opt/intel/cmkl/<version>/lib/em64t/libmkl.so

/opt/intel/cmkl/<version>/lib/em64t/libmkl_core.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_def.so

/opt/intel/cmkl/<version>/lib/em64t/libmkl_gf_ilp64.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_gf_lp64.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_ias.so (9.0 and 9.1 only)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_intel_ilp64.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_intel_lp64.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_intel_sp2dp.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_intel_thread.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_lapack.so (9.1 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_lapack32.so (9.0 only)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_lapack64.so (9.0 only)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_mc.so

/opt/intel/cmkl/<version>/lib/em64t/libmkl_p4n.so

/opt/intel/cmkl/<version>/lib/em64t/libmkl_sequential.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_vml_def.so

/opt/intel/cmkl/<version>/lib/em64t/libmkl_vml_mc.so

/opt/intel/cmkl/<version>/lib/em64t/libmkl_vml_mc2.so (10.0 and later)

/opt/intel/cmkl/<version>/lib/em64t/libmkl_vml_p4n.so

/opt/intel/cmkl/<version>/lib/em64t/libvml.so (9.0 and 9.1 only)

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

perl

Intel(R) Math Kernel Library Cluster Edition 9.0 or later

78

Page 79: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_devtools_1_0

56 intel_devtools_1_0

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the developer tools meets requirements.

METHOD

Compare the Intel(R) C++ Compiler version to 9.1.038 and confirm that the following files exist:

bin/icc

Compare the Intel(R) Fortran Compiler version to 9.1.032 and confirm that the following files exist:

bin/ifort

Compare the Intel(R) MPI Library version to 3.0.033 and confirm that the following files exist:

bin/mpicc

bin/mpicxx

bin/mpif77

bin/mpif90

bin/mpiicc

bin/mpiicpc

bin/mpiifort

bin64/mpicc

bin64/mpicxx

bin64/mpif77

bin64/mpif90

bin64/mpiicc

bin64/mpiicpc

bin64/mpiifort

Compare the Intel(R) Math Kernel Library Cluster Edition version to 9.0.017.Compare the Intel(R) Trace Analyzer and Collector version to 7.0.1 and confirm that the following files exist:

bin/traceanalyzer

Compare the Intel(R) Debugger version to 9.1 and confirm that the following files exist:

bin/idb

Compare the Intel(R) Thread Checker version to 3.0.Compare the Intel(R) Thread Profiler version to 1.0.Compare the Intel(R) VTune(TM) Performance Analyzer version to 8.0.4.

79

Page 80: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_devtools_1_0

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

genuine_intel

perl

sh

EXTERNAL DEPENDENCIES

find

grep

Intel Software Tools

perl

xargs

80

Page 81: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_fc

57 intel_fc

Check the Intel(R) Fortran compiler

DESCRIPTION

intel_fc is an Intel(R) Cluster Checker module to verify the Intel(R) Fortran Compiler. The module examines thecompiler version and builds and executes a Fortran ’Hello World’ program.Note: this module builds the Hello World binary from source using the Intel(R) Fortran Compiler. If you wish to checkthe functionality of the compiler runtime only, see theintel_fce_rtl module.

CONFIGURATION

fc-path

The base path to the Intel(R) Fortran Compiler. Setting this parameter will automatically setup the environment.Default: none (inherit environment)

version

The string that is compared to the Intel(R) Fortran Compiler build stamp.Default: noneIf versionis not specified in the configuration file, then the specific compiler version will not be checked.The unifor-mity of the package ID is compared across the cluster regardless.

Example

<intel_fc><fc-path>/opt/intel/fce/9.1</fc-path>

</intel_fc>

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) Fortran Compiler

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<fc-path> tag to override.

81

Page 82: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_fc_rtl_9_1

58 intel_fc_rtl_9_1

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the 32-bit Intel(R) Fortran Compiler runtime meets requirements.

METHOD

Compare the 32-bit Intel(R) Fortran Compiler runtime version to 9.1Confirm that the following files exist on all nodes:

/opt/intel/fc/<version>/lib/libcprts.so (9.x only)

/opt/intel/fc/<version>/lib/libcprts.so.5 (9.x only)

/opt/intel/fc/<version>/lib/libcxa.so (9.x only)

/opt/intel/fc/<version>/lib/libcxa.so.5 (9.x only)

/opt/intel/fc/<version>/lib/libcxaguard.so

/opt/intel/fc/<version>/lib/libcxaguard.so.5

/opt/intel/fc/<version>/lib/libguide.so

/opt/intel/fc/<version>/lib/libguide_stats.so

/opt/intel/fc/<version>/lib/libifcoremt.so

/opt/intel/fc/<version>/lib/libifcoremt.so.5

/opt/intel/fc/<version>/lib/libifcore.so

/opt/intel/fc/<version>/lib/libifcore.so.5

/opt/intel/fc/<version>/lib/libifport.so

/opt/intel/fc/<version>/lib/libifport.so.5

/opt/intel/fc/<version>/lib/libimf.so

/opt/intel/fc/<version>/lib/libintlc.so (10.x only)

/opt/intel/fc/<version>/lib/libintlc.so.5 (10.x only)

/opt/intel/fc/<version>/lib/libirc.so

/opt/intel/fc/<version>/lib/libsvml.so

/opt/intel/fc/<version>/lib/libunwind.so (9.x only)

/opt/intel/fc/<version>/lib/libunwind.so.5 (9.x only)

CONFIGURATION

None

82

Page 83: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_fc_rtl_9_1

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

perl

Intel(R) Fortran Compiler 9.1 or later

83

Page 84: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_fce_rtl

59 intel_fce_rtl

Check the Intel(R) Fortran compiler runtime libraries

DESCRIPTION

intel_fce_rtl is an Intel(R) Cluster Checker module to verify the Intel(R) Fortran compiler runtime libraries. Themodule runs a Fortran Hello World binary on the compute nodes.Note: this module does not build the Hello World binary using the Intel(R) Fortran Compiler. If you wish to check thefunctionality of the compiler itself, see theintel_fc module.

CONFIGURATION

fc-path

The base path to the Intel(R) Fortran Compiler installation directory. Setting this parameter will automatically setupthe environment.Default: none (inherit environment)

Example

<intel_fce_rtl><fc-path>/opt/intel/fce/9.1</fc-path>

</intel_fce_rtl>

MODULE CLASS

unit

DEPENDENCIES

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) Fortran Compiler 9.1 or later

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH) inherited from the user running Intel(R) ClusterChecker is setup correctly.

84

Page 85: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_fce_rtl_9_1

60 intel_fce_rtl_9_1

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the 64-bit Intel(R) Fortran Compiler runtime meets requirements.

METHOD

Compare the 64-bit Intel(R) Fortran Compiler runtime version to 9.1Confirm that the following files exist on all nodes:

/opt/intel/fce/<version>/lib/libclusterguide.so (9.x only)

/opt/intel/fce/<version>/lib/libclusterguide_stats.so (9.x only)

/opt/intel/fce/<version>/lib/libcprts.so (9.x only)

/opt/intel/fce/<version>/lib/libcprts.so.5 (9.x only)

/opt/intel/fce/<version>/lib/libcxa.so (9.x only)

/opt/intel/fce/<version>/lib/libcxa.so.5 (9.x only)

/opt/intel/fce/<version>/lib/libcxaguard.so

/opt/intel/fce/<version>/lib/libcxaguard.so.5

/opt/intel/fce/<version>/lib/libguide.so

/opt/intel/fce/<version>/lib/libguide_stats.so

/opt/intel/fce/<version>/lib/libifcoremt.so

/opt/intel/fce/<version>/lib/libifcoremt.so.5

/opt/intel/fce/<version>/lib/libifcore.so

/opt/intel/fce/<version>/lib/libifcore.so.5

/opt/intel/fce/<version>/lib/libifport.so

/opt/intel/fce/<version>/lib/libifport.so.5

/opt/intel/fce/<version>/lib/libimf.so

/opt/intel/fce/<version>/lib/libintlc.so (10.x only)

/opt/intel/fce/<version>/lib/libintlc.so.5 (10.x only)

/opt/intel/fce/<version>/lib/libirc.so

/opt/intel/fce/<version>/lib/libomp_db.so

/opt/intel/fce/<version>/lib/libompstub.so (10.x only)

/opt/intel/fce/<version>/lib/libsvml.so

/opt/intel/fce/<version>/lib/libunwind.so (9.x only)

/opt/intel/fce/<version>/lib/libunwind.so.5 (9.x only)

85

Page 86: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_fce_rtl_9_1

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

perl

Intel(R) Fortran Compiler 9.1 or later

86

Page 87: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi

61 intel_mpi

Check the Intel(R) MPI Library

DESCRIPTION

intel_mpi is an Intel(R) Cluster Checker module to verify the functionality of Intel(R) MPI Library on each node.This test does not verify that inter-node functionality of Intel(R) MPI Library. The module checks the permissions on$HOME/.mpd.conf, starts and stops mpds, compiles a MPI Hello World binary from source, and runs the program onone or more Intel(R) MPI Library devices.Note: this module builds the MPI Hello World binary from source using the Intel(R) MPI Library compiler wrappers(i.e., mpicc.) If you wish to check the functionality of the MPI runtime only, see theintel_mpi_rt module.

CONFIGURATION

device

The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm

mpi-path

The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

process-number

The number of MPI processes to start on each nodeDefault: 4

Example

<intel_mpi><device>sock</device><device>rdssm</device><mpi-path>/opt/intel/mpi/3.0</mpi-path><process-number>2</process-number>

</intel_mpi>

MODULE CLASS

unit

DEPENDENCIES

gcc

hostname

loopback

python

87

Page 88: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi

sh

shm_mount

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) MPI Library

mktemp

stat

rm

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.

88

Page 89: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_internode

62 intel_mpi_internode

Check the Intel(R) MPI Library

DESCRIPTION

intel_mpi_internode is an Intel(R) Cluster Checker module to verify the functionality of Intel(R) MPI Library on thewhole cluster. The module starts and stops mpds, compiles a MPI Hello World program from source, and runs theprogram on one or more Intel(R) MPI Library devices.Note: this module builds the MPI Hello World binary from source using the Intel(R) MPI Library compiler wrappers(i.e., mpicc.) If you wish to check the functionality of the MPI runtime only, see theintel_mpi_rt_internodemodule.

CONFIGURATION

device

The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm

mpi-path

The base path to the Intel(R) MPI Library directory. Setting this parameter will automatically setup the environment.Default: none (inherit environment)

process-number

The number of MPI processes to start on each nodeDefault: 4

Example

<intel_mpi_internode><device>sock</device><device>rdssm</device><mpi-path>/opt/intel/mpi/3.0</mpi-path>

</intel_mpi_internode>

MODULE CLASS

vector

DEPENDENCIES

gcc

hostname

intel_mpi

loopback

sh

ssh_alltoall

89

Page 90: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_internode

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) MPI Library

mktemp

rm

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.Automatically sets I_MPI_USE_DYNAMIC_CONNECTIONS=1.

90

Page 91: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_rt

63 intel_mpi_rt

Check the functionality of the Intel(R) MPI Library Runtime Environment

DESCRIPTION

intel_mpi_rt is an Intel(R) Cluster Checker module to verify the basic functionality of the Intel(R) MPI Library Run-time Environment on each node. This test does not verify that inter-node functionality of the Intel(R) MPI Library.The module checks the permissions on $HOME/.mpd.conf, starts and stops mpds, and run a MPI Hello World programon one or more Intel(R) MPI Library devices.Note: this module does not build the MPI Hello World binary using the Intel(R) MPI Library compiler wrappers (i.e.,mpicc.) If you wish to check the functionality of the MPI compiler wrappers, see theintel_mpi module.

CONFIGURATION

device

The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm

mpi-path

The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

process-number

The number of MPI processes to start on each nodeDefault: 4

Example

<intel_mpi_rt><device>sock</device><device>rdssm</device><path>/opt/intel/mpi-rt/3.0</path><process-number>2</process-number>

</intel_mpi_rt>

MODULE CLASS

unit

DEPENDENCIES

hostname

loopback

python

sh

91

Page 92: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_rt

shm_mount

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) MPI Library 3.0 or later

mktemp

stat

rm

NOTES

By default, assumes that the environment inherited from the user running Intel(R) Cluster Checker is setup correctly.See the<mpi-path> tag to override.

92

Page 93: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_rt_3_0_033

64 intel_mpi_rt_3_0_033

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the Intel(R) MPI Library runtime meets requirements.

METHOD

Compare the Intel(R) MPI Library version to 3.0 build 033.Confirm that the following files exist on all nodes:

/opt/intel/impi/ <version>/{bin,bin64}/mpdallexit

/opt/intel/impi/ <version>/{bin,bin64}/mpdallexit.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdboot

/opt/intel/impi/ <version>/{bin,bin64}/mpdboot.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdcheck

/opt/intel/impi/ <version>/{bin,bin64}/mpdcheck.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdchkpyver.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdcleanup

/opt/intel/impi/ <version>/{bin,bin64}/mpdcleanup.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdexit

/opt/intel/impi/ <version>/{bin,bin64}/mpdexit.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdgdbdrv.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdhelp

/opt/intel/impi/ <version>/{bin,bin64}/mpdhelp.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdkilljob

/opt/intel/impi/ <version>/{bin,bin64}/mpdkilljob.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdlib.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdlistjobs

/opt/intel/impi/ <version>/{bin,bin64}/mpdlistjobs.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdman.py

/opt/intel/impi/ <version>/{bin,bin64}/mpd

/opt/intel/impi/ <version>/{bin,bin64}/mpd.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdringtest

/opt/intel/impi/ <version>/{bin,bin64}/mpdringtest.py

93

Page 94: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_rt_3_0_033

/opt/intel/impi/ <version>/{bin,bin64}/mpdroot

/opt/intel/impi/ <version>/{bin,bin64}/mpdrun

/opt/intel/impi/ <version>/{bin,bin64}/mpdrun.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdsigjob

/opt/intel/impi/ <version>/{bin,bin64}/mpdsigjob.py

/opt/intel/impi/ <version>/{bin,bin64}/mpdtrace

/opt/intel/impi/ <version>/{bin,bin64}/mpdtrace.py

/opt/intel/impi/ <version>/{bin,bin64}/mpiexec

/opt/intel/impi/ <version>/{bin,bin64}/mpiexec.py

/opt/intel/impi/ <version>/{bin,bin64}/mpirun

/opt/intel/impi/ <version>/{bin,bin64}/mtv.so

/opt/intel/impi/ <version>/{lib,lib64}/libmpi.so.3.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpi.so.2.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpi.so

/opt/intel/impi/ <version>/{lib,lib64}/libmpi_mt.so.3.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpi_mt.so

/opt/intel/impi/ <version>/lib/libmpiec.so.3.1 (3.0 only)

/opt/intel/impi/ <version>/lib/libmpiec.so.2.1 (3.0 only)

/opt/intel/impi/ <version>/lib/libmpiec.so (3.0 only)

/opt/intel/impi/ <version>/lib/libmpief.so.3.1 (3.0 only)

/opt/intel/impi/ <version>/lib/libmpief.so.2.1 (3.0 only)

/opt/intel/impi/ <version>/lib/libmpief.so (3.0 only)

/opt/intel/impi/ <version>/lib/libmpigc.so.3.1 (3.0 only)

/opt/intel/impi/ <version>/lib/libmpigc.so.2.1 (3.0 only)

/opt/intel/impi/ <version>/lib/libmpigc.so (3.0 only)

/opt/intel/impi/ <version>/{lib,lib64}/libmpigc3.so.3.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpigc3.so.2.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpigc3.so

/opt/intel/impi/ <version>/{lib,lib64}/libmpigc4.so

/opt/intel/impi/ <version>/{lib,lib64}/libmpigf.so.3.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpigf.so.2.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpigf.so

94

Page 95: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_rt_3_0_033

/opt/intel/impi/ <version>/{lib,lib64}/libmpiic.so.3.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpiic.so.2.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpiic.so

/opt/intel/impi/ <version>/{lib,lib64}/libmpiic4.so

/opt/intel/impi/ <version>/{lib,lib64}/libmpiif.so.3.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpiif.so.2.1

/opt/intel/impi/ <version>/{lib,lib64}/libmpiif.so

/opt/intel/impi/ <version>/lib64/libmpigc4.so.3.1

/opt/intel/impi/ <version>/lib64/libmpiic4.so.3.1

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

perl

Intel(R) MPI Library 3.0 build 033 or later

95

Page 96: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_rt_internode

65 intel_mpi_rt_internode

Check the functionality of the Intel(R) MPI Library Runtime Environment

DESCRIPTION

intel_mpi_rt_internode is an Intel(R) Cluster Checker module to verify the basic functionality of Intel(R) MPI LibraryRuntime Environment over the whole cluster. The module starts and stops mpds, and runs a MPI Hello World programon one or more Intel(R) MPI Library devices.Note: this module does not build the MPI Hello World binary using the Intel(R) MPI Library compiler wrappers (i.e.,mpicc.) If you wish to check the functionality of the MPI compiler wrappers, see theintel_mpi_internodemodule.

CONFIGURATION

device

The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm

mpi-path

The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

process-number

The number of MPI processes to start on each nodeDefault: 4

Example

<intel_mpi_rt_internode><device>sock</device><device>rdssm</device><mpi-path>/opt/intel/mpi-rt/3.0</mpi-path>

</intel_mpi_rt_internode>

MODULE CLASS

vector

DEPENDENCIES

hostname

intel_mpi_rt

loopback

sh

ssh_alltoall

96

Page 97: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_rt_internode

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) MPI Library 3.0 or later

mktemp

rm

NOTES

By default, assumes that the environment inherited from the user running Intel(R) Cluster Checker is setup correctly.See the<mpi-path> tag to override.Automatically sets I_MPI_USE_DYNAMIC_CONNECTIONS=1.

97

Page 98: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_testsuite

66 intel_mpi_testsuite

Check the Intel(R) MPI Library runtime and network stack

DESCRIPTION

intel_mpi_testsuite is an Intel(R) Cluster Checker module to verify the Intel(R) MPI Library runtime and networkstack by running the Intel(R) MPI Library Test Suite.

CONFIGURATION

cc-path

The base path to the Intel(R) C++ Compiler installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

device

The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm

fc-path

The base path to the Intel(R) Fortran Compiler installation directory. Setting this parameter will automatically setupthe environment.Default: none (inherit environment)

mpi-path

The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

Example

<intel_mpi_testsuite><device>sock</device><device>rdssm:Openib-ib0</device><mpi-path>/opt/intel/mpi-rt/3.0</mpi-path>

</intel_mpi_testsuite>

MODULE CLASS

vector

DEPENDENCIES

intel_cce_rtl

intel_fce_rtl

intel_mpi_rt_internode

98

Page 99: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_mpi_testsuite

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) C++ Library 9.1 or later

Intel(R) Fortran Library 9.1 or later

Intel(R) MPI Library 3.0 or later

Intel(R) MPI Library Test Suite

mkdir

rm

tar

NOTES

By default, assumes that the environment inherited from the user running Intel(R) Cluster Checker is setup correctly.See the<cc-path>, <fc-path>, and<mpi-path> tags to override.

99

Page 100: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 intel_tbb_rtl_1_0

67 intel_tbb_rtl_1_0

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the Intel(R) Threading Building Blocks runtime meets requirements.

METHOD

Compare the Intel(R) Threading Building Blocks runtime version to 1.0.Confirm that the following files exist on all nodes:

/opt/intel/tbb/<version>/lib/libtbb.so

/opt/intel/tbb/<version>/lib/libtbb_debug.so

/opt/intel/tbb/<version>/lib/libtbbmalloc.so

/opt/intel/tbb/<version>/lib/libtbbmalloc_debug.so

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

perl

Intel(R) Threading Building Blocks 1.0 or later

100

Page 101: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ip_alltoall

68 ip_alltoall

Check the uniformity of IP addresses

DESCRIPTION

ip_alltoall.pm is an Intel(R) Cluster Checker module to verify the resolution of every host’s IP from every other host.

CONFIGURATION

None

MODULE CLASS

matrix

DEPENDENCIES

ssh

ping_alltoall

genuine_intel

EXTERNAL DEPENDENCIES

ping

101

Page 102: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ipoib

69 ipoib

Check that the InfiniBand devices are configured

DESCRIPTION

ipoib is an Intel(R) Cluster Checker module to verify that all the InfiniBand devices are properly configured.

CONFIGURATION

down

The specified device should not be verified as it is not configured. This option may be specified multiple times to markmore than one port as down.Default: none

Example

<ipoib><down>ib1</down>

</ipoib>

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ifconfig

102

Page 103: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 java_1_4_2

70 java_1_4_2

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the Java Runtime Environment meets requirements.

METHOD

Compare the Java Runtime Environment version to 1.4.2.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

Java Runtime Environment 1.4.2 or later

103

Page 104: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 jdk_1_4_2

71 jdk_1_4_2

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the Java Software Development Kit meets requirements.

METHOD

Compare the Java compiler version to 1.4.2.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

Java Software Development Kit 1.4.2 or later

104

Page 105: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 kernel

72 kernel

Check the uniformity of the kernel version on all nodes

DESCRIPTION

kernel is an Intel(R) Cluster Checker module to verify the kernel version of a node. The module examines the outputof ‘uname -r‘ to compare against the other nodes.

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

uname

105

Page 106: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 kernel_2_6_17

73 kernel_2_6_17

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the Linux kernel meets requirements.

METHOD

Compare the kernel version to the list of sufficient kernels.Note: see the Intel(R) Cluster Ready Specification for a list of sufficient kernels.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

uname

106

Page 107: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 kernel_modules

74 kernel_modules

Check the loaded Linux kernel modules

DESCRIPTION

kernel_modules is an Intel(R) Cluster Checker module to verify the list of loaded kernel modules. It verifies theuniformity of the kernel modules loaded on the compute nodes. If the head node is also a compute node, the test alsoverifies that modules loaded on the compute nodes are also loaded on the head node (although the head node may haveadditional modules loaded that are not present on the compute nodes.)

CONFIGURATION

exclude

Exclude this kernel module from the check, i.e., disregard any non-uniformity between nodes. May be specifiedmultiple times to exclude more than one module.Default: joydev , sr_mod , usb_storage , ohci_hcd

extended_match

Use the fulllsmod output when determining uniformity. By default, only whether the kernel module is loaded ischecked. Specifying this option includes the size, load count, and list of referring modules in the uniformity check.Default: false

module

Explicitly verify that the specified kernel module is loaded. May be specified multiple times to include more than onemodule.Default: none

Example

<kernel_modules><exclude>sunrpc</exclude><extended_match/><module>ipoib</module><module>e1000</module>

</kernel_modules>

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

lsmod

107

Page 108: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 kernel_parameters

75 kernel_parameters

Check the uniformity of the kernel runtime parameters

DESCRIPTION

kernel_parameters is an Intel(R) Cluster Checker module to verify the uniformity of the kernel runtime parametersamong all nodes. The module uses thesysctl program to collect the kernel parameters.

CONFIGURATION

exclude

Kernel parameter to exclude from the check based on a regular expression match. May be specified multiple times. Thefollowing parameters are automatically excluded:kernel.hostname , kernel.domainname , kernel.random* ,kernel.pty.nr , fs.dentry-state , fs.file-max , fs.file-nr , fs.inode-nr , fs.inode-state ,fs.nfs.nfs3_acl_max_entries , fs.nfsd.nfsd3_acl_max_entries , andfs.quota.syncs . Thefollowing parameter subsets are also excluded:net.ipv4.conf.* , net.ipv4.neigh.* , net.ipv4.netfilter.* ,net.ipv6.conf.* , net.ipv6.neigh.* , fs.nfs.* .

Example

<kernel_parameters><exclude>eth1</exclude>

</kernel_parameters>

MODULE CLASS

vector

DEPENDENCIES

mount_proc

sh

ssh

genuine_intel

EXTERNAL DEPENDENCIES

sysctl

108

Page 109: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ksh

76 ksh

Check the Korn Shell

DESCRIPTION

ksh is an Intel(R) Cluster Checker module to verify the Korn Shell. The module tests if/bin/ksh exists and runs a’Hello World’ script.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

/bin/ksh

109

Page 110: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 lib32_counterpart_lib64

77 lib32_counterpart_lib64

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the 32-bit libraries meet requirements.

METHOD

Confirm that all 32-bit libraries in the dynamic linker cache have a 64-bit counterpart.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

/sbin/ldconfig

110

Page 111: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 loopback

78 loopback

Check that the loopback address is correctly configured

DESCRIPTION

loopback is an Intel(R) Cluster Checker module to verify that the loopback address is correctly configured. Theloopback address (127.0.0.1) must correspond to ’localhost’ in /etc/hosts, and both must respond to ping.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

grep

ping

111

Page 112: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 memory_bandwidth_stream

79 memory_bandwidth_stream

Check the memory bandwidth of a node using the STREAM benchmark

DESCRIPTION

memory_bandwidth_stream is an Intel(R) Cluster Checker module to verify the memory bandwidth of each node usingthe Triad STREAM benchmark and their deviation over the cluster nodes. STREAM is configured to use a 10 millionelement array (requires 229 MB of memory.)

CONFIGURATION

bandwidth

The minimally acceptable Triad memory bandwidth, in MB/s.Default: none

build

Build STREAM from source rather than using the prebuilt binary (external/stream .) If true, the Intel(R) CCompiler must be available; theintel_cc module should be added as a dependency.

cc-path

The base path to the Intel(R) C++ Compiler / runtime libraries. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)

deviation

The factor of allowed standard deviations from median, used to search for outlier values. The allowed range is (median-/+ deviation * stddev).Default: 3

threads

The number of OpenMP threads for STREAM to use. This is equivalent to the OMP_NUM_THREADS environmentvariable.Default: ALL

Example

<memory_bandwidth_stream><bandwidth>3600</bandwidth><cc-path>/opt/intel/cce/9.1</cc-path><deviation>3</deviation>

</memory_bandwidth_stream>

MODULE CLASS

unit

112

Page 113: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 memory_bandwidth_stream

DEPENDENCIES

intel_cce_rtl

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) C++ Compiler 9.1 or later runtime

NOTES

By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<cc-path> tag to override.

113

Page 114: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 mflops_intel_mkl

80 mflops_intel_mkl

Check the floating point performance of a node

DESCRIPTION

mflops_intel_mkl is an Intel(R) Cluster Checker module to verify the floating point performance of each cluster node.The module executes the DGEMM library routine from the Intel(R) Math Kernel Library to measure the floating pointperformance and deviation over the cluster nodes. The environment variable OMP_NUM_THREADS is set to ALLso that all cores are used.

CONFIGURATION

build

Build the DGEMM benchmark from source rather than using the prebuilt binary. If true, the Intel(R) Math Kernel Li-brary must be in the linker path or thepath option should be used; thegcc module should be added as a dependency.Default: false

mflops

The minimum acceptable floating point performance in MFLOPS.Default: noneIf no MFLOPS threshold is defined, the module only confirms that the measured MFLOPS is greater than zero.

m, n, k

The matrix dimensions used in DGEMM. The total memory required is (m*n + m*k + n*k) * sizeof(double) bytes.Default: m = 5000, n = 5000, k = 112 (memory requirement = 200 MB if sizeof(double) is 8 bytes)

mkl-path

The base path to the Intel(R) Math Kernel Library.Default: none

deviation

The factor of allowed standard deviations from median, used to search for outlier values. The allowed range is (median-/+ deviation * stddev).Default: 3

Example

<mflops_intel_mkl><deviation>3</deviation><mkl-path>/opt/intel/cmkl/9.0</mkl-path><mflops>6000</mflops><m>5000</m><n>5000</n><k>112</k>

</mflops_intel_mkl>

114

Page 115: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 mflops_intel_mkl

MODULE CLASS

vector

DEPENDENCIES

gcc

sh

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

Intel(R) Math Kernel Library 9.0 or later

115

Page 116: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 mount_proc

81 mount_proc

Check that the procfs filesystem (/proc) is mounted

DESCRIPTION

mount_proc is an Intel(R) Cluster Checker module to verify that the procfs filesystem is mounted on /proc.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

EXTERNAL DEPENDENCIES

None

NOTES

Assumes that /proc is the mount point for procfs.

116

Page 117: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 mpi_consistency

82 mpi_consistency

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the MPI job startup commands meet requirements.

METHOD

Confirm that the path to mpirun and mpiexec are consistent on all nodes.

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

which

117

Page 118: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 network_consistency

83 network_consistency

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the resolution of hostnames and IPs meets requirements.

METHOD

Lookup the IP for each compute node name on each compute node.

CONFIGURATION

None

MODULE CLASS

matrix

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ping

118

Page 119: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 nfs_mounts

84 nfs_mounts

Check NFS mount points

DESCRIPTION

nfs_mounts is an Intel(R) Cluster Checker module to verify that NFS filesystems are mounted on the compute nodes.The test does not verify mountpoints on the head node.

CONFIGURATION

filesystem

Filesystem type to check. If this tag is defined defaults won’t be used.Default: nfs and autofs

mountpoint

The mountpoint to be checked. This option may be specified multiple times to check more than one mountpoint. If nomountpoints are specified, this test will return with an indeterminate result.Default: none

Example

<nfs_mounts><filesystem>nfs</filesystem><filesystem>ext3</filesystem><mountpoint>/opt</mountpoint><mountpoint>/shared</mountpoint>

</nfs_mounts>

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

mount

119

Page 120: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 nisdomain

85 nisdomain

Check that all nodes belong to the same NIS/YP domain

DESCRIPTION

nisdomain is an Intel(R) Cluster Checker module to verify the NIS domain of a node. The module usesnisdomainname .

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

genuine_intel

EXTERNAL DEPENDENCIES

nisdomainname

120

Page 121: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 nismaps

86 nismaps

Check the uniformity of the NIS/YP password map

DESCRIPTION

nismaps is an Intel(R) Cluster Checker module to verify the NIS password maps are uniform. The password maps areconsidered uniform if the md5 checksum is identical on all nodes.

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

nisdomain

genuine_intel

EXTERNAL DEPENDENCIES

md5sum

ypcat

121

Page 122: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 nsswitch

87 nsswitch

Check the configuration of /etc/nsswitch.conf

DESCRIPTION

nsswitch is an Intel(R) Cluster Checker module to verify the contents of the nsswitch.conf file. It verifies that/etc/nsswitch.conf is consistent across the cluster (formatting variations are allowed, so long as the service order isthe same).

CONFIGURATION

None

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

cat

122

Page 123: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 openib

88 openib

Check the OpenFabrics Enterprise Distribution InfiniBand driver

DESCRIPTION

openib is an Intel(R) Cluster Checker module to verify the OpenIB InfiniBand (*) driver provided by the OpenFabricsEnterprise Distribution. This module confirms that the InfiniBand adapters are the same hardware and firmwarerevisions, the ports are in the Active state, and have the same rate and capabilities. The memlock memory limits in/etc/security/limits.conf are also verified.

CONFIGURATION

Configuration options are divided in three categories:

ibstat-path

ibstat command installation directory. This may be needed if it is installed in a location not present in user’s PATH.

adapter

A container for the InfiniBand adapter device. The <adapter> block may be repeated to verify multiple In-finiBand adapters.

device

The device name of the adapter, e.g., mthca0. This option must be specified for each<adapter> container.

Default: none

down-port

The specified InfiniBand adapter port that should not be verified and is considered in down state. This optionmay be specified multiple times to mark more than one port as down.

Default: none

firmware-version

The firmware version string. If this parameter is not specified, only the uniformity of the firmware version ischecked.

Default: none

rate

The raw link rate in Gbit/s. If this parameter is not specified, only the uniformity of the link rate is checked.

Default: none

memlock

The minimum value, in bytes, of the hard and soft memlock limits.Default: 2000000

123

Page 124: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 openib

Example

<openib><adapter>

<device>mthca0</device><firmware-version>4.7.600</firmware-version><rate>10</rate>

</adapter><adapter>

<device>mthca1</device><down-port>2</down-port><rate>20</rate>

</adapter><ibstat-path>/usr/local/ofed/bin</ibstat-path><memlock>2000000</memlock>

</openib>

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

cat

OpenFabrics Enterprise Distribution

124

Page 125: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 openssh_3_9

89 openssh_3_9

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that OpenSSH (*) meets requirements.

METHOD

Compare the ssh version to OpenSSH 3.9.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ssh

125

Page 126: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 packages

90 packages

Check that the cluster has installed a reference set of given packages.

DESCRIPTION

packages is an Intel(R) Cluster Checker module to verify that a node is an exact copy of a reference system. Themodule uses the output of the-packages output of a reference system as the basis of the comparison.

CONFIGURATION

node

The path to a reference file with a list of installed packages.Default: none

head

The path to a reference file with a list of installed packagesDefault: none

Example

<packages><head>head.list</head><node>node.list</node>

</packages>

MODULE CLASS

unit

DEPENDENCIES

sh

ssh

rpm

genuine_intel

EXTERNAL DEPENDENCIES

rpm

cat

126

Page 127: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 pci

91 pci

Check the uniformity of the devices on the PCI bus

DESCRIPTION

pci is an Intel(R) Cluster Checker module to verify the device IDs connected to the PCI bus. The module collects PCIbus data usinglspci and compares the output across the cluster.

CONFIGURATION

exclude

The name of the PCI device ID and type description to exclude from the check. May be specified multiple times toexclude more than one.Default: none

Example

<pci><exclude>03:00.3 Serial controller</exclude>

</pci>

MODULE CLASS

vector

DEPENDENCIES

sh

ssh

genuine_intel

EXTERNAL DEPENDENCIES

lspci

127

Page 128: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 perl

92 perl

Check the Perl interpreter

DESCRIPTION

perl is an Intel(R) Cluster Checker module to verify the Perl interpreter. The module examines the Perl version andruns a ’Hello World’ one-liner.

CONFIGURATION

perl-path

The base path to the Perl interpreterDefault: /usr/bin

version

The string that is compared to the Perl version.Default: noneIf version is not specified in the configuration file, then the specific Perl version will not be checked. The uniformityof the version string is checked regardless.

Example

<perl><perl-path>/usr/bin</perl-path><version>5.8.7</version>

</perl>

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

Perl

128

Page 129: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 perl_5_6_1

93 perl_5_6_1

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that Perl meets requirements.

METHOD

Compare the Perl version to 5.6.1.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

perl

129

Page 130: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ping

94 ping

Check that all nodes respond to ping from the head node

DESCRIPTION

ping is an Intel(R) Cluster Checker module to verify that all nodes respond to ping from the head node.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

None

EXTERNAL DEPENDENCIES

ping

130

Page 131: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ping_alltoall

95 ping_alltoall

Check that all nodes can ping all other nodes

DESCRIPTION

ping_alltoall is an Intel(R) Cluster Checker module to verify that every node can ping every other node.

CONFIGURATION

None

MODULE CLASS

matrix

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ping

131

Page 132: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 portal

96 portal

Check that all nodes can ping the portal

DESCRIPTION

portal is an Intel(R) Cluster Checker module to verify that the site portal can be reached from all nodes of the cluster.The module pings the portal to test the connection.

CONFIGURATION

portal-name

The name of the site portal machine.Default: portal

Example

<portal><portal-name>portal</portal-name>

</portal>

MODULE CLASS

unit

DEPENDENCIES

sh

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ping

132

Page 133: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 process_check

97 process_check

Check for stale processes

DESCRIPTION

process_check is an Intel(R) Cluster Checker module to verify that the process list does not contain runaway processes(in terms of cpu or memory usage), zombies, or other stale processes.

CONFIGURATION

elapsed_time

Time (in seconds) that is used to define a stale process. See alsoexempt_uids .Default: 3600

exclude

Process names that are excluded from the check. This option may be repeated to exclude more than one process name.Default: none

exempt_uids

uids lower than this value are exempt from the elapsed time check. Daemons, etc. started from system accounts shouldnot be flagged as stale regardless of how long they have been running. See alsoelapsed_time .Default: 400

percent_cpu

Percentage of cpu that is used to define a runaway process. Note: on some systems, the percent cpu is defined relativeto a single core, on others it is relative to all cores.Default: 5

percent_memory

Percentage of memory that is used to define a runaway process.Default: 1

zombie_allowed_elapsed_time

Time (in seconds) that is used to allow transient zombies. Intel(R) Cluster Checker and other applications may createtransient zombies that are quicky, but not instantly reaped. Do not flag these transient zombies as ’true’ zombieprocesses unless their elapsed time is greater than this value.Default: 1

Example

<process_check><elapsed_time>3600</elapsed_time><exclude>ntpd</exclude><exclude>portmap</exclude><percent_cpu>5</percent_cpu><percent_memory>1</percent_memory>

</process_check>

133

Page 134: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 process_check

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ps

134

Page 135: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 python

98 python

Check the Python interpreter

DESCRIPTION

python is an Intel(R) Cluster Checker module to verify the Python interpreter. The module examines the Pythonversion and runs a ’Hello World’ one-liner.

CONFIGURATION

python-path

The base path to the Python interpreterDefault: /usr/bin

version

The string that is compared to the Python version.Default: noneIf versionis not specified in the configuration file, then the specific Python version will not be checked.The uniformityof the version string is checked regardless.

Example

<python><python-path>/usr/bin</python-path><version>2.2.3</version>

</python>

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

python

135

Page 136: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 python_2_3_4

99 python_2_3_4

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that python meets requirements

METHOD

Compare the python version to 2.3.4.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

python

136

Page 137: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 rpm

100 rpm

Check the installed RPM packages

DESCRIPTION

rpm is an Intel(R) Cluster Checker module to verify the list of installed rpms. It verifies the uniformity of the rpmsinstalled on the compute nodes. If the head node is also a compute node, the test also verifies that rpms installed onthe compute nodes are also installed on the head node (although the head node may have additional rpms that are notpresent on the compute nodes.)

CONFIGURATION

exclude

The full name of a rpm to exclude from the check. Note that this is the name returned by "rpm -q package", includingthe version number. May be specified multiple times to exclude more than one rpm.Default: none

rpm

Explicitly verify that the specified rpm is installed. May be specified multiple times to include more than one rpm.Default: none

Example

<rpm><exclude>rpm-4.3.3-9_nonptl</exclude><exclude>xterm-192-1</exclude><rpm>mpich-ch_p4-gcc-oscar-module-1.2.7-4</rpm>

</rpm>

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

rpm

137

Page 138: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 sh

101 sh

Check the Bourne Shell

DESCRIPTION

sh is an Intel(R) Cluster Checker module to verify the Bourne Shell. The module tests if/bin/sh exists and runs a’Hello World’ script.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

/bin/sh

138

Page 139: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 shm_mount

102 shm_mount

Check that the shared memory device (/dev/shm) is mounted.

DESCRIPTION

shm_mount is an Intel(R) Cluster Checker module to verify that /dev/shm is mounted correctly.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

mount

139

Page 140: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 single_authentication

103 single_authentication

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that the cluster authentication process meets requirements.

METHOD

ssh to every node from every other node and run a simple command.

CONFIGURATION

None

MODULE CLASS

matrix

DEPENDENCIES

ping_alltoall

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ssh

140

Page 141: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 speedstep

104 speedstep

Check state of the Intel SpeedStep(R) Technology

DESCRIPTION

speedstep is an Intel(R) Cluster Checker module that verifies the state of the Intel SpeedStep(R) Technology. TheLinux kernel support for Intel SpeedStep(R) Technology is provided by the cpufreq subsystem.

CONFIGURATION

state

The required state of the Intel SpeedStep(R) Technology.Default: false

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

test

uname

141

Page 142: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ssh

105 ssh

Check the ssh connectivity of all nodes

DESCRIPTION

ssh.pm is an Intel(R) Cluster Checker module to verify that all nodes can be reached via ssh from the system runningIntel(R) Cluster Checker.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ping

EXTERNAL DEPENDENCIES

echo

142

Page 143: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ssh_alltoall

106 ssh_alltoall

Check that every node can ssh to all other nodes

DESCRIPTION

ssh_alltoall is an Intel(R) Cluster Checker module to verify that every node can ssh to every other node in the cluster.

CONFIGURATION

None

MODULE CLASS

matrix

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ssh

143

Page 144: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 ssh_version

107 ssh_version

Check the ssh version uniformity of all nodes

DESCRIPTION

ssh_version is an Intel(R) Cluster Checker module to verify that all nodes have the same ssh version.

CONFIGURATION

ssh-path

The base path to the SSH commandDefault: /usr/bin

Example

<ssh_version><ssh-path>/usr/bin</ssh-path>

</ssh_version>

MODULE CLASS

vector

DEPENDENCIES

ping

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ssh

144

Page 145: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 stray_uids

108 stray_uids

Check that all files in a directory are owned by a known user and group

DESCRIPTION

stray_uids is an Intel(R) Cluster Checker module to verify that all files in a directory (or several directories) are ownedby known users and groups.

CONFIGURATION

dir

A directory to be checked for stray UIDs and GIDs. May be specified multipe times to check more than one directory.Default : /tmpif, and only if, no dir options are specified. If any directories are listed in the config file, /tmp willnotbe checked, unless it is also explicitly listed.

Example

<stray_uids><dir>/tmp</dir><dir>/var/tmp</dir><dir>/home</dir><dir>/var/log</dir>

</stray_uids>

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

stat

145

Page 146: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 subnet_manager

109 subnet_manager

Check the InfiniBand subnet manager

DESCRIPTION

subnet_manager is an Intel(R) Cluster Checker module to verify that the software InfiniBand subnet manager is run-ning on one, and only one, node.

CONFIGURATION

command

The name of the subnet manager processDefault: opensm

Example

<subnet_manager><command>opensm</command>

</subnet_manager>

MODULE CLASS

vector

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

ps

NOTES

Does not consider that subnet manager may be running on a node external to the cluster. Does not consider that theInfiniBand switch may have a hardware subnet manager.

146

Page 147: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 system_memory

110 system_memory

Check the uniformity of the total physical memory and swap memory

DESCRIPTION

system_memory is an Intel(R) Cluster Checker module to verify system memory. The module checks both physicalmemory and swap space.

CONFIGURATION

physical

The amount of physical memory, in KB.Default: median of the collected values

physical_threshold

Maximum absolute deviation from the expected amount of physical memory that is allowable, in KB.Default: 100

swap

The amount of swap memory, in KB.Default: median of the collected values

swap_threshold

Maximum absolute deviation from the expected amount of swap memory that is allowable, in KB.Default: 100

Example

<system_memory><physical>6154464</physical><swap>2040208</swap>

</system_memory>

MODULE CLASS

vector

DEPENDENCIES

ssh

mount_proc

genuine_intel

147

Page 148: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 tcl_8_4_7

111 tcl_8_4_7

Check Intel(R) Cluster Ready compliance

DESCRIPTION

Check that Tcl meets requirements.

METHOD

Compare the Tcl patchlevel to 8.4.7.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

tclsh

148

Page 149: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 tcsh

112 tcsh

Check the enhanced C Shell

DESCRIPTION

tcsh is an Intel(R) Cluster Checker module to verify the enhanced C Shell. The module tests if/bin/tcsh existsand runs a ’Hello World’ script.

CONFIGURATION

None

MODULE CLASS

unit

DEPENDENCIES

ssh

tmp

genuine_intel

EXTERNAL DEPENDENCIES

/bin/tcsh

149

Page 150: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 tmp

113 tmp

Check the permissions on /tmp

DESCRIPTION

tmp is an Intel(R) Cluster Checker module to verify the permissions on /tmp are correct (by default: 1777.)

CONFIGURATION

sticky

If true, consider 0777 to also be correct.Default: false

Example

<tmp><sticky/>

</tmp>

MODULE CLASS

unit

DEPENDENCIES

ssh

genuine_intel

EXTERNAL DEPENDENCIES

stat

NOTES

Assumes that the temporary directory is named /tmp.

150

Page 151: Intel Cluster Checker Module Reference · Intel R Cluster Checker Module Reference Contents 1 1GiB_memory 7 2 65GiB_storage_head 8 3 X11_clients 9 4 X11_libs 11 5 arch 13 6 available_disk

Intel R©Cluster Checker 1.3u2 uid_sync

114 uid_sync

Check the uniformity of the user and group database

DESCRIPTION

uid_sync is an Intel(R) Cluster Checker module to verify the synchronization of users and group information acrossthe cluster. The comparison includes all attributes that are returned bygetpwent andgetgrent .

CONFIGURATION

minuid

The minimum uid to checkDefault: 500

mingid

The minimum gid to checkDefault: 500

Example

<uid_sync><mingid>1000</mingid>

</uid_sync>

MODULE CLASS

vector

DEPENDENCIES

ssh

perl

genuine_intel

EXTERNAL DEPENDENCIES

perl

151