intel cluster checker module reference · intel r cluster checker module reference contents 1...
TRANSCRIPT
Intel R©Cluster CheckerModule Reference
Contents
1 1GiB_memory 7
2 65GiB_storage_head 8
3 X11_clients 9
4 X11_libs 11
5 arch 13
6 available_disk 14
7 base_libs 15
8 bash 16
9 binutils_2_15_92 17
10 clean_ipc 18
11 clock_granularity 19
12 clock_sync 20
13 clomp 21
14 cluster_size 22
15 copy_exactly 23
16 core_count 25
17 core_frequency 26
18 cpuinfo 27
19 cron 28
20 csh 29
21 dat_conf 30
1
Intel R©Cluster Checker 1.3u2 CONTENTS
22 dmidecode 31
23 e1000 32
24 e1000e 33
25 environment 34
26 etc_hosts 35
27 file_permissions 36
28 file_tree 38
29 gcc 39
30 gcc_3_4_6 40
31 gdb_6_3 41
32 generic_correctness 42
33 generic_uniformity 44
34 genuine_intel 45
35 gige 46
36 glibc_2_3_4 47
37 gmake_3_80 49
38 hdparm 50
39 home 52
40 host_conf 53
41 hostname 54
42 hpcc 55
43 ib_firmware 58
44 ibadm 59
45 icr_version_1_0 60
46 icr_version_1_1 61
47 igb 62
48 imb_collective_intel_mpi 63
49 imb_message_integrity_intel_mpi 65
2
Intel R©Cluster Checker 1.3u2 CONTENTS
50 imb_pingpong_intel_mpi 67
51 intel_cc 70
52 intel_cc_rtl_9_1 72
53 intel_cce_rtl 74
54 intel_cce_rtl_9_1 75
55 intel_cmkl_rtl_9_0 77
56 intel_devtools_1_0 79
57 intel_fc 81
58 intel_fc_rtl_9_1 82
59 intel_fce_rtl 84
60 intel_fce_rtl_9_1 85
61 intel_mpi 87
62 intel_mpi_internode 89
63 intel_mpi_rt 91
64 intel_mpi_rt_3_0_033 93
65 intel_mpi_rt_internode 96
66 intel_mpi_testsuite 98
67 intel_tbb_rtl_1_0 100
68 ip_alltoall 101
69 ipoib 102
70 java_1_4_2 103
71 jdk_1_4_2 104
72 kernel 105
73 kernel_2_6_17 106
74 kernel_modules 107
75 kernel_parameters 108
76 ksh 109
77 lib32_counterpart_lib64 110
3
Intel R©Cluster Checker 1.3u2 CONTENTS
78 loopback 111
79 memory_bandwidth_stream 112
80 mflops_intel_mkl 114
81 mount_proc 116
82 mpi_consistency 117
83 network_consistency 118
84 nfs_mounts 119
85 nisdomain 120
86 nismaps 121
87 nsswitch 122
88 openib 123
89 openssh_3_9 125
90 packages 126
91 pci 127
92 perl 128
93 perl_5_6_1 129
94 ping 130
95 ping_alltoall 131
96 portal 132
97 process_check 133
98 python 135
99 python_2_3_4 136
100rpm 137
101sh 138
102shm_mount 139
103single_authentication 140
104speedstep 141
105ssh 142
4
Intel R©Cluster Checker 1.3u2 CONTENTS
106ssh_alltoall 143
107ssh_version 144
108stray_uids 145
109subnet_manager 146
110system_memory 147
111tcl_8_4_7 148
112tcsh 149
113tmp 150
114uid_sync 151
5
Intel R©Cluster Checker 1.3u2 CONTENTS
Disclaimer and Legal Information
INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL(R) PRODUCTS. NO LI-CENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTYRIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CON-DITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND IN-TEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTELPRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PUR-POSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLEC-TUAL PROPERTY RIGHT.Intel products are not intended for use in medical, life saving, life sustaining, critical control or safety systems, or innuclear facility applications.Intel may make changes to specifications and product descriptions at any time, without notice. Designers must notrely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reservesthese for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising fromfuture changes to them. The information here is subject to change without notice. Do not finalize a design with thisinformation.The products described in this document may contain design defects or errors known as errata which may cause theproduct to deviate from published specifications. Current characterized errata are available on request.Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing yourproduct order.Copies of documents which have an order number and are referenced in this document, or other Intel literature, maybe obtained by calling 1-800-548-4725, or by visiting Intel’s Web Site.* Other names and brands may be claimed as the property of others.Copyright (C) 2006-2009, Intel Corporation. All rights reserved.
6
Intel R©Cluster Checker 1.3u2 1GiB_memory
1 1GiB_memory
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the amount of physical memory in each node meets requirements.
METHOD
Get the amount of physical memory from /proc/meminfo.
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
grep
/proc/cpuinfo
/proc/meminfo
7
Intel R©Cluster Checker 1.3u2 65GiB_storage_head
2 65GiB_storage_head
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the amount of storage available to the head node meets requirements.
METHOD
Get the disk size from df(1).
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
df
8
Intel R©Cluster Checker 1.3u2 X11_clients
3 X11_clients
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the X11 clients meet requirements on the head node.
METHOD
Confirm that the following files are available on the head node:
glxinfo
listres
mkhtmlindex
showfont
viewres
x11perf
x11perfcomp
xauth
xclipboard
xev
xfd
xfontsel
xkill
xload
xmessage
xterm
xvinfo
xwininfo
CONFIGURATION
None
MODULE CLASS
unit
9
Intel R©Cluster Checker 1.3u2 X11_clients
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
which
10
Intel R©Cluster Checker 1.3u2 X11_libs
4 X11_libs
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the X11 runtime libraries meet requirements.
METHOD
Confirm that the following libraries (or later versions) are in the dynamic linker cache on all nodes. x86-64 versionsneed to be present.
libFS.so.6
libGLw.so.1
libI810XvMC.so.1
libICE.so.6
libMrm.so.3
libOSMesa.so.4
libUil.so.3
libX11.so.6
libXcomposite.so.1
libXcursor.so.1
libXdamage.so.1
libXevie.so.1
libXext.so.6
libXfixes.so.3
libXfont.so.1
libXft.so.2
libXinerama.so.1
libXi.so.6
libXm.so.3
libXmu.so.6
libXmuu.so.1
libXp.so.6
libXrandr.so.2
libXrender.so.1
11
Intel R©Cluster Checker 1.3u2 X11_libs
libXRes.so.1
libXss.so.1
libXTrap.so.6
libXt.so.6
libXtst.so.6
libXvMC.so.1
libXv.so.1
libXxf86dga.so.1
libXxf86misc.so.1
libXxf86vm.so.1
libfontconfig.so.1
libfreetype.so.6
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
/sbin/ldconfig
12
Intel R©Cluster Checker 1.3u2 arch
5 arch
Check the uniformity of the system architecture on all nodes
DESCRIPTION
arch is an Intel(R) Cluster Checker module to verify the uniformity of the system architecture among nodes. Themodule examines the output of ‘uname -p‘ to compare against the other nodes.
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
uname
13
Intel R©Cluster Checker 1.3u2 available_disk
6 available_disk
Check the space available on a local filesystem
DESCRIPTION
available_disk is an Intel(R) Cluster Checker module to verify that a minimum amount of free space is available on alocal filesystem. Only local filesytems can be checked.
CONFIGURATION
filesystem
A container that groups the other options by filesystem
available The minimum amount of free disk space, in KB, that is required. If not specified, the free space is notchecked.
mountpoint The mountpoint of the filesystem.
Example
<available_disk><filesystem>
<available>10485760</available><mountpoint>/</mountpoint>
</filesystem><filesystem>
<available>102400</available><mountpoint>/boot</mountpoint>
</filesystem></available_disk>
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
df
14
Intel R©Cluster Checker 1.3u2 base_libs
7 base_libs
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the base libraries are present and meet requirements.
METHOD
Confirm that the following libraries (or later versions) are in the dynamic linker cache on all nodes. Both 32-bit andx86-64 versions need to be present.
libacl.so.1
libattr.so.1
libbz2.so.1
libcap.so.1
libelf.so.0
libgdbm.so.2
libncurses.so.5
libpam.so.0
libtermcap.so.2
libz.so.1
Libraries checked by theglibc_2_3_4 module are not re-examined in this module.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
glibc_2_3_4
ssh
genuine_intel
EXTERNAL DEPENDENCIES
/sbin/ldconfig
15
Intel R©Cluster Checker 1.3u2 bash
8 bash
Check the Bourne Again Shell
DESCRIPTION
bash is an Intel(R) Cluster Checker module to verify the Bourne Again Shell. The module tests if/bin/bash existsand runs a ’Hello World’ script.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
/bin/bash
test
16
Intel R©Cluster Checker 1.3u2 binutils_2_15_92
9 binutils_2_15_92
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the GNU (*) binutils are present and meet requirements.
METHOD
Compare the versions of the binutils to 2.15.92:
addr2line
ar
as
gprof
ld
nm
objcopy
objdump
ranlib
readelf
size
strings
strip
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
GNU binutils
17
Intel R©Cluster Checker 1.3u2 clean_ipc
10 clean_ipc
Check that no SysV IPC facilities are open
DESCRIPTION
clean_ipc is an Intel(R) Cluster Checker module to verify that the ipc subsystem is clean. The module executesipcs-a to get a list of Shared Memory Segments, Semaphore Arrays, and Message Queues. If there are any entries, it willflag them and fail, giving a count of the entries in each list.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ipcs
18
Intel R©Cluster Checker 1.3u2 clock_granularity
11 clock_granularity
Check the minimum granularity of gettimeofday()
DESCRIPTION
clock_granularity is an Intel(R) Cluster Checker module to verify thegettimeofday() system clock granular-ity. The module executes a C loop and counts the number of times through the loop before the value returned bygettimeofday() changes. As long as the number of times through the loop is greater than 1, this is an accurate, ifsimplistic, way to compute the clock granularity.
CONFIGURATION
granularity
The acceptable clock granularity threshold in microseconds.Default: 2 us
Example
<clock_granularity><granularity>2</granularity>
</clock_granularity>
MODULE CLASS
unit
DEPENDENCIES
gcc
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
None
19
Intel R©Cluster Checker 1.3u2 clock_sync
12 clock_sync
Check the cluster clock synchronization
DESCRIPTION
clock_sync is an Intel(R) Cluster Checker module to verify the system clocks on each node are reasonably synchro-nized. The module calculates the difference between the node time and the cluster median time and compares it to athreshold value.
CONFIGURATION
deviation
The maximum deviation (in seconds) of the clock on any node from the cluster median.Default: 300 seconds
Example
<clock_sync><deviation>300</deviation>
</clock_sync>
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
date
NOTES
Since the node clocks are not sampled at exactly the same instance, specifying too small of a threshold is not recom-mended.
20
Intel R©Cluster Checker 1.3u2 clomp
13 clomp
Check the Intel(R) C++ Compiler Cluster OpenMP runtime library
DESCRIPTION
clomp is an Intel(R) Cluster Checker module to verify the Intel(R) C++ Compiler Cluster OpenMP runtime. Themodule runs a Hello World binary on the compute nodes using Cluster OpenMP and checks that the kernel parameterrandomize_va_space is not set.Note: this module does not build the Hello World binary using the Intel(R) C++ Compiler.
CONFIGURATION
cc-path
The base path to the Intel(R) C++ Compiler installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
Example
<clomp><cc-path>/opt/intel/cc/9.1</cc-path>
</clomp>
MODULE CLASS
vector
DEPENDENCIES
arch
intel_cce_rtl
sh
ssh
ssh_alltoall
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) C++ Compiler 9.1 or later runtime
/sbin/sysctl
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH) inherited from the user running Intel(R) ClusterChecker is setup correctly.Assumes that the nodes are x86_64 architecture.
21
Intel R©Cluster Checker 1.3u2 cluster_size
14 cluster_size
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the number of nodes meets requirements.
METHOD
Count the number of nodes being checked.
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
genuine_intel
EXTERNAL DEPENDENCIES
None
22
Intel R©Cluster Checker 1.3u2 copy_exactly
15 copy_exactly
Check that a node image is an exact copy of a reference image
DESCRIPTION
copy_exactly is an Intel(R) Cluster Checker module to verify that a node is an exact copy of a reference system. Themodule uses the output of thenode_checksum script run on the reference system as the basis of the comparison.
CONFIGURATION
compute_node
The path to the compute node reference file.Default: none
exclude
File to exclude from the test based on a regular expression match. May be specified multiple times.Default: none
head_node
The path to the head node reference file.Default: none
Example
<copy_exactly><compute_node>file1</compute_node><exclude>^/etc/sysconfig/</exclude><head_node>file2</head_node>
</copy_exactly>
MODULE CLASS
unit
DEPENDENCIES
sh
ssh
tmp
genuine_intel
23
Intel R©Cluster Checker 1.3u2 copy_exactly
EXTERNAL DEPENDENCIES
find
md5sum
prelink
xargs
NOTES
This module may take a long time (several minutes to over an hour, depending on the cluster size) to complete if thenumber of files to check is very large.
24
Intel R©Cluster Checker 1.3u2 core_count
16 core_count
Check the number and type of cores and processors
DESCRIPTION
core_count is an Intel(R) Cluster Checker module to verify the number of processors, physical cores, and logical coresper node. If the number of cores or processors is not specified, the uniformity of the count is checked for all nodes.Additionally, the test module verifies the status of Intel(R) Hyper-Threading Technology.
CONFIGURATION
hyper-threading
Flag to designate that Intel(R) Hyper-Threading Technology should be enabled.Default: Hyper-Threading Technology should not be enabled.
logical-cores
The number of logical cores per node. Logical cores include physical and SMT/HT cores.
physical-cores
The number of physical cores per node. Physical cores exclude SMT/HT cores.
processors
The number of processors per node.
Example
<core_count><hyper-threading/><logical-cores>8</logical-cores><physical-cores>4</physical-cores><processors>2</processors>
</core_count>
MODULE CLASS
vector
DEPENDENCIES
gcc
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
None
25
Intel R©Cluster Checker 1.3u2 core_frequency
17 core_frequency
Check the frequency of all processor cores
DESCRIPTION
core_frequency is an Intel(R) Cluster Checker module to verify that the frequency of each processor core in the clusteris within a specified threshold of a specified frequency.
CONFIGURATION
frequency
Expected frequency of a core in MHz.Default: median of the collected frequencies
threshold
Maximum absolute deviation from the expected frequency that is allowable, in MHz.Default: 5
Example
<core_frequency><frequency>3056</frequency><threshold>50</threshold>
</core_frequency>
MODULE CLASS
vector
DEPENDENCIES
ssh
mount_proc
genuine_intel
EXTERNAL DEPENDENCIES
grep
26
Intel R©Cluster Checker 1.3u2 cpuinfo
18 cpuinfo
Check the uniformity of /proc/cpuinfo
DESCRIPTION
cpuinfo is an Intel(R) Cluster Checker module to verify the uniformity of /proc/cpuinfo on all nodes. Besides checkingthat the core count is the same across the cluster, it tries to verify that all fields are uniform unless explicitly excluded.The following fields are excluded by default:BogoMIPS, bogomips , cpu MHz, itc MHz , physical id ,processor , runqueue andapicid . The core frequency is checked in thecore_frequency test module.
CONFIGURATION
exclude
The name of the field to exclude from the check. May be specified multiple times to exclude more than one field.Default: none
Example
<cpuinfo><exclude>stepping</exclude>
</cpuinfo>
MODULE CLASS
vector
DEPENDENCIES
ssh
mount_proc
genuine_intel
EXTERNAL DEPENDENCIES
cat
27
Intel R©Cluster Checker 1.3u2 cron
19 cron
Check that cron is not running
DESCRIPTION
cron is an Intel(R) Cluster Checker module to verify that the process list does not include any crond processes.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ps
grep
28
Intel R©Cluster Checker 1.3u2 csh
20 csh
Check the C Shell
DESCRIPTION
csh is an Intel(R) Cluster Checker module to verify the C Shell. The module tests if/bin/csh exists and runs a’Hello World’ script.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
/bin/csh
test
29
Intel R©Cluster Checker 1.3u2 dat_conf
21 dat_conf
Check that all entries in /etc/dat.conf are valid
DESCRIPTION
dat_conf is an Intel(R) Cluster Checker module to verify that all entries in /etc/dat.conf have corresponding networkdevices.
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
cat
ifconfig
sh
30
Intel R©Cluster Checker 1.3u2 dmidecode
22 dmidecode
Check the uniformity of the SMBIOS/DMI information
DESCRIPTION
dmidecode is an Intel(R) Cluster Checker module to verify the uniformity of the SMBIOS/DMI information returnedby the dmidecode utility.
CONFIGURATION
exclude
dmidecode field to exclude from the check based on an exact match. May be specified multiple times to exclude morethan one field.
Example
<dmidecode><exclude>Memory Device (0x002F): Part Number</exclude><exclude>System Slot Information (0x001D): Length</exclude>
</dmidecode>
MODULE CLASS
vector
DEPENDENCIES
gcc
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
dmidecode (http://www.nongnu.org/dmidecode/)
make
NOTES
This check may only be run by a privileged user.
31
Intel R©Cluster Checker 1.3u2 e1000
23 e1000
Check the Intel(R) PRO/1000 Network Driver (e1000)
DESCRIPTION
e1000 is an Intel(R) Cluster Checker module to verify the e1000 kernel module. The module checks that the kernelmodule is loaded, the version and module file are uniform across the cluster, and the Interrupt throttling options areset.
CONFIGURATION
options
The string that is compared to the e1000 driver options.Default: options e1000 InterruptThrottleRate=0,0 TxIntDelay=0,64 RxAbsIntDelay=0,128 TxAbsIntDelay=0,64
version
The string that is compared to the version string in the e1000 kernel module.Default: noneIf versionis not specified in the configuration file, then the e1000 kernel module version is not checked.
Example
<e1000><options>options e1000 InterruptThrottleRate=0,0</options><version>5.2.52-k3</version>
</e1000>
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
genuine_intel
EXTERNAL DEPENDENCIES
grep
lsmod
md5sum
modprobe
strings
32
Intel R©Cluster Checker 1.3u2 e1000e
24 e1000e
Check the Intel(R) PRO/1000 Network Driver (e1000e)
DESCRIPTION
e1000e is an Intel(R) Cluster Checker module to verify the e1000e kernel module. The module checks that the kernelmodule is loaded, the version and module file are uniform across the cluster, and the Interrupt throttling options areset.
CONFIGURATION
options
The string that is compared to the e1000e driver options.Default: options e1000e InterruptThrottleRate=0,0 TxIntDelay=0,64 RxAbsIntDelay=0,128 TxAbsIntDelay=0,64
version
The string that is compared to the version string in the e1000e kernel module.Default: noneIf versionis not specified in the configuration file, then the e1000e kernel module version is not checked.
Example
<e1000e><options>options e1000e InterruptThrottleRate=0,0</options><version>0.2.9</version>
</e1000e>
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
genuine_intel
EXTERNAL DEPENDENCIES
grep
lsmod
md5sum
modprobe
strings
33
Intel R©Cluster Checker 1.3u2 environment
25 environment
Check the uniformity of environment variables
DESCRIPTION
environment is an Intel(R) Cluster Checker module to verify environment variables. It verifies the uniformity of theenvironment variables on the compute nodes. If the head node is also a compute node, the test also verifies thatthe environment variables on the compute nodes are the same on the head node (although the head node may haveadditional environment variables that are not set on the compute nodes.)
CONFIGURATION
exclude
The name of an environment variable to exclude from the check (case sensitive.) The variables HOST, HOSTNAME,SSH_CLIENT, SSH_CONNECTION, and SSH2_CLIENT are automatically excluded. This option may be specifiedmore than once to exclude multiple environment variables.Default: none
Example
<environment><exclude>LANG</exclude><exclude>PAGER</exclude>
</environment>
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
printenv
34
Intel R©Cluster Checker 1.3u2 etc_hosts
26 etc_hosts
Check that hostnames are associated to only one IP address.
DESCRIPTION
etc_hosts.pm is an Intel(R) Cluster Checker module to verify that each hostname is associated to only one IP addressin the /etc/hosts file.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
cat
35
Intel R©Cluster Checker 1.3u2 file_permissions
27 file_permissions
Check file existence, ownership, and permissions
DESCRIPTION
file_permissions is an Intel(R) Cluster Checker module to verify the existence, ownership, and permissions on filesand directories.
CONFIGURATION
object
A container that groups the other options by file / directory path
path The path to the file or directory to be tested. A path is required in each object container. The existence of thepath is always checked.
group The name of the group who should own the file / directory specified in the path. If not specified, the groupownership is not checked.
permissions The expected permissions, in octal, of the file / directory specified in the path. If not specified, thepermissions are not checked.
user The name of the user who should own the file / directory specified in the path. If not specified, the userownership is not checked.
Example
<file_permissions><object>
<group>root</group><path>/tmp</path><permissions>1777</permissions><user>root</user>
</object><object>
<path>/shared</path><permissions>0755</permissions>
</object><object>
<path>/home</path></object>
</file_permissions>
MODULE CLASS
unit
36
Intel R©Cluster Checker 1.3u2 file_permissions
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
stat
37
Intel R©Cluster Checker 1.3u2 file_tree
28 file_tree
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the file trees meet requirements.
METHOD
Compute the md5sum of files in the following directories on the head node: /bin, /boot, /dev, /etc, /lib, /lib64, /media,/mnt, /opt, /sbin, /usr, and /var/lib/alternatives.The compute nodes must be identical to each other. If the head node is also a compute node, the head node must be asuperset of the compute nodes.Some files / paths are explicitly excluded from the test. For a list of excluded files please see the Intel(R) ClusterReady Specification.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
find
md5sum
prelink
xargs
NOTES
For a typical OS installation, this module verifies the checksum of hundreds of thousands of files on each node. As aconsequence, this module may take a long time (several minutes to over an hour) to complete.
38
Intel R©Cluster Checker 1.3u2 gcc
29 gcc
Check the functionality and uniformity of the GNU (*) C/C++ compilers
DESCRIPTION
gcc is an Intel(R) Cluster Checker module to verify the GNU C/C++ compilers. The module examines the compilerversion and builds / executes C and C++ ’Hello World’ programs.
CONFIGURATION
gcc-path
The base path to the GNU C/C++ compilers.Default: /usr/bin
version
The string that is compared to the GNU C/C++ compiler version.Default: noneIf versionis not specified in the Intel(R) Cluster Checker configuration file, then the specific compiler version will notbe checked.The uniformity of the version string is compared across the cluster regardless.
Example
<gcc><gcc-path>/usr/bin</gcc-path><version>3.2.3</version>
</gcc>
MODULE CLASS
vector
DEPENDENCIES
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
gcc
g++
test
NOTES
Compiler warnings result in a failure. For some warnings, this is the correct behavior, but for ’harmless’ warnings,this produces false positives.
39
Intel R©Cluster Checker 1.3u2 gcc_3_4_6
30 gcc_3_4_6
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the GNU (*) compilers are present and meet requirements.
METHOD
Compare the gcc and g++ versions to 3.4.6
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
gcc
g++
40
Intel R©Cluster Checker 1.3u2 gdb_6_3
31 gdb_6_3
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the GNU (*) debugger is present and meets requirements.
METHOD
Compare the gdb version to 6.3.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
gdb
41
Intel R©Cluster Checker 1.3u2 generic_correctness
32 generic_correctness
User-defined correctness check
DESCRIPTION
generic_correctness is an Intel(R) Cluster Checker module that executes a specified command and compares the outputto specified output.The intended purpose of this module is to implement site specific and/or temporary checks; cross-site and/or permanent checks should be implemented in dedicated modules.Note: by default, this module willnot be run as root due to the potential security issues and/or inadvertent configurationchanges that are inherent in running an arbitrary command on every node of a cluster as the root user. Set the overrideconfiguration option to run this module as root.
CONFIGURATION
item
The container for a command / result set.
command The command to be executed.
result The exact output of the command that should be considered correct. Case and white space sensitive.
override
Override the check that will not allow this module to run as root.Default: false
Example
<generic_correctness><item>
<command>uname -r</command><result>2.4.21-20.EL</result>
</item><item>
<command>/sbin/lsmod | grep e1000</command><result>e1000 171104 1</result>
</item></generic_correctness>
MODULE CLASS
unit
DEPENDENCIES
sh
ssh
genuine_intel
42
Intel R©Cluster Checker 1.3u2 generic_correctness
EXTERNAL DEPENDENCIES
None
43
Intel R©Cluster Checker 1.3u2 generic_uniformity
33 generic_uniformity
User-defined node uniformity check
DESCRIPTION
generic_uniformity.pm is an Intel(R) Cluster Checker module that executes a specified command and compares theoutput between nodes. The intended purpose of this module is to implement site specific and/or temporary checks;cross-site and/or permanent checks should be implemented in dedicated modules.Note: by default, this module will not be run as root due to the potential security issues and/or inadvertent configurationchanges that are inherent in running an arbitrary command on every node of a cluster as the root user. Set the overrideconfiguration option to run this module as root.
CONFIGURATION
command
The command to be executed.
override
Override the check that will not allow this module to run as root.Default: false
Example
<generic_uniformity><command>uname -r</command><command>/sbin/lsmod | grep e1000</command>
</generic_uniformity>
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
genuine_intel
EXTERNAL DEPENDENCIES
None
44
Intel R©Cluster Checker 1.3u2 genuine_intel
34 genuine_intel
Check that the nodes contain GenuineIntel processors
DESCRIPTION
genuine_intel is an Intel(R) Cluster Checker module to verify that the cluster is built using GenuineIntel processors.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
mount_proc
EXTERNAL DEPENDENCIES
grep
45
Intel R©Cluster Checker 1.3u2 gige
35 gige
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the Ethernet interface meets requirements.
METHOD
Read the interface speed usingethtool for all Ethernet interfaces. The speed of at least one interface must be greaterthan or equal to 1000 Mb/s.Note: ethtool can only be run by root.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ifconfig
ethtool
46
Intel R©Cluster Checker 1.3u2 glibc_2_3_4
36 glibc_2_3_4
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the glibc runtime meets requirements.
METHOD
Compare the libc.so.6 version to 2.3.4.Confirm that the following libraries (or later versions) are in the dynamic linker cache on all nodes. Both 32-bit andx86-64 versions need to be present.
ld-linux.so.2
ld-linux-x86-64.so.2
libBrokenLocale.so.1
libSegFault.so
libanl.so.1
libc.so.6
libcidn.so.1
libcrypt.so.1
libdl.so.2
libgcc_s.so.1
libm.so.6
libnsl.so.1
libnss_compat.so.2
libnss_dns.so.2
libnss_files.so.2
libnss_hesiod.so.2
libnss_nis.so.2
libnss_nisplus.so.2
libpthread.so.0
libresolv.so.2
librt.so.1
libstdc++.so.6
libthread_db.so.1
libutil.so.1
47
Intel R©Cluster Checker 1.3u2 glibc_2_3_4
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
/sbin/ldconfig
48
Intel R©Cluster Checker 1.3u2 gmake_3_80
37 gmake_3_80
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that GNU (*) make is present and meets requirements.
METHOD
Compare the gdb version to 3.80.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
gmake
49
Intel R©Cluster Checker 1.3u2 hdparm
38 hdparm
Check the disk performance of a node
DESCRIPTION
hdparm is an Intel(R) Cluster Checker module to verify the disk read rate of cached and raw device operations andtheir deviation over the cluster nodes. The module executes thehdparm utility to measure the disk performance.Note: this test will only be run as root since hdparm requires direct access to the disk device.
CONFIGURATION
cache-read
The minimum acceptable read rate, in MB/s, from disk buffer cache. This corresponds to the ’-T’ hdparm option.
cache-deviation
The factor of allowed standard deviations from median, used to search for outlier values. The allowed range is (median-/+ deviation * stddev).Default: 3
device
The disk device to be measured.Default: the disk device corresponding to the ’/’ partition.
device-read
The minimum acceptable read rate, in MB/s, from the disk device, reading through cache. This corresponds to the ’-t’hdparm option.
device-deviation
The factor of allowed standard deviations from median, used to search for outlier values. The allowed range is (median-/+ deviation * stddev).Default: 3
Example
<hdparm><cache-deviation>3</cache-deviation><cache-read>2400</cache-read><device>/dev/sda1</device><device-deviation>3</device-deviation><device-read>60</device-read>
</hdparm>
MODULE CLASS
unit
50
Intel R©Cluster Checker 1.3u2 hdparm
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
hdparm
mount
NOTES
This check is not appropriate for diskless nodes and should be excluded.
51
Intel R©Cluster Checker 1.3u2 home
39 home
Check Intel(R) Cluster Ready Compliance
DESCRIPTION
Check that /home meets requirements.
METHOD
Compare the inode number of a directory in /home on all nodes.
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
stat
perl
NOTES
If running as a privileged user and /home is managed by automount, at least one user account should be created priorto running this check.
52
Intel R©Cluster Checker 1.3u2 host_conf
40 host_conf
Check the configuration of /etc/host.conf
DESCRIPTION
host_conf is an Intel(R) Cluster Checker module to verify that host.conf has set the resolution order to hosts, nis, bind.If host.conf is missing or the order line is missing from host.conf, the module returns indeterminate.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
cat
/etc/host.conf
53
Intel R©Cluster Checker 1.3u2 hostname
41 hostname
Check the hostname of each node
DESCRIPTION
hostname is an Intel(R) Cluster Checker module to verify that the node hostname is correct. The hostname of the nodedetermined byhostname is compared to the hostname from a configuration file or to the one received by DHCP.The module attempts to read the hostname configuration setting from /etc/sysconfig/network, /etc/HOSTNAME,/etc/hostname, and /etc/sysconfig/system (in that order). Also the module will check the DHCP leases to see if thehostname has been received by DHCP. If only the short hostnames match, the check is considered successful, althougha different status message is displayed. If no match if found the test will fail and will display the available hostnamesagainst which the active hostname was compared.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
cat
hostname
/etc/sysconfig/network
/etc/HOSTNAME
/etc/hostname
/etc/sysconfig/system
54
Intel R©Cluster Checker 1.3u2 hpcc
42 hpcc
Run the HPC Challenge Benchmarks
DESCRIPTION
hpcc is an Intel(R) Cluster Checker module that runs the HPC Challenge benchmark suite. The HPCC benchmarksuite includes 7 benchmarks. Please seehttp://icl.cs.utk.edu/hpcc/ for details. HPCC was built with the Intel(R)C++ Compiler using the ’-O3’ and ’-openmp’ flags.The HPCC input parameters are read from a template input file (see thehpccinf parameter.) The P, Q, and Nparameters are modified from the values in the file. N is set to the file value multiplied by the square root of thenumber of nodes. For example, if the value in the file is 8,000 and the test is run on 8 nodes, the value of N is 22,627.P and Q are set so that P is maximized subject to the following constraints:
P <= Q
P*Q is equal to the total number of processes(not threads)
P and Q are integers
CONFIGURATION
build
Build HPCC from source rather than using the prebuilt binary (external/hpcc .) If true, the Intel(R) C Compiler,Intel(R) MPI Library, and Intel(R) Math Kernel Library must be available; theintel_cc andintel_mpi_internodemodules should be added as dependencies and theintel_cce_rtl andintel_mpi_rt_internode modulesshould be removed as dependencies.Default: false
cc-path
The base path to the Intel(R) C++ Compiler directory. Setting this parameter will automatically setup the environment.Default: none (inherit environment)
fabric
A container for the network interconnect fabric to evaluate. The<fabric> block may be repeated to test multipleinterconnects.
bandwidth The minimum acceptable network bandwidth in GB/s.If a bandwidth value is not specified, then thebandwidth check will be indeterminate.Default: none
device The Intel(R) MPI Library device string, i.e., I_MPI_DEVICEDefault: rdssm
dgemm The minimum acceptable DGEMM performance in GFLOPS.If a dgemm value is not specified, then thedgemm check will be indeterminate.
fft The minimum acceptable FFT performance in GFLOPS.If a fft value is not specified, then the fft check will beindeterminate.
55
Intel R©Cluster Checker 1.3u2 hpcc
hpl The minimum acceptable HP Linpack performance in TFLOPS.If a hpl value is not specified, then the hpl checkwill be indeterminate.Default: none
latency The maximum acceptable network latency in microseconds.If a latency value is not specified, then thelatency check will be indeterminate.Default: none
ptrans The minimum acceptable PTRANS performance in GB/s.If a ptrans value is not specified, then the ptranscheck will be indeterminate.
randomaccess The minimum acceptable RandomAccess performance in GUPs/s.If a randomaccess value is notspecified, then the randomaccess check will be indeterminate.
stream The minimum acceptable STREAM Triad performance in GB/s.If a stream value is not specified, then thestream check will be indeterminate.
hpccinf
The path to the template input file to use.Default: none (useexternal/hpccinf.txt )
mkl-path
The base path to the Intel(R) Math Kernel Library installation directory. Setting this parameter will automaticallysetup the environment.Default: none (inherit environment)
mpi-path
The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
process-number
The number of MPI processes to start on each nodeDefault: 1
thread-number
The number of OpenMP threads to start on each node. This setting corresponds to the OMP_NUM_THREADSenvironment variable.Default: ALL
Example
<hpcc><cc-path>/opt/intel/cce/9.1</cc-path><fabric>
<bandwidth>0.110</bandwidth><device>sock</device>
56
Intel R©Cluster Checker 1.3u2 hpcc
<dgemm>12</dgemm><fft>1</fft><hpl>0.002</hpl><latency>20</latency><ptrans>0.2</ptrans><randomaccess>0.001</randomaccess><stream>1.7</stream>
</fabric><mkl-path>/opt/intel/cmkl/9.0</mkl-path><mpi-path>/opt/intel/mpi/3.0</mpi-path><process-number>1</process-number><thread-number>ALL</thread-number>
</hpcc>
MODULE CLASS
vector
DEPENDENCIES
arch
intel_cce_rtl
intel_mpi_rt_internode
perl
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) C++ Compiler 9.1 or later
Intel(R) Math Kernel Library 9.0 or later
Intel(R) MPI Library 3.0 or later
perl
sed
rm
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<cc-path>, <mkl-path>, and<mpi-path> tags to override.
57
Intel R©Cluster Checker 1.3u2 ib_firmware
43 ib_firmware
Check that the InfiniBand firmware is uniform on all nodes
DESCRIPTION
ib_firmware is an Intel(R) Cluster Checker module to verify the firmware version on the InfiniBand cards on each node.The module can determine the firmware from drivers that use/sys/class/infiniband/mthcaN/fw_ver ,such as OFED (*).
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
mount_proc
genuine_intel
EXTERNAL DEPENDENCIES
grep
a supported IB driver
58
Intel R©Cluster Checker 1.3u2 ibadm
44 ibadm
Check that Mellanox in-band monitor processes are not running
DESCRIPTION
ibadm is an Intel(R) Cluster Checker module to verify that the Mellanox Infiniband in-band monitor is not running.The module checks the process list for entries matching ’ibgd’, ’ibadm’, ’ibis’, and ’obbs-pci’.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ps
59
Intel R©Cluster Checker 1.3u2 icr_version_1_0
45 icr_version_1_0
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that /etc/intel/icr contains the POSIX shell environment variable CLUSTER_READY_VERSION and that it isset to the value 1.0.
METHOD
Check that the /etc/intel/icr file contains ’CLUSTER_READY_VERSION=1.0’.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
cat
60
Intel R©Cluster Checker 1.3u2 icr_version_1_1
46 icr_version_1_1
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that /etc/intel/icr contains the POSIX shell environment variable CLUSTER_READY_VERSION and that it isset to the value 1.1.
METHOD
Check that the /etc/intel/icr file contains ’CLUSTER_READY_VERSION=1.1’.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
cat
61
Intel R©Cluster Checker 1.3u2 igb
47 igb
Check the Intel(R) PRO/1000 Network Driver (igb)
DESCRIPTION
igb is an Intel(R) Cluster Checker module to verify the igb kernel module. The module checks that the kernel moduleis loaded, the version and module file are uniform across the cluster, and the Interrupt throttling options are set.
CONFIGURATION
options
The string that is compared to the igb driver options.Default: options igb InterruptThrottleRate=0,0
version
The string that is compared to the version string in the igb kernel module.Default: noneIf versionis not specified in the configuration file, then the igb kernel module version is not checked.
Example
<igb><options>options igb InterruptThrottleRate=0,3</options><version>1.2.30</version>
</igb>
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
genuine_intel
EXTERNAL DEPENDENCIES
grep
lsmod
md5sum
modprobe
strings
62
Intel R©Cluster Checker 1.3u2 imb_collective_intel_mpi
48 imb_collective_intel_mpi
Check MPI collectives over the whole cluster
DESCRIPTION
imb_collective_intel_mpi is an Intel(R) Cluster Checker module that runs the Intel(R) MPI Benchmarks for a specifiedset of collectives. While the test may verify performance at some point, it currently only verifies that the benchmarksuccessfully ran.
CONFIGURATION
benchmark
The name of the Intel(R) MPI Benchmark to run. The<benchmark> tag may be repeated to run multiple benchmarks.See the Intel(R) MPI Benchmark documentation for a list of available collective benchmarks (typically the name ofthe MPI collective operation with the ’MPI_’ prefix removed, e.g., bcast for MPI_Bcast.)Default: barrier
build
Build the Intel(R) MPI Benchmarks from source rather than using the prebuilt binary (external/IMB-MPI1 .) Iftrue, the Intel(R) MPI Library SDK must be available; theintel_mpi_internode module should also be addedas a dependency.Default: false
fabric
A container for the network interconnect fabric to evaluate. The<fabric> block may be repeated to test multipleinterconnects.
device The Intel(R) MPI Library device string, i.e., I_MPI_DEVICEDefault: rdssm
msglen
Override the default IMB message length sequence of 0 to 2ˆi, where i ranges from 0 to 22. Instead, use the messagelengths defined inexternal/IMB_msglen : 0, 1, 2, 4, and 4,194,304. Use the<msglen/> tag to indicate true.Note: this will reduce the time required to run this test but may not detect some failing cases.Default: false
mpi-path
The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
Example
<imb_collective_intel_mpi><benchmark>barrier</benchmark><benchmark>bcast</benchmark><fabric>
63
Intel R©Cluster Checker 1.3u2 imb_collective_intel_mpi
<device>sock</device></fabric><fabric>
<device>rdssm</device></fabric><mpi-path>/opt/intel/mpi-rt/3.0</mpi-path>
</imb_collective_intel_mpi>
MODULE CLASS
vector
DEPENDENCIES
gcc
intel_mpi_rt_internode
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) MPI Library 3.0 or later
mktemp
rm
tar
test
uname
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.
64
Intel R©Cluster Checker 1.3u2 imb_message_integrity_intel_mpi
49 imb_message_integrity_intel_mpi
Check MPI message integrity over the whole cluster.
DESCRIPTION
imb_message_integrity_intel_mpi is an Intel(R) Cluster Checker module that compiles and runs the Intel(R) MPIBenchmarks checking the integrity of the messages by using the Sendrcv test. The test checks every message passingresult against the expected outcome and reports the defects of communication.
CONFIGURATION
build
Build the Intel(R) MPI Benchmarks from source rather than using the prebuilt binary (external/IMB-MPI1-DCHECK .)If true, the Intel(R) MPI Library SDK must be available; theintel_mpi_internode module should also be addedas a dependency.Default: false
fabric
A container for the network interconnect fabric to evaluate. The<fabric> block may be repeated to test multipleinterconnects.
device The Intel(R) MPI Library device string, i.e., I_MPI_DEVICEDefault: rdssm
msglen
Override the default IMB message length sequence of 0 to 2ˆi, where i ranges from 0 to 22. Instead, use the messagelengths defined inexternal/IMB_msglen : 0, 1, 2, 4, and 4,194,304. Use the<msglen/> tag to indicate true.Note: this will reduce the time required to run this test but may not detect some failing cases.Default: false
mpi-path
The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
Example
<imb_message_integrity_intel_mpi><fabric>
<device>sock</device></fabric><fabric>
<device>rdssm</device></fabric><mpi-path>/opt/intel/mpi-rt/3.0</mpi-path>
</imb_message_integrity_intel_mpi>
65
Intel R©Cluster Checker 1.3u2 imb_message_integrity_intel_mpi
MODULE CLASS
vector
DEPENDENCIES
gcc
intel_mpi_internode
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) MPI Library 3.0 or later
mktemp
rm
tar
test
uname
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.
66
Intel R©Cluster Checker 1.3u2 imb_pingpong_intel_mpi
50 imb_pingpong_intel_mpi
Check network interconnect performance
DESCRIPTION
imb_pingpong_intel_mpi is an Intel(R) Cluster Checker module to verify the network interconnect performance andvariation over the cluster nodeswith thepingpongIntel(R) MPI Benchmark. The module measures the performancebetween each pair of nodes.
CONFIGURATION
build
Build the Intel(R) MPI Benchmarks from source rather than using the prebuilt binary (external/IMB-MPI1 .) Iftrue, the Intel(R) MPI Library SDK must be available; theintel_mpi_internode module should also be addedas a dependency.Default: false
fabric
A container for the network interconnect fabric to evaluate. The<fabric> block may be repeated to test multipleinterconnects.
device The Intel(R) MPI Library device string, i.e., I_MPI_DEVICEDefault: rdssm
latency The maximum acceptable latency in microseconds.If a latency value is not specified, then no latency checkis performed.Default: none
latency-deviation The factor of allowed standard deviations from median, used to search for outlier values. Theallowed range is (median -/+ deviation * stddev).Default: 3
bandwidth The minimum acceptable bandwidth in MB/sec.If a bandwidth is not specified, then no bandwidthcheck is performed.Default: none
bandwidth-deviation The factor of allowed standard deviations from median, used to search for outlier values. Theallowed range is (median -/+ deviation * stddev).Default: 3
msglen
Override the default IMB message length sequence of 0 to 2ˆi, where i ranges from 0 to 22. Instead, use the messagelengths defined inexternal/IMB_msglen : 0, 1, 2, 4, and 4,194,304. Use the<msglen/> tag to indicate true.Note: this will reduce the time required to run this test but may not detect some failing cases.Default: false
67
Intel R©Cluster Checker 1.3u2 imb_pingpong_intel_mpi
mpi-path
The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
Example
<imb_pingpong_intel_mpi><fabric>
<device>sock</device><bandwidth>110</bandwidth><bandwidth-deviation>3</bandwidth-deviation><latency>35</latency><latency-deviation>3</latency-deviation>
</fabric><fabric>
<device>rdssm</device><bandwidth>620</bandwidth><bandwidth-deviation>3</bandwidth-deviation><latency>10</latency><latency-deviation>3</latency-deviation>
</fabric><mpi-path>/opt/intel/mpi/3.0</mpi-path>
</imb_pingpong_intel_mpi>
MODULE CLASS
matrix
DEPENDENCIES
arch
gcc
intel_mpi_rt_internode
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) MPI Library 3.0 or later
mktemp
rm
tar
68
Intel R©Cluster Checker 1.3u2 imb_pingpong_intel_mpi
test
touch
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.
69
Intel R©Cluster Checker 1.3u2 intel_cc
51 intel_cc
Check the Intel(R) C++ Compiler
DESCRIPTION
intel_cc is an Intel(R) Cluster Checker module to verify the Intel(R) C++ Compiler. The module checks the compilerversion and builds and executes C and C++ ’Hello World’ programs.Note: this module builds the Hello World binaries from source using the Intel(R) C++ Compiler. If you wish to checkthe functionality of the compiler runtime only, see theintel_cce_rtl module.
CONFIGURATION
cc-path
The base path to the Intel(R) C++ compiler. Setting this parameter will automatically setup the environment.Default: none (inherit environment)
version
The string that is compared to the Intel(R) C++ compiler build stamp.Default: noneIf versionis not specified in the configuration file, then the specific compiler version will not be checked.The samenessof the package ID is compared across the cluster regardless.
Example
<intel_cc><cc-path>/opt/intel/cce/9.1</cc-path>
</intel_cc>
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) C++ Compiler
test
which
70
Intel R©Cluster Checker 1.3u2 intel_cc
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<cc-path> tag to override.
71
Intel R©Cluster Checker 1.3u2 intel_cc_rtl_9_1
52 intel_cc_rtl_9_1
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the 32-bit Intel(R) C++ Compiler runtime meets requirements.
METHOD
Compare the 32-bit Intel(R) C++ Compiler runtime version to 9.1Confirm that the following files exist on all nodes:
/opt/intel/cc/<version>/lib/libcprts.so (9.x only)
/opt/intel/cc/<version>/lib/libcprts.so.5 (9.x only)
/opt/intel/cc/<version>/lib/libcxa.so (9.x only)
/opt/intel/cc/<version>/lib/libcxa.so.5 (9.x only)
/opt/intel/cc/<version>/lib/libcxaguard.so
/opt/intel/cc/<version>/lib/libcxaguard.so.5
/opt/intel/cc/<version>/lib/libguide.so
/opt/intel/cc/<version>/lib/libguide_stats.so
/opt/intel/cc/<version>/lib/libimf.so
/opt/intel/cc/<version>/lib/libintlc.so (10.x only)
/opt/intel/cc/<version>/lib/libintlc.so.5 (10.x only)
/opt/intel/cc/<version>/lib/libirc.so
/opt/intel/cc/<version>/lib/libsvml.so
/opt/intel/cc/<version>/lib/libunwind.so (9.x only)
/opt/intel/cc/<version>/lib/libunwind.so.5 (9.x only)
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
72
Intel R©Cluster Checker 1.3u2 intel_cc_rtl_9_1
EXTERNAL DEPENDENCIES
perl
Intel(R) C++ Compiler 9.1 or later
73
Intel R©Cluster Checker 1.3u2 intel_cce_rtl
53 intel_cce_rtl
Check the Intel(R) C++ Compiler runtime libraries
DESCRIPTION
intel_cce_rtl is an Intel(R) Cluster Checker module to verify the Intel(R) C++ Compiler runtime libraries. The moduleruns C and C++ Hello World binaries on the compute nodes.Note: this module does not build the Hello World binaries using the Intel(R) C++ Compiler. If you wish to check thefunctionality of the compiler itself, see theintel_cc module.
CONFIGURATION
cc-path
The base path to the Intel(R) C++ Compiler installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
Example
<intel_cce_rtl><cc-path>/opt/intel/cce/9.1</cc-path>
</intel_cce_rtl>
MODULE CLASS
unit
DEPENDENCIES
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) C++ Compiler 9.1 or later runtime
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH) inherited from the user running Intel(R) ClusterChecker is setup correctly.
74
Intel R©Cluster Checker 1.3u2 intel_cce_rtl_9_1
54 intel_cce_rtl_9_1
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the 64-bit Intel C++ Compiler runtime meets requirements.
METHOD
Compare the 64-bit Intel(R) C++ Compiler runtime version to 9.1Confirm that the following files exist on all nodes:
/opt/intel/cce/<version>/lib/libclusterguide.so (9.x only)
/opt/intel/cce/<version>/lib/libclusterguide_stats.so (9.x only)
/opt/intel/cce/<version>/lib/libcprts.so (9.x only)
/opt/intel/cce/<version>/lib/libcprts.so.5 (9.x only)
/opt/intel/cce/<version>/lib/libcxa.so (9.x only)
/opt/intel/cce/<version>/lib/libcxa.so.5 (9.x only)
/opt/intel/cce/<version>/lib/libcxaguard.so
/opt/intel/cce/<version>/lib/libcxaguard.so.5
/opt/intel/cce/<version>/lib/libguide.so
/opt/intel/cce/<version>/lib/libguide_stats.so
/opt/intel/cce/<version>/lib/libimf.so
/opt/intel/cce/<version>/lib/libintlc.so (10.x only)
/opt/intel/cce/<version>/lib/libintlc.so.5 (10.x only)
/opt/intel/cce/<version>/lib/libirc.so
/opt/intel/cce/<version>/lib/libomp_db.so
/opt/intel/cce/<version>/lib/libompstub.so (10.x only)
/opt/intel/cce/<version>/lib/libsvml.so
/opt/intel/cce/<version>/lib/libunwind.so (9.x only)
/opt/intel/cce/<version>/lib/libunwind.so.5 (9.x only)
CONFIGURATION
None
MODULE CLASS
unit
75
Intel R©Cluster Checker 1.3u2 intel_cce_rtl_9_1
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
perl
Intel(R) C++ Compiler 9.1 or later
76
Intel R©Cluster Checker 1.3u2 intel_cmkl_rtl_9_0
55 intel_cmkl_rtl_9_0
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the Intel(R) Math Kernel Library Cluster Edition runtime meets requirements.
METHOD
Compare the Intel(R) Math Kernel Library Cluster Edition runtime version to 9.0.Confirm that the following files exist on all nodes:
/opt/intel/cmkl/<version>/lib/32/libguide.so
/opt/intel/cmkl/<version>/lib/32/libmkl.so
/opt/intel/cmkl/<version>/lib/32/libmkl_core.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/32/libmkl_def.so
/opt/intel/cmkl/<version>/lib/32/libmkl_gf.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/32/libmkl_ias.so (9.0 and 9.1 only)
/opt/intel/cmkl/<version>/lib/32/libmkl_intel.so (10.0 and later)
/opt/intel/mckl/<version>/lib/32/libmkl_intel_thread.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/32/libmkl_lapack.so (9.1 and later)
/opt/intel/cmkl/<version>/lib/32/libmkl_lapack32.so (9.0 only)
/opt/intel/cmkl/<version>/lib/32/libmkl_lapack64.so (9.0 only)
/opt/intel/cmkl/<version>/lib/32/libmkl_p3.so (up to 10.0)
/opt/intel/cmkl/<version>/lib/32/libmkl_p4.so
/opt/intel/cmkl/<version>/lib/32/libmkl_p4m.so
/opt/intel/cmkl/<version>/lib/32/libmkl_p4p.so
/opt/intel/cmkl/<version>/lib/32/libmkl_sequential.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/32/libmkl_vml_def.so
/opt/intel/cmkl/<version>/lib/32/libmkl_vml_ia.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p3.so (up to 10.0)
/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p4.so
/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p4m.so
/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p4m2.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/32/libmkl_vml_p4p.so
/opt/intel/cmkl/<version>/lib/32/libvml.so (9.0 and 9.1 only)
77
Intel R©Cluster Checker 1.3u2 intel_cmkl_rtl_9_0
/opt/intel/cmkl/<version>/lib/em64t/libguide.so
/opt/intel/cmkl/<version>/lib/em64t/libmkl.so
/opt/intel/cmkl/<version>/lib/em64t/libmkl_core.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_def.so
/opt/intel/cmkl/<version>/lib/em64t/libmkl_gf_ilp64.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_gf_lp64.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_ias.so (9.0 and 9.1 only)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_intel_ilp64.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_intel_lp64.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_intel_sp2dp.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_intel_thread.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_lapack.so (9.1 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_lapack32.so (9.0 only)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_lapack64.so (9.0 only)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_mc.so
/opt/intel/cmkl/<version>/lib/em64t/libmkl_p4n.so
/opt/intel/cmkl/<version>/lib/em64t/libmkl_sequential.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_vml_def.so
/opt/intel/cmkl/<version>/lib/em64t/libmkl_vml_mc.so
/opt/intel/cmkl/<version>/lib/em64t/libmkl_vml_mc2.so (10.0 and later)
/opt/intel/cmkl/<version>/lib/em64t/libmkl_vml_p4n.so
/opt/intel/cmkl/<version>/lib/em64t/libvml.so (9.0 and 9.1 only)
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
perl
Intel(R) Math Kernel Library Cluster Edition 9.0 or later
78
Intel R©Cluster Checker 1.3u2 intel_devtools_1_0
56 intel_devtools_1_0
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the developer tools meets requirements.
METHOD
Compare the Intel(R) C++ Compiler version to 9.1.038 and confirm that the following files exist:
bin/icc
Compare the Intel(R) Fortran Compiler version to 9.1.032 and confirm that the following files exist:
bin/ifort
Compare the Intel(R) MPI Library version to 3.0.033 and confirm that the following files exist:
bin/mpicc
bin/mpicxx
bin/mpif77
bin/mpif90
bin/mpiicc
bin/mpiicpc
bin/mpiifort
bin64/mpicc
bin64/mpicxx
bin64/mpif77
bin64/mpif90
bin64/mpiicc
bin64/mpiicpc
bin64/mpiifort
Compare the Intel(R) Math Kernel Library Cluster Edition version to 9.0.017.Compare the Intel(R) Trace Analyzer and Collector version to 7.0.1 and confirm that the following files exist:
bin/traceanalyzer
Compare the Intel(R) Debugger version to 9.1 and confirm that the following files exist:
bin/idb
Compare the Intel(R) Thread Checker version to 3.0.Compare the Intel(R) Thread Profiler version to 1.0.Compare the Intel(R) VTune(TM) Performance Analyzer version to 8.0.4.
79
Intel R©Cluster Checker 1.3u2 intel_devtools_1_0
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
genuine_intel
perl
sh
EXTERNAL DEPENDENCIES
find
grep
Intel Software Tools
perl
xargs
80
Intel R©Cluster Checker 1.3u2 intel_fc
57 intel_fc
Check the Intel(R) Fortran compiler
DESCRIPTION
intel_fc is an Intel(R) Cluster Checker module to verify the Intel(R) Fortran Compiler. The module examines thecompiler version and builds and executes a Fortran ’Hello World’ program.Note: this module builds the Hello World binary from source using the Intel(R) Fortran Compiler. If you wish to checkthe functionality of the compiler runtime only, see theintel_fce_rtl module.
CONFIGURATION
fc-path
The base path to the Intel(R) Fortran Compiler. Setting this parameter will automatically setup the environment.Default: none (inherit environment)
version
The string that is compared to the Intel(R) Fortran Compiler build stamp.Default: noneIf versionis not specified in the configuration file, then the specific compiler version will not be checked.The unifor-mity of the package ID is compared across the cluster regardless.
Example
<intel_fc><fc-path>/opt/intel/fce/9.1</fc-path>
</intel_fc>
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) Fortran Compiler
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<fc-path> tag to override.
81
Intel R©Cluster Checker 1.3u2 intel_fc_rtl_9_1
58 intel_fc_rtl_9_1
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the 32-bit Intel(R) Fortran Compiler runtime meets requirements.
METHOD
Compare the 32-bit Intel(R) Fortran Compiler runtime version to 9.1Confirm that the following files exist on all nodes:
/opt/intel/fc/<version>/lib/libcprts.so (9.x only)
/opt/intel/fc/<version>/lib/libcprts.so.5 (9.x only)
/opt/intel/fc/<version>/lib/libcxa.so (9.x only)
/opt/intel/fc/<version>/lib/libcxa.so.5 (9.x only)
/opt/intel/fc/<version>/lib/libcxaguard.so
/opt/intel/fc/<version>/lib/libcxaguard.so.5
/opt/intel/fc/<version>/lib/libguide.so
/opt/intel/fc/<version>/lib/libguide_stats.so
/opt/intel/fc/<version>/lib/libifcoremt.so
/opt/intel/fc/<version>/lib/libifcoremt.so.5
/opt/intel/fc/<version>/lib/libifcore.so
/opt/intel/fc/<version>/lib/libifcore.so.5
/opt/intel/fc/<version>/lib/libifport.so
/opt/intel/fc/<version>/lib/libifport.so.5
/opt/intel/fc/<version>/lib/libimf.so
/opt/intel/fc/<version>/lib/libintlc.so (10.x only)
/opt/intel/fc/<version>/lib/libintlc.so.5 (10.x only)
/opt/intel/fc/<version>/lib/libirc.so
/opt/intel/fc/<version>/lib/libsvml.so
/opt/intel/fc/<version>/lib/libunwind.so (9.x only)
/opt/intel/fc/<version>/lib/libunwind.so.5 (9.x only)
CONFIGURATION
None
82
Intel R©Cluster Checker 1.3u2 intel_fc_rtl_9_1
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
perl
Intel(R) Fortran Compiler 9.1 or later
83
Intel R©Cluster Checker 1.3u2 intel_fce_rtl
59 intel_fce_rtl
Check the Intel(R) Fortran compiler runtime libraries
DESCRIPTION
intel_fce_rtl is an Intel(R) Cluster Checker module to verify the Intel(R) Fortran compiler runtime libraries. Themodule runs a Fortran Hello World binary on the compute nodes.Note: this module does not build the Hello World binary using the Intel(R) Fortran Compiler. If you wish to check thefunctionality of the compiler itself, see theintel_fc module.
CONFIGURATION
fc-path
The base path to the Intel(R) Fortran Compiler installation directory. Setting this parameter will automatically setupthe environment.Default: none (inherit environment)
Example
<intel_fce_rtl><fc-path>/opt/intel/fce/9.1</fc-path>
</intel_fce_rtl>
MODULE CLASS
unit
DEPENDENCIES
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) Fortran Compiler 9.1 or later
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH) inherited from the user running Intel(R) ClusterChecker is setup correctly.
84
Intel R©Cluster Checker 1.3u2 intel_fce_rtl_9_1
60 intel_fce_rtl_9_1
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the 64-bit Intel(R) Fortran Compiler runtime meets requirements.
METHOD
Compare the 64-bit Intel(R) Fortran Compiler runtime version to 9.1Confirm that the following files exist on all nodes:
/opt/intel/fce/<version>/lib/libclusterguide.so (9.x only)
/opt/intel/fce/<version>/lib/libclusterguide_stats.so (9.x only)
/opt/intel/fce/<version>/lib/libcprts.so (9.x only)
/opt/intel/fce/<version>/lib/libcprts.so.5 (9.x only)
/opt/intel/fce/<version>/lib/libcxa.so (9.x only)
/opt/intel/fce/<version>/lib/libcxa.so.5 (9.x only)
/opt/intel/fce/<version>/lib/libcxaguard.so
/opt/intel/fce/<version>/lib/libcxaguard.so.5
/opt/intel/fce/<version>/lib/libguide.so
/opt/intel/fce/<version>/lib/libguide_stats.so
/opt/intel/fce/<version>/lib/libifcoremt.so
/opt/intel/fce/<version>/lib/libifcoremt.so.5
/opt/intel/fce/<version>/lib/libifcore.so
/opt/intel/fce/<version>/lib/libifcore.so.5
/opt/intel/fce/<version>/lib/libifport.so
/opt/intel/fce/<version>/lib/libifport.so.5
/opt/intel/fce/<version>/lib/libimf.so
/opt/intel/fce/<version>/lib/libintlc.so (10.x only)
/opt/intel/fce/<version>/lib/libintlc.so.5 (10.x only)
/opt/intel/fce/<version>/lib/libirc.so
/opt/intel/fce/<version>/lib/libomp_db.so
/opt/intel/fce/<version>/lib/libompstub.so (10.x only)
/opt/intel/fce/<version>/lib/libsvml.so
/opt/intel/fce/<version>/lib/libunwind.so (9.x only)
/opt/intel/fce/<version>/lib/libunwind.so.5 (9.x only)
85
Intel R©Cluster Checker 1.3u2 intel_fce_rtl_9_1
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
perl
Intel(R) Fortran Compiler 9.1 or later
86
Intel R©Cluster Checker 1.3u2 intel_mpi
61 intel_mpi
Check the Intel(R) MPI Library
DESCRIPTION
intel_mpi is an Intel(R) Cluster Checker module to verify the functionality of Intel(R) MPI Library on each node.This test does not verify that inter-node functionality of Intel(R) MPI Library. The module checks the permissions on$HOME/.mpd.conf, starts and stops mpds, compiles a MPI Hello World binary from source, and runs the program onone or more Intel(R) MPI Library devices.Note: this module builds the MPI Hello World binary from source using the Intel(R) MPI Library compiler wrappers(i.e., mpicc.) If you wish to check the functionality of the MPI runtime only, see theintel_mpi_rt module.
CONFIGURATION
device
The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm
mpi-path
The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
process-number
The number of MPI processes to start on each nodeDefault: 4
Example
<intel_mpi><device>sock</device><device>rdssm</device><mpi-path>/opt/intel/mpi/3.0</mpi-path><process-number>2</process-number>
</intel_mpi>
MODULE CLASS
unit
DEPENDENCIES
gcc
hostname
loopback
python
87
Intel R©Cluster Checker 1.3u2 intel_mpi
sh
shm_mount
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) MPI Library
mktemp
stat
rm
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.
88
Intel R©Cluster Checker 1.3u2 intel_mpi_internode
62 intel_mpi_internode
Check the Intel(R) MPI Library
DESCRIPTION
intel_mpi_internode is an Intel(R) Cluster Checker module to verify the functionality of Intel(R) MPI Library on thewhole cluster. The module starts and stops mpds, compiles a MPI Hello World program from source, and runs theprogram on one or more Intel(R) MPI Library devices.Note: this module builds the MPI Hello World binary from source using the Intel(R) MPI Library compiler wrappers(i.e., mpicc.) If you wish to check the functionality of the MPI runtime only, see theintel_mpi_rt_internodemodule.
CONFIGURATION
device
The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm
mpi-path
The base path to the Intel(R) MPI Library directory. Setting this parameter will automatically setup the environment.Default: none (inherit environment)
process-number
The number of MPI processes to start on each nodeDefault: 4
Example
<intel_mpi_internode><device>sock</device><device>rdssm</device><mpi-path>/opt/intel/mpi/3.0</mpi-path>
</intel_mpi_internode>
MODULE CLASS
vector
DEPENDENCIES
gcc
hostname
intel_mpi
loopback
sh
ssh_alltoall
89
Intel R©Cluster Checker 1.3u2 intel_mpi_internode
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) MPI Library
mktemp
rm
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<mpi-path> tag to override.Automatically sets I_MPI_USE_DYNAMIC_CONNECTIONS=1.
90
Intel R©Cluster Checker 1.3u2 intel_mpi_rt
63 intel_mpi_rt
Check the functionality of the Intel(R) MPI Library Runtime Environment
DESCRIPTION
intel_mpi_rt is an Intel(R) Cluster Checker module to verify the basic functionality of the Intel(R) MPI Library Run-time Environment on each node. This test does not verify that inter-node functionality of the Intel(R) MPI Library.The module checks the permissions on $HOME/.mpd.conf, starts and stops mpds, and run a MPI Hello World programon one or more Intel(R) MPI Library devices.Note: this module does not build the MPI Hello World binary using the Intel(R) MPI Library compiler wrappers (i.e.,mpicc.) If you wish to check the functionality of the MPI compiler wrappers, see theintel_mpi module.
CONFIGURATION
device
The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm
mpi-path
The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
process-number
The number of MPI processes to start on each nodeDefault: 4
Example
<intel_mpi_rt><device>sock</device><device>rdssm</device><path>/opt/intel/mpi-rt/3.0</path><process-number>2</process-number>
</intel_mpi_rt>
MODULE CLASS
unit
DEPENDENCIES
hostname
loopback
python
sh
91
Intel R©Cluster Checker 1.3u2 intel_mpi_rt
shm_mount
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) MPI Library 3.0 or later
mktemp
stat
rm
NOTES
By default, assumes that the environment inherited from the user running Intel(R) Cluster Checker is setup correctly.See the<mpi-path> tag to override.
92
Intel R©Cluster Checker 1.3u2 intel_mpi_rt_3_0_033
64 intel_mpi_rt_3_0_033
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the Intel(R) MPI Library runtime meets requirements.
METHOD
Compare the Intel(R) MPI Library version to 3.0 build 033.Confirm that the following files exist on all nodes:
/opt/intel/impi/ <version>/{bin,bin64}/mpdallexit
/opt/intel/impi/ <version>/{bin,bin64}/mpdallexit.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdboot
/opt/intel/impi/ <version>/{bin,bin64}/mpdboot.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdcheck
/opt/intel/impi/ <version>/{bin,bin64}/mpdcheck.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdchkpyver.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdcleanup
/opt/intel/impi/ <version>/{bin,bin64}/mpdcleanup.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdexit
/opt/intel/impi/ <version>/{bin,bin64}/mpdexit.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdgdbdrv.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdhelp
/opt/intel/impi/ <version>/{bin,bin64}/mpdhelp.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdkilljob
/opt/intel/impi/ <version>/{bin,bin64}/mpdkilljob.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdlib.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdlistjobs
/opt/intel/impi/ <version>/{bin,bin64}/mpdlistjobs.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdman.py
/opt/intel/impi/ <version>/{bin,bin64}/mpd
/opt/intel/impi/ <version>/{bin,bin64}/mpd.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdringtest
/opt/intel/impi/ <version>/{bin,bin64}/mpdringtest.py
93
Intel R©Cluster Checker 1.3u2 intel_mpi_rt_3_0_033
/opt/intel/impi/ <version>/{bin,bin64}/mpdroot
/opt/intel/impi/ <version>/{bin,bin64}/mpdrun
/opt/intel/impi/ <version>/{bin,bin64}/mpdrun.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdsigjob
/opt/intel/impi/ <version>/{bin,bin64}/mpdsigjob.py
/opt/intel/impi/ <version>/{bin,bin64}/mpdtrace
/opt/intel/impi/ <version>/{bin,bin64}/mpdtrace.py
/opt/intel/impi/ <version>/{bin,bin64}/mpiexec
/opt/intel/impi/ <version>/{bin,bin64}/mpiexec.py
/opt/intel/impi/ <version>/{bin,bin64}/mpirun
/opt/intel/impi/ <version>/{bin,bin64}/mtv.so
/opt/intel/impi/ <version>/{lib,lib64}/libmpi.so.3.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpi.so.2.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpi.so
/opt/intel/impi/ <version>/{lib,lib64}/libmpi_mt.so.3.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpi_mt.so
/opt/intel/impi/ <version>/lib/libmpiec.so.3.1 (3.0 only)
/opt/intel/impi/ <version>/lib/libmpiec.so.2.1 (3.0 only)
/opt/intel/impi/ <version>/lib/libmpiec.so (3.0 only)
/opt/intel/impi/ <version>/lib/libmpief.so.3.1 (3.0 only)
/opt/intel/impi/ <version>/lib/libmpief.so.2.1 (3.0 only)
/opt/intel/impi/ <version>/lib/libmpief.so (3.0 only)
/opt/intel/impi/ <version>/lib/libmpigc.so.3.1 (3.0 only)
/opt/intel/impi/ <version>/lib/libmpigc.so.2.1 (3.0 only)
/opt/intel/impi/ <version>/lib/libmpigc.so (3.0 only)
/opt/intel/impi/ <version>/{lib,lib64}/libmpigc3.so.3.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpigc3.so.2.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpigc3.so
/opt/intel/impi/ <version>/{lib,lib64}/libmpigc4.so
/opt/intel/impi/ <version>/{lib,lib64}/libmpigf.so.3.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpigf.so.2.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpigf.so
94
Intel R©Cluster Checker 1.3u2 intel_mpi_rt_3_0_033
/opt/intel/impi/ <version>/{lib,lib64}/libmpiic.so.3.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpiic.so.2.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpiic.so
/opt/intel/impi/ <version>/{lib,lib64}/libmpiic4.so
/opt/intel/impi/ <version>/{lib,lib64}/libmpiif.so.3.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpiif.so.2.1
/opt/intel/impi/ <version>/{lib,lib64}/libmpiif.so
/opt/intel/impi/ <version>/lib64/libmpigc4.so.3.1
/opt/intel/impi/ <version>/lib64/libmpiic4.so.3.1
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
perl
Intel(R) MPI Library 3.0 build 033 or later
95
Intel R©Cluster Checker 1.3u2 intel_mpi_rt_internode
65 intel_mpi_rt_internode
Check the functionality of the Intel(R) MPI Library Runtime Environment
DESCRIPTION
intel_mpi_rt_internode is an Intel(R) Cluster Checker module to verify the basic functionality of Intel(R) MPI LibraryRuntime Environment over the whole cluster. The module starts and stops mpds, and runs a MPI Hello World programon one or more Intel(R) MPI Library devices.Note: this module does not build the MPI Hello World binary using the Intel(R) MPI Library compiler wrappers (i.e.,mpicc.) If you wish to check the functionality of the MPI compiler wrappers, see theintel_mpi_internodemodule.
CONFIGURATION
device
The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm
mpi-path
The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
process-number
The number of MPI processes to start on each nodeDefault: 4
Example
<intel_mpi_rt_internode><device>sock</device><device>rdssm</device><mpi-path>/opt/intel/mpi-rt/3.0</mpi-path>
</intel_mpi_rt_internode>
MODULE CLASS
vector
DEPENDENCIES
hostname
intel_mpi_rt
loopback
sh
ssh_alltoall
96
Intel R©Cluster Checker 1.3u2 intel_mpi_rt_internode
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) MPI Library 3.0 or later
mktemp
rm
NOTES
By default, assumes that the environment inherited from the user running Intel(R) Cluster Checker is setup correctly.See the<mpi-path> tag to override.Automatically sets I_MPI_USE_DYNAMIC_CONNECTIONS=1.
97
Intel R©Cluster Checker 1.3u2 intel_mpi_testsuite
66 intel_mpi_testsuite
Check the Intel(R) MPI Library runtime and network stack
DESCRIPTION
intel_mpi_testsuite is an Intel(R) Cluster Checker module to verify the Intel(R) MPI Library runtime and networkstack by running the Intel(R) MPI Library Test Suite.
CONFIGURATION
cc-path
The base path to the Intel(R) C++ Compiler installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
device
The Intel(R) MPI Library device string, i.e., I_MPI_DEVICE. May be specified more than once.Default: rdssm
fc-path
The base path to the Intel(R) Fortran Compiler installation directory. Setting this parameter will automatically setupthe environment.Default: none (inherit environment)
mpi-path
The base path to the Intel(R) MPI Library installation directory. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
Example
<intel_mpi_testsuite><device>sock</device><device>rdssm:Openib-ib0</device><mpi-path>/opt/intel/mpi-rt/3.0</mpi-path>
</intel_mpi_testsuite>
MODULE CLASS
vector
DEPENDENCIES
intel_cce_rtl
intel_fce_rtl
intel_mpi_rt_internode
98
Intel R©Cluster Checker 1.3u2 intel_mpi_testsuite
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) C++ Library 9.1 or later
Intel(R) Fortran Library 9.1 or later
Intel(R) MPI Library 3.0 or later
Intel(R) MPI Library Test Suite
mkdir
rm
tar
NOTES
By default, assumes that the environment inherited from the user running Intel(R) Cluster Checker is setup correctly.See the<cc-path>, <fc-path>, and<mpi-path> tags to override.
99
Intel R©Cluster Checker 1.3u2 intel_tbb_rtl_1_0
67 intel_tbb_rtl_1_0
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the Intel(R) Threading Building Blocks runtime meets requirements.
METHOD
Compare the Intel(R) Threading Building Blocks runtime version to 1.0.Confirm that the following files exist on all nodes:
/opt/intel/tbb/<version>/lib/libtbb.so
/opt/intel/tbb/<version>/lib/libtbb_debug.so
/opt/intel/tbb/<version>/lib/libtbbmalloc.so
/opt/intel/tbb/<version>/lib/libtbbmalloc_debug.so
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
perl
Intel(R) Threading Building Blocks 1.0 or later
100
Intel R©Cluster Checker 1.3u2 ip_alltoall
68 ip_alltoall
Check the uniformity of IP addresses
DESCRIPTION
ip_alltoall.pm is an Intel(R) Cluster Checker module to verify the resolution of every host’s IP from every other host.
CONFIGURATION
None
MODULE CLASS
matrix
DEPENDENCIES
ssh
ping_alltoall
genuine_intel
EXTERNAL DEPENDENCIES
ping
101
Intel R©Cluster Checker 1.3u2 ipoib
69 ipoib
Check that the InfiniBand devices are configured
DESCRIPTION
ipoib is an Intel(R) Cluster Checker module to verify that all the InfiniBand devices are properly configured.
CONFIGURATION
down
The specified device should not be verified as it is not configured. This option may be specified multiple times to markmore than one port as down.Default: none
Example
<ipoib><down>ib1</down>
</ipoib>
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ifconfig
102
Intel R©Cluster Checker 1.3u2 java_1_4_2
70 java_1_4_2
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the Java Runtime Environment meets requirements.
METHOD
Compare the Java Runtime Environment version to 1.4.2.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
Java Runtime Environment 1.4.2 or later
103
Intel R©Cluster Checker 1.3u2 jdk_1_4_2
71 jdk_1_4_2
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the Java Software Development Kit meets requirements.
METHOD
Compare the Java compiler version to 1.4.2.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
Java Software Development Kit 1.4.2 or later
104
Intel R©Cluster Checker 1.3u2 kernel
72 kernel
Check the uniformity of the kernel version on all nodes
DESCRIPTION
kernel is an Intel(R) Cluster Checker module to verify the kernel version of a node. The module examines the outputof ‘uname -r‘ to compare against the other nodes.
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
uname
105
Intel R©Cluster Checker 1.3u2 kernel_2_6_17
73 kernel_2_6_17
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the Linux kernel meets requirements.
METHOD
Compare the kernel version to the list of sufficient kernels.Note: see the Intel(R) Cluster Ready Specification for a list of sufficient kernels.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
uname
106
Intel R©Cluster Checker 1.3u2 kernel_modules
74 kernel_modules
Check the loaded Linux kernel modules
DESCRIPTION
kernel_modules is an Intel(R) Cluster Checker module to verify the list of loaded kernel modules. It verifies theuniformity of the kernel modules loaded on the compute nodes. If the head node is also a compute node, the test alsoverifies that modules loaded on the compute nodes are also loaded on the head node (although the head node may haveadditional modules loaded that are not present on the compute nodes.)
CONFIGURATION
exclude
Exclude this kernel module from the check, i.e., disregard any non-uniformity between nodes. May be specifiedmultiple times to exclude more than one module.Default: joydev , sr_mod , usb_storage , ohci_hcd
extended_match
Use the fulllsmod output when determining uniformity. By default, only whether the kernel module is loaded ischecked. Specifying this option includes the size, load count, and list of referring modules in the uniformity check.Default: false
module
Explicitly verify that the specified kernel module is loaded. May be specified multiple times to include more than onemodule.Default: none
Example
<kernel_modules><exclude>sunrpc</exclude><extended_match/><module>ipoib</module><module>e1000</module>
</kernel_modules>
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
lsmod
107
Intel R©Cluster Checker 1.3u2 kernel_parameters
75 kernel_parameters
Check the uniformity of the kernel runtime parameters
DESCRIPTION
kernel_parameters is an Intel(R) Cluster Checker module to verify the uniformity of the kernel runtime parametersamong all nodes. The module uses thesysctl program to collect the kernel parameters.
CONFIGURATION
exclude
Kernel parameter to exclude from the check based on a regular expression match. May be specified multiple times. Thefollowing parameters are automatically excluded:kernel.hostname , kernel.domainname , kernel.random* ,kernel.pty.nr , fs.dentry-state , fs.file-max , fs.file-nr , fs.inode-nr , fs.inode-state ,fs.nfs.nfs3_acl_max_entries , fs.nfsd.nfsd3_acl_max_entries , andfs.quota.syncs . Thefollowing parameter subsets are also excluded:net.ipv4.conf.* , net.ipv4.neigh.* , net.ipv4.netfilter.* ,net.ipv6.conf.* , net.ipv6.neigh.* , fs.nfs.* .
Example
<kernel_parameters><exclude>eth1</exclude>
</kernel_parameters>
MODULE CLASS
vector
DEPENDENCIES
mount_proc
sh
ssh
genuine_intel
EXTERNAL DEPENDENCIES
sysctl
108
Intel R©Cluster Checker 1.3u2 ksh
76 ksh
Check the Korn Shell
DESCRIPTION
ksh is an Intel(R) Cluster Checker module to verify the Korn Shell. The module tests if/bin/ksh exists and runs a’Hello World’ script.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
/bin/ksh
109
Intel R©Cluster Checker 1.3u2 lib32_counterpart_lib64
77 lib32_counterpart_lib64
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the 32-bit libraries meet requirements.
METHOD
Confirm that all 32-bit libraries in the dynamic linker cache have a 64-bit counterpart.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
/sbin/ldconfig
110
Intel R©Cluster Checker 1.3u2 loopback
78 loopback
Check that the loopback address is correctly configured
DESCRIPTION
loopback is an Intel(R) Cluster Checker module to verify that the loopback address is correctly configured. Theloopback address (127.0.0.1) must correspond to ’localhost’ in /etc/hosts, and both must respond to ping.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
grep
ping
111
Intel R©Cluster Checker 1.3u2 memory_bandwidth_stream
79 memory_bandwidth_stream
Check the memory bandwidth of a node using the STREAM benchmark
DESCRIPTION
memory_bandwidth_stream is an Intel(R) Cluster Checker module to verify the memory bandwidth of each node usingthe Triad STREAM benchmark and their deviation over the cluster nodes. STREAM is configured to use a 10 millionelement array (requires 229 MB of memory.)
CONFIGURATION
bandwidth
The minimally acceptable Triad memory bandwidth, in MB/s.Default: none
build
Build STREAM from source rather than using the prebuilt binary (external/stream .) If true, the Intel(R) CCompiler must be available; theintel_cc module should be added as a dependency.
cc-path
The base path to the Intel(R) C++ Compiler / runtime libraries. Setting this parameter will automatically setup theenvironment.Default: none (inherit environment)
deviation
The factor of allowed standard deviations from median, used to search for outlier values. The allowed range is (median-/+ deviation * stddev).Default: 3
threads
The number of OpenMP threads for STREAM to use. This is equivalent to the OMP_NUM_THREADS environmentvariable.Default: ALL
Example
<memory_bandwidth_stream><bandwidth>3600</bandwidth><cc-path>/opt/intel/cce/9.1</cc-path><deviation>3</deviation>
</memory_bandwidth_stream>
MODULE CLASS
unit
112
Intel R©Cluster Checker 1.3u2 memory_bandwidth_stream
DEPENDENCIES
intel_cce_rtl
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) C++ Compiler 9.1 or later runtime
NOTES
By default, assumes that the environment (LD_LIBRARY_PATH, PATH) inherited from the user running Intel(R)Cluster Checker is setup correctly. See the<cc-path> tag to override.
113
Intel R©Cluster Checker 1.3u2 mflops_intel_mkl
80 mflops_intel_mkl
Check the floating point performance of a node
DESCRIPTION
mflops_intel_mkl is an Intel(R) Cluster Checker module to verify the floating point performance of each cluster node.The module executes the DGEMM library routine from the Intel(R) Math Kernel Library to measure the floating pointperformance and deviation over the cluster nodes. The environment variable OMP_NUM_THREADS is set to ALLso that all cores are used.
CONFIGURATION
build
Build the DGEMM benchmark from source rather than using the prebuilt binary. If true, the Intel(R) Math Kernel Li-brary must be in the linker path or thepath option should be used; thegcc module should be added as a dependency.Default: false
mflops
The minimum acceptable floating point performance in MFLOPS.Default: noneIf no MFLOPS threshold is defined, the module only confirms that the measured MFLOPS is greater than zero.
m, n, k
The matrix dimensions used in DGEMM. The total memory required is (m*n + m*k + n*k) * sizeof(double) bytes.Default: m = 5000, n = 5000, k = 112 (memory requirement = 200 MB if sizeof(double) is 8 bytes)
mkl-path
The base path to the Intel(R) Math Kernel Library.Default: none
deviation
The factor of allowed standard deviations from median, used to search for outlier values. The allowed range is (median-/+ deviation * stddev).Default: 3
Example
<mflops_intel_mkl><deviation>3</deviation><mkl-path>/opt/intel/cmkl/9.0</mkl-path><mflops>6000</mflops><m>5000</m><n>5000</n><k>112</k>
</mflops_intel_mkl>
114
Intel R©Cluster Checker 1.3u2 mflops_intel_mkl
MODULE CLASS
vector
DEPENDENCIES
gcc
sh
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
Intel(R) Math Kernel Library 9.0 or later
115
Intel R©Cluster Checker 1.3u2 mount_proc
81 mount_proc
Check that the procfs filesystem (/proc) is mounted
DESCRIPTION
mount_proc is an Intel(R) Cluster Checker module to verify that the procfs filesystem is mounted on /proc.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
EXTERNAL DEPENDENCIES
None
NOTES
Assumes that /proc is the mount point for procfs.
116
Intel R©Cluster Checker 1.3u2 mpi_consistency
82 mpi_consistency
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the MPI job startup commands meet requirements.
METHOD
Confirm that the path to mpirun and mpiexec are consistent on all nodes.
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
which
117
Intel R©Cluster Checker 1.3u2 network_consistency
83 network_consistency
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the resolution of hostnames and IPs meets requirements.
METHOD
Lookup the IP for each compute node name on each compute node.
CONFIGURATION
None
MODULE CLASS
matrix
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ping
118
Intel R©Cluster Checker 1.3u2 nfs_mounts
84 nfs_mounts
Check NFS mount points
DESCRIPTION
nfs_mounts is an Intel(R) Cluster Checker module to verify that NFS filesystems are mounted on the compute nodes.The test does not verify mountpoints on the head node.
CONFIGURATION
filesystem
Filesystem type to check. If this tag is defined defaults won’t be used.Default: nfs and autofs
mountpoint
The mountpoint to be checked. This option may be specified multiple times to check more than one mountpoint. If nomountpoints are specified, this test will return with an indeterminate result.Default: none
Example
<nfs_mounts><filesystem>nfs</filesystem><filesystem>ext3</filesystem><mountpoint>/opt</mountpoint><mountpoint>/shared</mountpoint>
</nfs_mounts>
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
mount
119
Intel R©Cluster Checker 1.3u2 nisdomain
85 nisdomain
Check that all nodes belong to the same NIS/YP domain
DESCRIPTION
nisdomain is an Intel(R) Cluster Checker module to verify the NIS domain of a node. The module usesnisdomainname .
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
genuine_intel
EXTERNAL DEPENDENCIES
nisdomainname
120
Intel R©Cluster Checker 1.3u2 nismaps
86 nismaps
Check the uniformity of the NIS/YP password map
DESCRIPTION
nismaps is an Intel(R) Cluster Checker module to verify the NIS password maps are uniform. The password maps areconsidered uniform if the md5 checksum is identical on all nodes.
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
nisdomain
genuine_intel
EXTERNAL DEPENDENCIES
md5sum
ypcat
121
Intel R©Cluster Checker 1.3u2 nsswitch
87 nsswitch
Check the configuration of /etc/nsswitch.conf
DESCRIPTION
nsswitch is an Intel(R) Cluster Checker module to verify the contents of the nsswitch.conf file. It verifies that/etc/nsswitch.conf is consistent across the cluster (formatting variations are allowed, so long as the service order isthe same).
CONFIGURATION
None
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
cat
122
Intel R©Cluster Checker 1.3u2 openib
88 openib
Check the OpenFabrics Enterprise Distribution InfiniBand driver
DESCRIPTION
openib is an Intel(R) Cluster Checker module to verify the OpenIB InfiniBand (*) driver provided by the OpenFabricsEnterprise Distribution. This module confirms that the InfiniBand adapters are the same hardware and firmwarerevisions, the ports are in the Active state, and have the same rate and capabilities. The memlock memory limits in/etc/security/limits.conf are also verified.
CONFIGURATION
Configuration options are divided in three categories:
ibstat-path
ibstat command installation directory. This may be needed if it is installed in a location not present in user’s PATH.
adapter
A container for the InfiniBand adapter device. The <adapter> block may be repeated to verify multiple In-finiBand adapters.
device
The device name of the adapter, e.g., mthca0. This option must be specified for each<adapter> container.
Default: none
down-port
The specified InfiniBand adapter port that should not be verified and is considered in down state. This optionmay be specified multiple times to mark more than one port as down.
Default: none
firmware-version
The firmware version string. If this parameter is not specified, only the uniformity of the firmware version ischecked.
Default: none
rate
The raw link rate in Gbit/s. If this parameter is not specified, only the uniformity of the link rate is checked.
Default: none
memlock
The minimum value, in bytes, of the hard and soft memlock limits.Default: 2000000
123
Intel R©Cluster Checker 1.3u2 openib
Example
<openib><adapter>
<device>mthca0</device><firmware-version>4.7.600</firmware-version><rate>10</rate>
</adapter><adapter>
<device>mthca1</device><down-port>2</down-port><rate>20</rate>
</adapter><ibstat-path>/usr/local/ofed/bin</ibstat-path><memlock>2000000</memlock>
</openib>
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
cat
OpenFabrics Enterprise Distribution
124
Intel R©Cluster Checker 1.3u2 openssh_3_9
89 openssh_3_9
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that OpenSSH (*) meets requirements.
METHOD
Compare the ssh version to OpenSSH 3.9.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ssh
125
Intel R©Cluster Checker 1.3u2 packages
90 packages
Check that the cluster has installed a reference set of given packages.
DESCRIPTION
packages is an Intel(R) Cluster Checker module to verify that a node is an exact copy of a reference system. Themodule uses the output of the-packages output of a reference system as the basis of the comparison.
CONFIGURATION
node
The path to a reference file with a list of installed packages.Default: none
head
The path to a reference file with a list of installed packagesDefault: none
Example
<packages><head>head.list</head><node>node.list</node>
</packages>
MODULE CLASS
unit
DEPENDENCIES
sh
ssh
rpm
genuine_intel
EXTERNAL DEPENDENCIES
rpm
cat
126
Intel R©Cluster Checker 1.3u2 pci
91 pci
Check the uniformity of the devices on the PCI bus
DESCRIPTION
pci is an Intel(R) Cluster Checker module to verify the device IDs connected to the PCI bus. The module collects PCIbus data usinglspci and compares the output across the cluster.
CONFIGURATION
exclude
The name of the PCI device ID and type description to exclude from the check. May be specified multiple times toexclude more than one.Default: none
Example
<pci><exclude>03:00.3 Serial controller</exclude>
</pci>
MODULE CLASS
vector
DEPENDENCIES
sh
ssh
genuine_intel
EXTERNAL DEPENDENCIES
lspci
127
Intel R©Cluster Checker 1.3u2 perl
92 perl
Check the Perl interpreter
DESCRIPTION
perl is an Intel(R) Cluster Checker module to verify the Perl interpreter. The module examines the Perl version andruns a ’Hello World’ one-liner.
CONFIGURATION
perl-path
The base path to the Perl interpreterDefault: /usr/bin
version
The string that is compared to the Perl version.Default: noneIf version is not specified in the configuration file, then the specific Perl version will not be checked. The uniformityof the version string is checked regardless.
Example
<perl><perl-path>/usr/bin</perl-path><version>5.8.7</version>
</perl>
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
Perl
128
Intel R©Cluster Checker 1.3u2 perl_5_6_1
93 perl_5_6_1
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that Perl meets requirements.
METHOD
Compare the Perl version to 5.6.1.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
perl
129
Intel R©Cluster Checker 1.3u2 ping
94 ping
Check that all nodes respond to ping from the head node
DESCRIPTION
ping is an Intel(R) Cluster Checker module to verify that all nodes respond to ping from the head node.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
None
EXTERNAL DEPENDENCIES
ping
130
Intel R©Cluster Checker 1.3u2 ping_alltoall
95 ping_alltoall
Check that all nodes can ping all other nodes
DESCRIPTION
ping_alltoall is an Intel(R) Cluster Checker module to verify that every node can ping every other node.
CONFIGURATION
None
MODULE CLASS
matrix
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ping
131
Intel R©Cluster Checker 1.3u2 portal
96 portal
Check that all nodes can ping the portal
DESCRIPTION
portal is an Intel(R) Cluster Checker module to verify that the site portal can be reached from all nodes of the cluster.The module pings the portal to test the connection.
CONFIGURATION
portal-name
The name of the site portal machine.Default: portal
Example
<portal><portal-name>portal</portal-name>
</portal>
MODULE CLASS
unit
DEPENDENCIES
sh
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ping
132
Intel R©Cluster Checker 1.3u2 process_check
97 process_check
Check for stale processes
DESCRIPTION
process_check is an Intel(R) Cluster Checker module to verify that the process list does not contain runaway processes(in terms of cpu or memory usage), zombies, or other stale processes.
CONFIGURATION
elapsed_time
Time (in seconds) that is used to define a stale process. See alsoexempt_uids .Default: 3600
exclude
Process names that are excluded from the check. This option may be repeated to exclude more than one process name.Default: none
exempt_uids
uids lower than this value are exempt from the elapsed time check. Daemons, etc. started from system accounts shouldnot be flagged as stale regardless of how long they have been running. See alsoelapsed_time .Default: 400
percent_cpu
Percentage of cpu that is used to define a runaway process. Note: on some systems, the percent cpu is defined relativeto a single core, on others it is relative to all cores.Default: 5
percent_memory
Percentage of memory that is used to define a runaway process.Default: 1
zombie_allowed_elapsed_time
Time (in seconds) that is used to allow transient zombies. Intel(R) Cluster Checker and other applications may createtransient zombies that are quicky, but not instantly reaped. Do not flag these transient zombies as ’true’ zombieprocesses unless their elapsed time is greater than this value.Default: 1
Example
<process_check><elapsed_time>3600</elapsed_time><exclude>ntpd</exclude><exclude>portmap</exclude><percent_cpu>5</percent_cpu><percent_memory>1</percent_memory>
</process_check>
133
Intel R©Cluster Checker 1.3u2 process_check
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ps
134
Intel R©Cluster Checker 1.3u2 python
98 python
Check the Python interpreter
DESCRIPTION
python is an Intel(R) Cluster Checker module to verify the Python interpreter. The module examines the Pythonversion and runs a ’Hello World’ one-liner.
CONFIGURATION
python-path
The base path to the Python interpreterDefault: /usr/bin
version
The string that is compared to the Python version.Default: noneIf versionis not specified in the configuration file, then the specific Python version will not be checked.The uniformityof the version string is checked regardless.
Example
<python><python-path>/usr/bin</python-path><version>2.2.3</version>
</python>
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
python
135
Intel R©Cluster Checker 1.3u2 python_2_3_4
99 python_2_3_4
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that python meets requirements
METHOD
Compare the python version to 2.3.4.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
python
136
Intel R©Cluster Checker 1.3u2 rpm
100 rpm
Check the installed RPM packages
DESCRIPTION
rpm is an Intel(R) Cluster Checker module to verify the list of installed rpms. It verifies the uniformity of the rpmsinstalled on the compute nodes. If the head node is also a compute node, the test also verifies that rpms installed onthe compute nodes are also installed on the head node (although the head node may have additional rpms that are notpresent on the compute nodes.)
CONFIGURATION
exclude
The full name of a rpm to exclude from the check. Note that this is the name returned by "rpm -q package", includingthe version number. May be specified multiple times to exclude more than one rpm.Default: none
rpm
Explicitly verify that the specified rpm is installed. May be specified multiple times to include more than one rpm.Default: none
Example
<rpm><exclude>rpm-4.3.3-9_nonptl</exclude><exclude>xterm-192-1</exclude><rpm>mpich-ch_p4-gcc-oscar-module-1.2.7-4</rpm>
</rpm>
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
rpm
137
Intel R©Cluster Checker 1.3u2 sh
101 sh
Check the Bourne Shell
DESCRIPTION
sh is an Intel(R) Cluster Checker module to verify the Bourne Shell. The module tests if/bin/sh exists and runs a’Hello World’ script.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
/bin/sh
138
Intel R©Cluster Checker 1.3u2 shm_mount
102 shm_mount
Check that the shared memory device (/dev/shm) is mounted.
DESCRIPTION
shm_mount is an Intel(R) Cluster Checker module to verify that /dev/shm is mounted correctly.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
mount
139
Intel R©Cluster Checker 1.3u2 single_authentication
103 single_authentication
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that the cluster authentication process meets requirements.
METHOD
ssh to every node from every other node and run a simple command.
CONFIGURATION
None
MODULE CLASS
matrix
DEPENDENCIES
ping_alltoall
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ssh
140
Intel R©Cluster Checker 1.3u2 speedstep
104 speedstep
Check state of the Intel SpeedStep(R) Technology
DESCRIPTION
speedstep is an Intel(R) Cluster Checker module that verifies the state of the Intel SpeedStep(R) Technology. TheLinux kernel support for Intel SpeedStep(R) Technology is provided by the cpufreq subsystem.
CONFIGURATION
state
The required state of the Intel SpeedStep(R) Technology.Default: false
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
test
uname
141
Intel R©Cluster Checker 1.3u2 ssh
105 ssh
Check the ssh connectivity of all nodes
DESCRIPTION
ssh.pm is an Intel(R) Cluster Checker module to verify that all nodes can be reached via ssh from the system runningIntel(R) Cluster Checker.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ping
EXTERNAL DEPENDENCIES
echo
142
Intel R©Cluster Checker 1.3u2 ssh_alltoall
106 ssh_alltoall
Check that every node can ssh to all other nodes
DESCRIPTION
ssh_alltoall is an Intel(R) Cluster Checker module to verify that every node can ssh to every other node in the cluster.
CONFIGURATION
None
MODULE CLASS
matrix
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ssh
143
Intel R©Cluster Checker 1.3u2 ssh_version
107 ssh_version
Check the ssh version uniformity of all nodes
DESCRIPTION
ssh_version is an Intel(R) Cluster Checker module to verify that all nodes have the same ssh version.
CONFIGURATION
ssh-path
The base path to the SSH commandDefault: /usr/bin
Example
<ssh_version><ssh-path>/usr/bin</ssh-path>
</ssh_version>
MODULE CLASS
vector
DEPENDENCIES
ping
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ssh
144
Intel R©Cluster Checker 1.3u2 stray_uids
108 stray_uids
Check that all files in a directory are owned by a known user and group
DESCRIPTION
stray_uids is an Intel(R) Cluster Checker module to verify that all files in a directory (or several directories) are ownedby known users and groups.
CONFIGURATION
dir
A directory to be checked for stray UIDs and GIDs. May be specified multipe times to check more than one directory.Default : /tmpif, and only if, no dir options are specified. If any directories are listed in the config file, /tmp willnotbe checked, unless it is also explicitly listed.
Example
<stray_uids><dir>/tmp</dir><dir>/var/tmp</dir><dir>/home</dir><dir>/var/log</dir>
</stray_uids>
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
stat
145
Intel R©Cluster Checker 1.3u2 subnet_manager
109 subnet_manager
Check the InfiniBand subnet manager
DESCRIPTION
subnet_manager is an Intel(R) Cluster Checker module to verify that the software InfiniBand subnet manager is run-ning on one, and only one, node.
CONFIGURATION
command
The name of the subnet manager processDefault: opensm
Example
<subnet_manager><command>opensm</command>
</subnet_manager>
MODULE CLASS
vector
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
ps
NOTES
Does not consider that subnet manager may be running on a node external to the cluster. Does not consider that theInfiniBand switch may have a hardware subnet manager.
146
Intel R©Cluster Checker 1.3u2 system_memory
110 system_memory
Check the uniformity of the total physical memory and swap memory
DESCRIPTION
system_memory is an Intel(R) Cluster Checker module to verify system memory. The module checks both physicalmemory and swap space.
CONFIGURATION
physical
The amount of physical memory, in KB.Default: median of the collected values
physical_threshold
Maximum absolute deviation from the expected amount of physical memory that is allowable, in KB.Default: 100
swap
The amount of swap memory, in KB.Default: median of the collected values
swap_threshold
Maximum absolute deviation from the expected amount of swap memory that is allowable, in KB.Default: 100
Example
<system_memory><physical>6154464</physical><swap>2040208</swap>
</system_memory>
MODULE CLASS
vector
DEPENDENCIES
ssh
mount_proc
genuine_intel
147
Intel R©Cluster Checker 1.3u2 tcl_8_4_7
111 tcl_8_4_7
Check Intel(R) Cluster Ready compliance
DESCRIPTION
Check that Tcl meets requirements.
METHOD
Compare the Tcl patchlevel to 8.4.7.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
tclsh
148
Intel R©Cluster Checker 1.3u2 tcsh
112 tcsh
Check the enhanced C Shell
DESCRIPTION
tcsh is an Intel(R) Cluster Checker module to verify the enhanced C Shell. The module tests if/bin/tcsh existsand runs a ’Hello World’ script.
CONFIGURATION
None
MODULE CLASS
unit
DEPENDENCIES
ssh
tmp
genuine_intel
EXTERNAL DEPENDENCIES
/bin/tcsh
149
Intel R©Cluster Checker 1.3u2 tmp
113 tmp
Check the permissions on /tmp
DESCRIPTION
tmp is an Intel(R) Cluster Checker module to verify the permissions on /tmp are correct (by default: 1777.)
CONFIGURATION
sticky
If true, consider 0777 to also be correct.Default: false
Example
<tmp><sticky/>
</tmp>
MODULE CLASS
unit
DEPENDENCIES
ssh
genuine_intel
EXTERNAL DEPENDENCIES
stat
NOTES
Assumes that the temporary directory is named /tmp.
150
Intel R©Cluster Checker 1.3u2 uid_sync
114 uid_sync
Check the uniformity of the user and group database
DESCRIPTION
uid_sync is an Intel(R) Cluster Checker module to verify the synchronization of users and group information acrossthe cluster. The comparison includes all attributes that are returned bygetpwent andgetgrent .
CONFIGURATION
minuid
The minimum uid to checkDefault: 500
mingid
The minimum gid to checkDefault: 500
Example
<uid_sync><mingid>1000</mingid>
</uid_sync>
MODULE CLASS
vector
DEPENDENCIES
ssh
perl
genuine_intel
EXTERNAL DEPENDENCIES
perl
151