HP Cluster Platform Interconnects v2010 Quadrics QsNetII Interconnect - Page 114

Using the qsnet2_level_test Utility



The following two examples show a successful run on a 16-port interconnect:

# ./qsnet2_dmatest -a -e -se -dm -n10000 -RD 1m
mnfr_dmatest: master self=0 peer=0
source 0x2000000000602000 dest 0x200000000460a000 event 0x2000000000703fe0
done 0x2000000004607fa8
mnfr_dmatest: using route 'D' -> 00000000000000000000000000002008
0: reps 10000 min 1048576 max 1048576 inc 0 (batch 1)
dma test will run for approx. 10 secs. please wait..
0: 1048576 bytes 1145.02 uSec 915.77 MB/s

# ./qsnet2_dmatest -a -e -se -dm -n10000 -R4,D,D 1m
mnfr_dmatest: master self=0 peer=0
source 0x2000000000602000 dest 0x200000000460a000 event 0x2000000000703fe0
done 0x2000000004607fa8
mnfr_dmatest: using route '4,D,D' -> 0000000000000000000000000088000c
0: reps 10000 min 1048576 max 1048576 inc 0 (batch 1)
dma test will run for approx. 10 secs. please wait..
0: 1048576 bytes 1153.01 uSec 909.43 MB/s
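
The result line at the end of each run can be checked by hand: dividing the transfer size in bytes by the elapsed time in microseconds gives approximately the reported MB/s figure, which suggests that MB here means 10^6 bytes. The following Python sketch parses a result line and recomputes the bandwidth. The names parse_result and bandwidth_mb_s are hypothetical helpers, not part of the qsnet2_dmatest utility, and the line format is assumed from the sample output above.

```python
import re

# Assumed format of a result line, taken from the sample output, for example:
#   0: 1048576 bytes 1145.02 uSec 915.77 MB/s
RESULT_RE = re.compile(
    r"^(\d+):\s+(\d+) bytes\s+([\d.]+) uSec\s+([\d.]+) MB/s$"
)


def parse_result(line):
    """Return (size_bytes, usec, reported_mb_s), or None if not a result line."""
    match = RESULT_RE.match(line.strip())
    if match is None:
        return None
    return int(match.group(2)), float(match.group(3)), float(match.group(4))


def bandwidth_mb_s(size_bytes, usec):
    """Bytes per microsecond is numerically equal to 10^6 bytes per second."""
    return size_bytes / usec


if __name__ == "__main__":
    samples = (
        "0: 1048576 bytes 1145.02 uSec 915.77 MB/s",
        "0: 1048576 bytes 1153.01 uSec 909.43 MB/s",
    )
    for sample in samples:
        size, usec, reported = parse_result(sample)
        recomputed = bandwidth_mb_s(size, usec)
        print(f"{size} bytes: reported {reported:.2f} MB/s, "
              f"recomputed {recomputed:.2f} MB/s")
```

Because the printed time is rounded to two decimal places, the recomputed value can differ from the reported value in the last digit.
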

The following example simulates an error. The route is intended to test the level 3 chips in the network. However, because the 16-port interconnect has only two levels, the test fails and reports that the link has failed:

# ./qsnet2_dmatest -a -e -se -dm -n10000 -R4,4,D,D,D 1m
mnfr_dmatest: master self=0 peer=0
source 0x2000000000602000 dest 0x200000000460a000 event 0x2000000000703fe0
done 0x2000000004607fa8
mnfr_dmatest: using route '4,4,D,D,D' -> 000000000000000000000000888c000c
mnfr_dmatest: not making expected progress, checking errors
mnfr_dmatest: total errors reported during test: 0
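
When the output of many runs is captured to files, a failed run such as the one above can be recognized by the "not making expected progress" message and by the absence of a final result line. The following Python sketch applies that heuristic to captured output. The function name run_failed is hypothetical, and the rule is an assumption based only on the sample outputs in this section, not a documented exit condition of qsnet2_dmatest.

```python
# Message printed in the failed sample run in this section.
STALL_MARKER = "not making expected progress"


def run_failed(output):
    """Heuristic check of captured qsnet2_dmatest output.

    A run is treated as failed if the stall message appears, or if no
    result line (a line ending in "MB/s") was printed at all.
    """
    lines = [line.strip() for line in output.splitlines()]
    stalled = any(STALL_MARKER in line for line in lines)
    has_result = any(line.endswith("MB/s") for line in lines)
    return stalled or not has_result


if __name__ == "__main__":
    failed_run = (
        "mnfr_dmatest: using route '4,4,D,D,D' -> 000000000000000000000000888c000c\n"
        "mnfr_dmatest: not making expected progress, checking errors\n"
        "mnfr_dmatest: total errors reported during test: 0\n"
    )
    good_run = (
        "mnfr_dmatest: using route '4,D,D' -> 0000000000000000000000000088000c\n"
        "0: reps 10000 min 1048576 max 1048576 inc 0 (batch 1)\n"
        "0: 1048576 bytes 1153.01 uSec 909.43 MB/s\n"
    )
    print("failed run detected:", run_failed(failed_run))
    print("good run flagged:", run_failed(good_run))
```

Note that the failed sample reports zero errors, so the error count alone is not a reliable indicator in this case.
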

You can also use the qsnet2_dmatest utility to calculate a static route through the network. The route data is derived from the nodeid of the node on which you run the test. Use the -U option to calculate the static route as follows:

# qsnet2_dmatest -UN -se -dm -e 0 1m

Where N is an integer in the range 1 through 4. Specifying 1 tests only the first level of routes through the interconnect. Specifying 2 tests the first and second levels, and so on up to all four levels. The following sample output shows use of the -U option:

0: reps 1000 min 0 max 1048576 inc 0 (batch 1)
0: 0 bytes 2.67 uSec 0.00 MB/s
0: 4 bytes 2.78 uSec 1.44 MB/s
0: 8 bytes 2.77 uSec 2.89 MB/s
0: 16 bytes 2.80 uSec 5.72 MB/s
0: 32 bytes 2.79 uSec 11.46 MB/s
0: 64 bytes 2.84 uSec 22.54 MB/s
0: 128 bytes 3.01 uSec 42.48 MB/s
0: 256 bytes 3.11 uSec 82.39 MB/s
0: 512 bytes 3.39 uSec 151.03 MB/s
.
.
.
0: 1048576 bytes 1155.90 uSec 907.15 MB/s
Node Network Error Report:
- No Errors
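
To test each level in turn, the -U invocation described above can be generated for N = 1 through 4. The following Python sketch only builds the argument lists and prints them; it does not run the utility. The function name static_route_command is hypothetical, and the remaining arguments are copied unchanged from the example command in this section.

```python
def static_route_command(level):
    """Build the argument list for a -U test of the given level (1 through 4)."""
    if level not in (1, 2, 3, 4):
        raise ValueError("level must be an integer in the range 1 through 4")
    # Everything after -U<level> is taken verbatim from the documented example.
    return ["qsnet2_dmatest", f"-U{level}", "-se", "-dm", "-e", "0", "1m"]


if __name__ == "__main__":
    for level in range(1, 5):
        print("# " + " ".join(static_route_command(level)))
```

Each printed line can be run on the node under test. Passing the list to a process launcher instead of a shell avoids any quoting issues.
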
12.5 Using the qsnet2_level_test Utility
When running the HP XC operating environment, you can use the qsnet2_level_test utility to diagnose interconnect problems. To run the command on all nodes simultaneously, use the /cluster/bin/crsh command. If not all nodes complete the test, the command terminates after a timeout period, killing any incomplete processes.