HP Cluster Platform Interconnects v2010 Quadrics QsNetII Interconnect - Page 109

Using jtest to Check Interconnects

Page 109 highlights

The remaining columns represent the error counts against the named link. Use the-t option to control the display output, The following display shows the errors since the qsnetstat command starting sampling (history) and also the errors which have occurred since the last sample (delta): -Link Errors Summary---(History/Delta Name B C:L/Port State CRC Errs Clock Errs Data Errs Protocol Errs QR1N10 2 5:3 Intnl R ( 0/0 ) (11760/12) ( 0/0 ) ( 0/0 ) QR1T06 7 2 DLink R ( 0/0 ) (9811/9 ) ( 0/0 ) ( 0/0 ) QR1T05 7 12 DLink R ( 0/0 ) (9767/9 ) ( 0/0 ) ( 0/0 ) QR1T05 1 13 DLink N ( 0/0 ) (9767/9 ) ( 0/0 ) ( 0/0 ) QR1T02 5 11 DLink R ( 0/0 ) (9616/9 ) ( 136/0 ) ( 0/0 ) QR1N01 0 2 E66 N ( 301/0 ) ( 50/0 ) ( 195/0 ) ( 0/0 ) QR1N00 0 4 E4 N ( 237/0 ) ( 0/0 ) ( 0/0 ) ( 0/0 ) QR1T02 1 10 DLink R ( 185/1 ) ( 0/0 ) ( 975/3 ) ( 0/0 ) QR1N07 1 9 E473 N ( 0/0 ) (1746/0 ) ( 922/0 ) ( 0/0 ) QR1N13 5 7 ULink R ( 15/0 ) ( 0/0 ) ( 0/0 ) ( 0/0 ) (c)ount (d)elta (h)istory (s)ummary (a)ll (r)eset (z)ero +/-rate (4secs) 12.2.2 Using jtest to Check Interconnects Section 11.2 described how to run the jtest command by using a connection (such as telnet directly to the controller card's management firmware. When the interconnect management network is configured (as described in Section 9.3.2), you can launch the jtest command from the cluster's control node. Interconnect management scripts and commands are located in the /opt/qsnet/bin directory. You can manually determine the environmental status of an interconnect by using the jtest utility as follows 1. Launch the /opt/qsnet/bin/jtest utility remotely as described in Section 11.2. You can specify one or more interconnects by name, such as QR0T01, or you can use the -modules -1 option to run the jtest utility on all interconnects listed in the /etc/hosts file. For example: # /opt/qsnet/bin/jtest QR0N00 QR0N01 QR0N02 QR0N03 QR0T00 QR0T01 or jtest> modules -1 2. Use the following commands to obtain information from the selected interconnects: jtest> info 3. An information screen similar to the screen shown in Example 11-1 is displayed. 4. Use the following commands to obtain environmental information from the selected interconnects: jtest> env 5. Environmental information similar to the following is displayed: Slot: Temperature: Fan speeds: PSU status: jtest> 0123456789 35 30 23 21 4017 3792 3750 4066 4017 3970 on on Using the information displayed, you can verify that the following environmental parameters are within specification: Maintenance and Diagnostic Procedures 12-7

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166

The remaining columns represent the error counts against the named link. Use
the
-t
option to control the display output, The following display shows the errors
since the
qsnetstat
command starting sampling (history) and also the errors
which have occurred since the last sample (delta):
-Link Errors Summary---(History/Delta)---------------------------
Name
B C:L/Port State CRC Errs
Clock Errs Data Errs Protocol Errs
QR1N10 2 5:3 Intnl R
( 0/0 )
(11760/12) ( 0/0 )
( 0/0 )
QR1T06 7 2
DLink R
( 0/0 )
(9811/9 )
( 0/0 )
( 0/0 )
QR1T05 7 12 DLink R
( 0/0 )
(9767/9 )
( 0/0 )
( 0/0 )
QR1T05 1 13 DLink N
( 0/0 )
(9767/9 )
( 0/0 )
( 0/0 )
QR1T02 5 11 DLink R
( 0/0 )
(9616/9 )
( 136/0 ) ( 0/0 )
QR1N01 0 2
E66
N
( 301/0 ) ( 50/0 )
( 195/0 ) ( 0/0 )
QR1N00 0 4
E4
N
( 237/0 ) ( 0/0 )
( 0/0 )
( 0/0 )
QR1T02 1 10 DLink R
( 185/1 ) ( 0/0 )
( 975/3 ) ( 0/0 )
QR1N07 1 9
E473
N
( 0/0 )
(1746/0 )
( 922/0 ) ( 0/0 )
QR1N13 5 7
ULink R
( 15/0 )
( 0/0 )
( 0/0 )
( 0/0 )
<display truncated>
(c)ount (d)elta (h)istory (s)ummary (a)ll (r)eset (z)ero +/-rate (4secs)
12.2.2 Using jtest to Check Interconnects
Section 11.2 described how to run the
jtest
command by using a connection (such
as
telnet
directly to the controller card’s management firmware. When the
interconnect management network is configured (as described in Section 9.3.2),
you can launch the jtest command from the cluster’s control node. Interconnect
management scripts and commands are located in the
/opt/qsnet/bin
directory.
You can manually determine the environmental status of an interconnect by using
the
jtest
utility as follows
1.
Launch the
/opt/qsnet/bin/jtest
utility remotely as described in
Section 11.2.
You can specify one or more interconnects by name, such as QR0T01, or you
can use the
-modules -1
option to run the
jtest
utility on all interconnects
listed in the
/etc/hosts
file. For example:
#
/opt/qsnet/bin/jtest
QR0N00 QR0N01 QR0N02 QR0N03 QR0T00 QR0T01
or
jtest>
modules -1
2.
Use the following commands to obtain information from the selected
interconnects:
jtest>
info
3.
An information screen similar to the screen shown in Example 11-1 is
displayed.
4.
Use the following commands to obtain environmental information from the
selected interconnects:
jtest>
env
5.
Environmental information similar to the following is displayed:
Slot:
0123456789
Temperature: 35 30 23 21
Fan speeds:
4017 3792 3750 4066 4017 3970
PSU status:
on on
jtest>
Using the information displayed, you can verify that the following environmental
parameters are within specification:
Maintenance and Diagnostic Procedures
12-7