HP Cluster Platform Interconnects v2010 Quadrics QsNetII Interconnect - Page 131

Troubleshooting Node Problems

Page 131 highlights

13 Troubleshooting Nodes and Links The following techniques for troubleshooting are intended to assist you in identifying symptoms of common problems and to direct you towards appropriate diagnostics: • Node troubleshooting, (see Section 13.1). • Link cable troubleshooting, (see Section 13.2). 13.1 Troubleshooting Node Problems Node powers up but is otherwise dead or fails early in the BIOS It is possible that the QM500 network adapter is making a poor electrical connection in its PCI slot. The BIOS may report an NMi, parity error, or SERR. Proceed as follows: 1. Open the case and inspect the PCI connection to make sure it is well-seated and not under any strain. Sometimes a case can be poorly aligned and this causes the PCI card to bend slightly in its slot. This can be seen by looking carefully along the edges of the card 2. Remove the card from the connector and inspect the PCI slot on the bus. A flashlight is useful to check for the proper placement of the PCI connector pins and any pieces of debris that might be affecting the connection 3. Check the PCI edge connector on the QM500 network adapter. It may be dirty or look old (that is, not shiny). If so, carefully clean the connector either with cleaning alcohol or a pencil-top eraser 4. Reseat the card, making sure it is not put under tension as it goes back into the PCI slot 5. If the node boots, check dmesg to see if the driver detects the card. If found, try testing the card with qselantest. Node claims to have no QM500 network adapter You might see this problem when using the dmesg command if the QM500's driver displays a message similar to the following: elan: no cards found. Proceed as follows: 1. Check the green LED on the node's QM500 network adapter or other signs that the QM500 is powered on. If the red LED is also lit, this is not a sign of error at this point. If there are no signs of life and the node is connected up with a cable that can be shown to be good, there may be a power problem on the card. 2. Power the node off and check the card seating and PCI connector. It is possible that the card is badly seated and is failing to respond on the PCI bus. Troubleshooting Nodes and Links 13-1

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166

13
Troubleshooting Nodes and Links
The following techniques for troubleshooting are intended to assist you in
identifying symptoms of common problems and to direct you towards appropriate
diagnostics:
Node troubleshooting, (see Section 13.1).
Link cable troubleshooting, (see Section 13.2).
13.1 Troubleshooting Node Problems
Node powers up but is otherwise dead or fails early in the BIOS
It is possible that the QM500 network adapter is making a poor electrical
connection in its PCI slot. The BIOS may report an
NMi
, parity error, or
SERR
.
Proceed as follows:
1.
Open the case and inspect the PCI connection to make sure it is well-seated
and not under any strain. Sometimes a case can be poorly aligned and this
causes the PCI card to bend slightly in its slot. This can be seen by looking
carefully along the edges of the card
2.
Remove the card from the connector and inspect the PCI slot on the bus. A
flashlight is useful to check for the proper placement of the PCI connector pins
and any pieces of debris that might be affecting the connection
3.
Check the PCI edge connector on the QM500 network adapter. It may be dirty
or look old (that is, not shiny). If so, carefully clean the connector either with
cleaning alcohol or a pencil-top eraser
4.
Reseat the card, making sure it is not put under tension as it goes back into
the PCI slot
5.
If the node boots, check
dmesg
to see if the driver detects the card. If found,
try testing the card with
qselantest
.
Node claims to have no QM500 network adapter
You might see this problem when using the
dmesg
command if the QM500’s driver
displays a message similar to the following:
elan: no cards found.
Proceed as follows:
1.
Check the green LED on the node’s QM500 network adapter or other signs that
the QM500 is powered on. If the red LED is also lit, this is not a sign of error
at this point. If there are no signs of life and the node is connected up with a
cable that can be shown to be good, there may be a power problem on the card.
2.
Power the node off and check the card seating and PCI connector. It is possible
that the card is badly seated and is failing to respond on the PCI bus.
Troubleshooting Nodes and Links
13-1