HP Cluster Platform Interconnects v2010 Quadrics QsNetII Interconnect - Page 119

Using jtest to Identify Link Errors



"<module>: clock: <clock message>"
This message indicates a change in the interconnect clock signal. A clock failure means the clock frequency has drifted +/-1.0 MHz from 656 MHz, while clock ok means the clock frequency has returned to within specification.
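The tolerance described above can be illustrated with a minimal sketch. The function name `clock_state` and the treatment of a drift of exactly 1.0 MHz as a failure are assumptions made for illustration; the guide states only the 656 MHz nominal frequency and the +/-1.0 MHz figure.

```python
NOMINAL_MHZ = 656.0    # nominal interconnect clock frequency given in the text
TOLERANCE_MHZ = 1.0    # drift in either direction mentioned in the text


def clock_state(measured_mhz: float) -> str:
    """Classify a measured clock frequency as 'clock ok' or 'clock failure'.

    Hypothetical helper: it mirrors the wording of the log message only,
    not any actual swmlogger or firmware interface.
    """
    drift = abs(measured_mhz - NOMINAL_MHZ)
    # Assumption: a drift of exactly 1.0 MHz already counts as a failure.
    return "clock failure" if drift >= TOLERANCE_MHZ else "clock ok"


if __name__ == "__main__":
    for mhz in (656.0, 656.4, 657.0, 654.9):
        print(f"{mhz:.1f} MHz -> {clock_state(mhz)}")
```

A frequency that was previously out of range and is classified as clock ok again corresponds to the "returned to within specification" case.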
12.7.2 Using swmlogger for Production Mode Testing
Use the output from the swmlogger to monitor the system during production mode, as described by the flowchart in Figure 12-1.
Figure 12-1: Production Mode Testing Using swmlogger (HPTC-0025)

1. Start. Verify that the swmlogger is configured and running.
2. Is it configured and running?
   No: Configure the swmlogger.
   Yes: Check the log file account where swmlogger is configured to send alerts and error messages.
3. Are there any errors?
   No: Feel assured that the machine appears to be operating correctly. Continue operation until the next scheduled drain time test.
   Yes: Go to step 4.
4. Are they critical?
   No: Continue operation until the next scheduled drain time test.
   Yes: Schedule immediate downtime to diagnose and repair the cluster.
5. End.
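The decision flow of Figure 12-1 can also be written down as a small function. This is only a sketch: `next_action` and its three boolean inputs are hypothetical names, the operator still has to determine each answer by inspecting the system and the swmlogger output, and the non-critical branch is an assumption about how the flowchart is routed.

```python
def next_action(configured_and_running: bool,
                errors_found: bool = False,
                errors_critical: bool = False) -> str:
    """Return the next operator action from the Figure 12-1 flowchart.

    Hypothetical helper for illustration; it does not query swmlogger.
    """
    if not configured_and_running:
        return "Configure the swmlogger."
    if not errors_found:
        # The machine appears to be operating correctly.
        return "Continue operation until the next scheduled drain time test."
    if errors_critical:
        return "Schedule immediate downtime to diagnose and repair the cluster."
    # Assumption: non-critical errors wait for the next scheduled drain time test.
    return "Continue operation until the next scheduled drain time test."


if __name__ == "__main__":
    print(next_action(False))
    print(next_action(True, errors_found=False))
    print(next_action(True, errors_found=True, errors_critical=True))
```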
12.8 Using jtest to Identify Link Errors
You can use the jtest utility with its errors option to detect link errors. The jtest> errors command shows any errors detected on the interconnect registers
Maintenance and Diagnostic Procedures
12-17