HP StorageWorks 2/16V HP StorageWorks Fabric OS 5.X Procedures User Guide (AA- - Page 144

Gathering information for technical support, Troubleshooting questions

Page 144 highlights

Gathering information for technical support If you are troubleshooting a production system, you need to gather data quickly. As soon as a problem is observed, perform the following tasks (if you are using a dual-CP system, run the commands on both CPs): 1. Issue the supportSave command to save RASLOG, TRACE, and supportShow (active CP only) information for the local CP to a remote FTP location. On a dual-CP system, only the local CP information is saved and supportShow information is not available on the active CP. For details about the supportSave command, see "Viewing and saving diagnostic information" on page 140 and "Setting up automatic trace dump transfers" on page 140. 2. Issue the pdShow command to display data from a panic dump file. The panic dump file contains information that might be useful to determine the cause of the system panic. 3. Issue the saveCore command to save or remove core files created by daemons. For details about these commands, see the HP StorageWorks Fabric OS 5.x command reference guide. Troubleshooting questions Common steps to take and questions to ask when troubleshooting a system problem are the following: 1. Determine the current Fabric OS level. 2. Determine the switch hardware version. 3. Determine whether the switch is operational. 4. Answer the following impact assessment and urgency questions: • Is the switch down? • Is it a standalone switch? • How large is the fabric? • Is the fabric redundant? 5. Run the supportSave command. See "Viewing and saving diagnostic information" on page 140 and "Setting up automatic trace dump transfers" on page 140. 6. Document the sequence of events by answering the following questions: • What happened just prior to the problem? • Is the problem reproducible? • If so, what are the steps to produce the problem? • What configuration was in place when the problem occurred? 7. Determine whether a failover occurred. 8. Determine whether security was enabled. 9. Determine whether POST was enabled. 10.Determine whether serial port (console) logs were available. 11.Determine which CP blade was active. (This is applicable only to the Core Switch 2/64 and SAN Director 2/128.) 12.Determine what and when the last actions or changes were made to the system environment. Use the following steps to retrieve as much of the informational items as possible before contacting the SAN technical support vendor. 1. Determine the following switch information: • Serial number (located on the chassis) • WWN (obtain using licenseIdShow or wwn commands) • Fabric OS version (obtain using version command) • Switch configuration settings • Command supportSave output 144 Troubleshooting

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212
  • 213
  • 214
  • 215
  • 216
  • 217
  • 218
  • 219
  • 220
  • 221
  • 222
  • 223
  • 224
  • 225
  • 226
  • 227
  • 228
  • 229
  • 230
  • 231
  • 232
  • 233
  • 234
  • 235
  • 236
  • 237
  • 238
  • 239
  • 240
  • 241
  • 242
  • 243
  • 244
  • 245
  • 246
  • 247
  • 248

144
Troubleshooting
Gathering information for technical support
If you are troubleshooting a production system, you need to gather data quickly. As soon as a problem is
observed, perform the following tasks (if you are using a dual-CP system, run the commands on both CPs):
1.
Issue the
supportSave
command to save RASLOG, TRACE, and
supportShow
(active CP only)
information for the local CP to a remote FTP location.
On a dual-CP system, only the local CP information is saved and
supportShow
information is not
available on the active CP.
For details about the
supportSave
command, see ”
Viewing and saving diagnostic information
” on
page 140 and ”
Setting up automatic trace dump transfers
” on page 140.
2.
Issue the
pdShow
command to display data from a panic dump file.
The panic dump file contains information that might be useful to determine the cause of the system
panic.
3.
Issue the
saveCore
command to save or remove core files created by daemons.
For details about these commands, see the
HP StorageWorks Fabric OS 5.x command reference guide
.
Troubleshooting questions
Common steps to take and questions to ask when troubleshooting a system problem are the following:
1.
Determine the current Fabric OS level.
2.
Determine the switch hardware version.
3.
Determine whether the switch is operational.
4.
Answer the following impact assessment and urgency questions:
Is the switch down?
Is it a standalone switch?
How large is the fabric?
Is the fabric redundant?
5.
Run the
supportSave
command.
See ”
Viewing and saving diagnostic information
” on page 140 and ”
Setting up automatic trace dump
transfers
” on page 140.
6.
Document the sequence of events by answering the following questions:
What happened just prior to the problem?
Is the problem reproducible?
If so, what are the steps to produce the problem?
What configuration was in place when the problem occurred?
7.
Determine whether a failover occurred.
8.
Determine whether security was enabled.
9.
Determine whether POST was enabled.
10.
Determine whether serial port (console) logs were available.
11.
Determine which CP blade was active. (This is applicable only to the Core Switch 2/64 and SAN
Director 2/128.)
12.
Determine what and when the last actions or changes were made to the system environment.
Use the following steps to retrieve as much of the informational items as possible before contacting the
SAN technical support vendor.
1.
Determine the following switch information:
Serial number (located on the chassis)
WWN (obtain using
licenseIdShow
or
wwn
commands)
Fabric OS version (obtain using
version
command)
Switch configuration settings
Command
supportSave
output