HP Integrity rx2800 rx2800 i2 User Service Guide - Page 69
Troubleshooting, Methodology, General troubleshooting methodology
View all HP Integrity rx2800 manuals
Add to My Manuals
Save this manual to your list of manuals |
Page 69 highlights
5 Troubleshooting The purpose of this chapter is to provide a preferred methodology (strategies and procedures) and tools for troubleshooting the server error and fault conditions. Methodology General troubleshooting methodology There are multiple entry points to the troubleshooting process, dependent upon your level of troubleshooting expertise, the tools/processes/procedures which you have at your disposal, and the nature of the system fault or failure. Typically, you select from a set of symptoms, ranging from very simple (system LED is blinking) to the most difficult (Machine Check Abort (MCA)) has occurred. The following is a list of symptom examples: NOTE: Your output might differ from the output in the examples in this book depending on your server and its configuration. • Front panel LED blinking • System alert present on console • System won't power-up • System won't boot • Error/Event Message received • Machine Check Abort (MCA) occurred Narrow down the observed problem to the specific troubleshooting procedure required. Isolate the failure to a specific part of the server, so you can perform more detailed troubleshooting. For example: • Problem- Front panel LED blinking NOTE: The front panel health LEDs flash amber with a warning indication, or flash red with a fault indication. ◦ System Alert on console? Analyze the alert by using the system event log (SEL), to identify the last error logged by the server. Use the iLO 3 MP commands to view the SEL, either through the iLO 3 MP serial text interface, or through telnet, Secure Shell, or through the web GUI on the iLO 3 MP LAN. You should now have a good idea about which area of the system requires further analysis. For example, if the symptom was "system won't power-up", the initial troubleshooting procedure may indicate a problem with the dc power rail not coming up after the power switch was turned on. You have now reached the point where the failed CRU has been identified and needs to be replaced. Perform the specific removal and replacement procedure, and verification steps. NOTE: If multiple CRUs are identified as part of the solution, a fix cannot be guaranteed unless all identified failed CRUs are replaced. There may be specific recovery procedures you need to perform to finish the repair. For example, if the system board is replaced, you need to restore customer specific information. Methodology 69