Supermicro motherboard IPMI common error codes

Incorrect memory DIMM population

Assertion: Incorrect memory DIMM population
Problem description: The memory is detected normally, but the server behaves abnormally when it is in use.
Cause: DIMMs of different models or capacities are installed on the motherboard.

Solution: Replace the memory so that all DIMMs are the same model and capacity.
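
One way to spot the odd DIMM before opening the chassis is to compare the size and part number of every populated slot. The sketch below assumes the inventory comes from `dmidecode -t memory` on Linux; the sample text, slot names, and part numbers are invented for illustration, and the helper names are hypothetical.

```python
import re
from collections import Counter

# Illustrative text in the layout of `dmidecode -t memory`.
# Slot names and part numbers are made up for this example.
SAMPLE = """\
Handle 0x0020, DMI type 17, 40 bytes
Memory Device
    Size: 16 GB
    Locator: P1-DIMMA1
    Part Number: EXAMPLE-16G

Handle 0x0021, DMI type 17, 40 bytes
Memory Device
    Size: 32 GB
    Locator: P1-DIMMB1
    Part Number: EXAMPLE-32G

Handle 0x0022, DMI type 17, 40 bytes
Memory Device
    Size: 16 GB
    Locator: P1-DIMMC1
    Part Number: EXAMPLE-16G

Handle 0x0023, DMI type 17, 40 bytes
Memory Device
    Size: No Module Installed
    Locator: P1-DIMMD1
    Part Number: NO DIMM
"""


def parse_dimms(dmidecode_text):
    """Return one dict per populated 'Memory Device' record."""
    dimms = []
    records = re.split(r"^Memory Device[ \t]*$", dmidecode_text, flags=re.M)[1:]
    for record in records:
        pairs = re.findall(r"^[ \t]*([^:\n]+):[ \t]*(.*)$", record, flags=re.M)
        fields = {key.strip(): value.strip() for key, value in pairs}
        size = fields.get("Size", "")
        if not size or size.startswith("No Module"):
            continue  # empty slot, nothing to compare
        dimms.append({
            "slot": fields.get("Locator", "?"),
            "size": size,
            "part": fields.get("Part Number", "?"),
        })
    return dimms


def mismatched_dimms(dimms):
    """Return the DIMMs whose (size, part) differs from the most common pair."""
    if not dimms:
        return []
    counts = Counter((d["size"], d["part"]) for d in dimms)
    common = counts.most_common(1)[0][0]
    return [d for d in dimms if (d["size"], d["part"]) != common]


if __name__ == "__main__":
    for dimm in mismatched_dimms(parse_dimms(SAMPLE)):
        print("Mismatched DIMM:", dimm["slot"], dimm["size"], dimm["part"])
```

On a real server, the text would come from running `dmidecode -t memory` as root. Treating the most common size and part number as the reference is only a heuristic, so the final decision about which DIMMs to replace still needs a manual check.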

CPU 1 UPI BUS Corrected Error - Assertion

Configuration error - CPU 1 UPI BUS Corrected Error - Assertion
Problem description: The server behaves abnormally, with blue screens and crashes.
Cause: A fault in CPU1 triggered the error.
Solution: Replace CPU1.

Power Supply input lost or out-of-range

Power Supply Failure detected. Predictive Failure. Power Supply input lost (AC/DC). Power Supply input lost or out-of-range. Configuration error.

Hardware environment: 2049U-TR4, Xeon 5120
System environment: Windows Server 2012 R2
Problem description: IPMI monitoring reports a power supply error.
Cause: Unknown.
Solution: Either of the following two methods can resolve it.

Method 1: Use "Restore factory default settings" in the IPMI settings.

Method 2: If Method 1 does not resolve it, update the BMC firmware.

Uncorrectable memory component found(P1-DIMMA1)

1. Multiple DIMMs are installed and one of them is not recognized:
Failing DIMM: DIMM location. (Uncorrectable memory component found) (P1-DIMMA1)
No memory DIMM detected, install memory DIMMs.
Error-No system memory is physically installed in the system. - Assertion

2. Only one DIMM is installed and it is not recognized:
Failing DIMM: DIMM location. (Uncorrectable memory component found) (P1-DIMMA1)
No memory DIMM detected, install memory DIMMs.
Error-No system memory is physically installed in the system. - Assertion
Memory training failure. (P1-DIMMA1)

Problem description: Not all of the installed memory is recognized. The IPMI log shows memory errors, and with a monitor connected, the BIOS self-test reports a memory error when the server is powered on.
Cause: A faulty DIMM.
Solution: Replace the DIMM named in the log.
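
To pull only the memory-related entries out of a long IPMI log, the event log can be filtered by keyword. This is a minimal sketch that assumes the log text comes from `ipmitool sel list`, which prints pipe-separated columns. The sample lines are invented for illustration, the function names are hypothetical, and entries with an unusual layout are simply skipped.

```python
# Illustrative lines in the pipe-separated layout of `ipmitool sel list`.
# The IDs, timestamps, and events are made up for this example.
SAMPLE_SEL = """\
   1 | 12/01/2023 | 10:15:32 | Memory #0xff | Uncorrectable ECC | Asserted
   2 | 12/01/2023 | 10:15:40 | Power Supply #0xc8 | Power Supply AC lost | Asserted
   3 | 12/02/2023 | 08:01:05 | Memory #0xff | Correctable ECC | Asserted
"""


def parse_sel(text):
    """Split each SEL line into id, date, time, sensor, and event text."""
    entries = []
    for line in text.splitlines():
        parts = [part.strip() for part in line.split("|")]
        if len(parts) < 5:
            continue  # blank line or a layout this sketch does not handle
        entries.append({
            "id": parts[0],
            "date": parts[1],
            "time": parts[2],
            "sensor": parts[3],
            "event": " | ".join(parts[4:]),
        })
    return entries


def filter_sel(entries, keyword):
    """Keep entries whose sensor or event text contains the keyword."""
    keyword = keyword.lower()
    return [
        entry for entry in entries
        if keyword in entry["sensor"].lower() or keyword in entry["event"].lower()
    ]


if __name__ == "__main__":
    for entry in filter_sel(parse_sel(SAMPLE_SEL), "memory"):
        print(entry["date"], entry["time"], entry["sensor"], entry["event"])
```

The same filter works with other keywords, for example "power supply" when chasing the power input errors described earlier.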

500 Internal Server error

Problem description:
When connecting to IPMI with a browser, a 500 Internal Server error is reported.
Connecting with IPMIView works and information can be read, but the remote console window cannot be opened.
Cause: Unknown.
Solution:
1. Cut power to the server completely, then power it back on.
2. Use ipmitool to reset the BMC so that it returns to normal. Reset command: ipmitool mc reset warm
I have consulted the motherboard manufacturer's technical engineers, and there is currently no better way to completely solve the problem than the methods above.
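
If the reset has to be run against several machines, the command above can be wrapped in a small script. The sketch below only builds and runs `ipmitool mc reset warm`. The remote options (`-I lanplus -H -U -P`) are standard ipmitool flags, while the function names and the example address and credentials are placeholders, not values from this case.

```python
import subprocess


def bmc_reset_command(kind="warm", host=None, user=None, password=None):
    """Build the ipmitool argument list for a BMC reset.

    With no host, the command talks to the local BMC; with a host,
    it goes over the network using the lanplus interface.
    """
    if kind not in ("warm", "cold"):
        raise ValueError("kind must be 'warm' or 'cold'")
    cmd = ["ipmitool"]
    if host is not None:
        if user is None or password is None:
            raise ValueError("user and password are required with host")
        cmd += ["-I", "lanplus", "-H", host, "-U", user, "-P", password]
    cmd += ["mc", "reset", kind]
    return cmd


def reset_bmc(**kwargs):
    """Run the reset and return ipmitool's output; raises if ipmitool fails."""
    result = subprocess.run(
        bmc_reset_command(**kwargs),
        capture_output=True, text=True, check=True,
    )
    return result.stdout


if __name__ == "__main__":
    # Placeholder address and credentials; print the command instead of running it.
    print(" ".join(bmc_reset_command(host="192.0.2.10", user="ADMIN", password="secret")))
```

Passing the password with `-P` leaves it visible in the process list, so for anything beyond a quick test it is safer to use ipmitool's `-E` option, which reads it from the `IPMI_PASSWORD` environment variable.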

Disk0 SMART failure - Assertion

Disk0 SMART failure - Assertion
Problem description: A server starts very slowly and often fails to boot normally.
Cause: Hard disk problem, confirmed by checking the disk with smartctl.

Solution: Replace the hard drive.
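
The smartctl check can also be scripted. This sketch assumes smartmontools 7.0 or later, where `smartctl -H -j <device>` prints JSON containing a `smart_status.passed` field. The function names and sample JSON are illustrative, and `/dev/sda` is only a placeholder device.

```python
import json
import subprocess


def smart_passed(smartctl_json_text):
    """Return True/False from smart_status.passed, or None if it is absent."""
    data = json.loads(smartctl_json_text)
    status = data.get("smart_status")
    if not isinstance(status, dict):
        return None
    return status.get("passed")


def check_disk(device):
    """Run smartctl on a device and report its overall health verdict."""
    # smartctl uses a non-zero exit status to flag a failing disk,
    # so the return code is deliberately not treated as an error here.
    result = subprocess.run(
        ["smartctl", "-H", "-j", device],
        capture_output=True, text=True,
    )
    return smart_passed(result.stdout)


if __name__ == "__main__":
    # Illustrative JSON fragments, not output captured from the failing server.
    print(smart_passed('{"smart_status": {"passed": false}}'))
    print(smart_passed('{"smart_status": {"passed": true}}'))
```

A result of `False` matches the "SMART failure" assertion in the IPMI log. `None` means the drive or controller did not report an overall status, in which case the full `smartctl -a` output is worth reading by hand.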

Upper Non-recoverable - going high - Assertion

Upper Non-recoverable - going high - Assertion
Upper Critical - going high - Assertion
Problem description: A server often crashes and restarts. In the IPMI log, every memory slot reports the same error even though only 4 DIMMs are installed. The memory temperature readings are also abnormal.

Cause: Memory problem.
Solution: Replace the memory.
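
The "going high" messages mean a sensor reading rose past one of its upper thresholds. As a rough sketch, the thresholds can be read from `ipmitool sensor`, which prints ten pipe-separated columns: name, reading, unit, status, then the lower non-recoverable, lower critical, lower non-critical, upper non-critical, upper critical, and upper non-recoverable limits. The sample values and function names below are invented for illustration, and treating "reading at or above the limit" as crossed is an assumption, since the exact comparison is up to the BMC.

```python
COLUMNS = ["name", "reading", "unit", "status",
           "lnr", "lcr", "lnc", "unc", "ucr", "unr"]

# Upper thresholds, ordered from most to least severe.
UPPER_LEVELS = [
    ("unr", "Upper Non-recoverable"),
    ("ucr", "Upper Critical"),
    ("unc", "Upper Non-critical"),
]


def _to_float(text):
    """Convert a column to float; 'na' and other non-numbers become None."""
    try:
        return float(text)
    except ValueError:
        return None


def parse_sensor_line(line):
    """Parse one line of `ipmitool sensor` output into a dict, or None."""
    parts = [part.strip() for part in line.split("|")]
    if len(parts) != len(COLUMNS):
        return None
    sensor = dict(zip(COLUMNS, parts))
    for key in ("reading", "lnr", "lcr", "lnc", "unc", "ucr", "unr"):
        sensor[key] = _to_float(sensor[key])
    return sensor


def upper_threshold_crossed(sensor):
    """Return the most severe upper threshold the reading has reached."""
    reading = sensor["reading"]
    if reading is None:
        return None
    for key, label in UPPER_LEVELS:
        limit = sensor[key]
        if limit is not None and reading >= limit:
            return label
    return None


if __name__ == "__main__":
    # Illustrative line; the sensor name and numbers are made up.
    line = ("P1-DIMMA1 Temp | 92.000 | degrees C | nr "
            "| 0.000 | 0.000 | 0.000 | 80.000 | 85.000 | 90.000")
    print(upper_threshold_crossed(parse_sensor_line(line)))
```

When every DIMM sensor reports an implausible reading at once, as in this case, the readings themselves are suspect, which fits the conclusion that the memory was at fault rather than the cooling.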

Origin blog.csdn.net/u010087338/article/details/134745286