Multiple servers reboot simultaneously
Environment
- Red Hat Enterprise Linux - All Releases - All architectures.
Issue
- Multiple servers reboot at the same time.
- No panic messages appear on screen.
- Kdump does not capture vmcore.
Resolution
- Moving the affected systems to UPS-backed power has prevented further reboots.
Root Cause
- Poor or unreliable power source.
Diagnostic Steps
- Kdump is configured and has been tested to work, but it does not capture a vmcore when these reboots occur.
- Servers have a single source of power (the power grid or a single UPS).
- If the system has a baseboard management controller such as iLO or DRAC, its event log may show messages similar to the following:
Caution iLO 2 04/04/2012 10:43 04/04/2012 10:43 1 Server reset.
0021 Caution 14:45 04/04/2012 14:45 04/04/2012 0001
LOG: POST Error: A Critical Error occurred prior to this power-up
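To confirm that the reboots really are simultaneous, it can help to compare the last boot time of each server. The following is a minimal Python sketch, not part of the original solution: the function name `group_simultaneous_boots`, the two-minute window, the host names, and the timestamps are all hypothetical and for illustration only. It assumes the boot times have already been collected from each host (for example from the output of `who -b`).

```python
from datetime import datetime, timedelta


def group_simultaneous_boots(boot_times, window=timedelta(minutes=2)):
    """Return lists of hosts whose boot times are chained within `window`.

    boot_times maps a host name to the datetime of its last boot.
    Hosts are sorted by boot time; a new group starts whenever the gap
    to the previously booted host exceeds `window`. Groups containing
    only one host are dropped.
    """
    ordered = sorted(boot_times.items(), key=lambda item: item[1])
    groups = []
    current = []
    for host, booted in ordered:
        # Start a new group if this host booted too long after the previous one.
        if current and booted - current[-1][1] > window:
            groups.append(current)
            current = []
        current.append((host, booted))
    if current:
        groups.append(current)
    return [[host for host, _ in group] for group in groups if len(group) > 1]


# Hypothetical sample data; replace with boot times gathered from real hosts.
boot_times = {
    "web01": datetime(2012, 4, 4, 10, 43, 5),
    "web02": datetime(2012, 4, 4, 10, 43, 40),
    "db01": datetime(2012, 4, 4, 10, 44, 10),
    "build01": datetime(2012, 3, 28, 8, 0, 0),
}

for group in group_simultaneous_boots(boot_times):
    print("Booted together:", ", ".join(group))
```

If the grouped servers also share a power feed or a single UPS, that supports a power problem rather than a software fault as the cause.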
This solution is part of Red Hat’s fast-track publication program, providing a huge library of solutions that Red Hat engineers have created while supporting our customers. To give you the knowledge you need the instant it becomes available, these articles may be presented in a raw and unedited form.
