How do we tell if your server crashed?
I'm trying to look for a key word or phrase in any /var/log/* files which my Splunk event manager could pick up on to tell me if a server crashed over night and rebooted.
I don't want to just check for reboots, we do those WAY too much these days with all the patching going on.
What do I search on to find this event? Is there anything to configure in kdump that'll help?
Responses
Unfortunately there is no single perfect solution/way to get server crashed event. If system is unexpectedly crashed then /var/log will not have any clue about same ......
You can configure kdump and enable sysctl parameter which will help capturing vmcore .....
That doesn't so much inform you of every crash, however. What it tells you is:
* The server has been configured to collect core files (many organizations explicitly disable this for various reasons)
* A server that was configured to collect crash-cores was actually able to recover a core-file post-crash ...which isn't a 100% occurrence.
Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.
