Pacemaker failed Resources and Fence reboots on GFS2
We have a heavily used 4-node cluster running Pacemaker on RHEL 7.5. Are there any good Python or Perl reporting scripts or tools that people in the community use to manage and report on clusters like this?
Second, we have around 300 resources on the cluster, with many groups and applications running on different cluster nodes. We need to create a dependency map. Is there a best practice or a recommended way to handle this task?
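To make the request concrete, here is a rough sketch of the kind of dependency map we have in mind. It assumes the CIB has been dumped to XML (for example with `cibadmin --query`) and only looks at simple `rsc_order` constraints and the implicit start order inside groups. The sample XML, the resource names, and the `build_dependency_map` helper are all made up for illustration; resource sets, clones, and colocation or location constraints are not handled.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical, heavily simplified CIB fragment used only for illustration.
# On a real cluster this text would come from a CIB dump instead.
SAMPLE_CIB = """
<cib>
  <configuration>
    <resources>
      <group id="app_group">
        <primitive id="app_fs" class="ocf" provider="heartbeat" type="Filesystem"/>
        <primitive id="app_ip" class="ocf" provider="heartbeat" type="IPaddr2"/>
      </group>
      <primitive id="dlm" class="ocf" provider="pacemaker" type="controld"/>
      <primitive id="clvmd" class="ocf" provider="heartbeat" type="clvm"/>
    </resources>
    <constraints>
      <rsc_order id="o1" first="dlm" then="clvmd"/>
      <rsc_order id="o2" first="clvmd" then="app_group"/>
      <rsc_colocation id="c1" rsc="clvmd" with-rsc="dlm" score="INFINITY"/>
    </constraints>
  </configuration>
</cib>
"""


def build_dependency_map(cib_xml):
    """Return {resource: [resources that must start before it]}."""
    root = ET.fromstring(cib_xml)
    deps = defaultdict(set)

    # Explicit ordering constraints: "then" depends on "first".
    # Constraints that use resource sets have no first/then and are skipped.
    for order in root.iter("rsc_order"):
        first, then = order.get("first"), order.get("then")
        if first and then:
            deps[then].add(first)

    # Group members start in the order they are listed, so each member
    # depends on the one listed immediately before it.
    for group in root.iter("group"):
        members = [p.get("id") for p in group.findall("primitive")]
        for earlier, later in zip(members, members[1:]):
            deps[later].add(earlier)

    return {rsc: sorted(needs) for rsc, needs in deps.items()}


if __name__ == "__main__":
    for rsc, needs in sorted(build_dependency_map(SAMPLE_CIB).items()):
        print(f"{rsc} depends on: {', '.join(needs)}")
```

With around 300 resources, the resulting dictionary could then be fed into a graphing or reporting tool, which is the part we are hoping someone has already solved.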
Which logs are worth watching other than pcsd.log, corosync.log, and the messages files?
Thank you in advance for sharing your experience here.
-KG
Responses