Etcd failing to list members and timeouts with messages as "failed to send out heartbeat on time" in RedHat OpenShift Container Platform v3.9
Issue
-
Etcd unable to start and timing out in RedHat OpenShift Container Platform (OCP) v3.9.
-
Following messages are seen in etcd healthchecks
# etcdctl --cert-file /etc/origin/master/master.etcd-client.crt --key-file /etc/origin/master/master.etcd-client.key --ca-file /etc/origin/master/master. etcd-ca.crt --endpoints="https://XX.XX.XX.XX:2379,https://XX.XX.XX.XX:2379,https://XX.XX.XX:2379" cluster-health
cluster may be unhealthy: failed to list members
# etcdctl --cert-file /etc/origin/master/master.etcd-client.crt --key-file /etc/origin/master/master.etcd-client.key --ca-file /etc/origin/master/master. etcd-ca.crt --endpoints="https://XX.XX.XX.XX:2379,https://XX.XX.XX.XX:2379,https://XX.XX.XX.XX:2379" cluster-health
cluster may be unhealthy: failed to list members
Error: client: etcd cluster is unavailable or misconfigured; error #0: client: endpoint https://XX.XX.XX.XX:2379 exceeded header timeout
; error #1: client: endpoint https://XX.XX.XX.XX:2379 exceeded header timeout
; error #2: client: endpoint https://XX.XX.XX.XX:2379 exceeded header timeout
error #0: client: endpoint https://XX.XX.XX.XX:2379 exceeded header timeout
error #1: client: endpoint https://XX.XX.XX.XX:2379 exceeded header timeout
error #2: client: endpoint https://XX.XX.XX.XX:2379 exceeded header timeout
Environment
- Red Hat OpenShift Container Platform (OCP) v3.9
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.