Errors in sshd services of the master nodes in ARO cluster
Environment
- Azure Red Hat OpenShift [ARO]
- 4.x
Issue
- Errors messages are seen in
sshd journal logsevery 5 seconds for thesshd serviceofmasternodes.
kex_exchange_identification: read: Connection reset by peer Sep xx xx:xx:xx xx-aro-azure-xxxxx--xxx-master-0 sshd[xxxx]: error: kex_exchange_identification: read: Connection reset by peer Sep xx xx:xx:xx xx-aro-azure-xxxxx--xxx-master-0 sshd[xxxx]: error: kex_exchange_identification: read: Connection reset by peer Sep xx xx:xx:xx xx-aro-azure-xxxxx--xxx-master-0 sshd[xxxx]: error:
Resolution
- The error messages can be safely ignored as these are just
heath probeson theinternal load balancer.
Root Cause
-
AROcluster creates aninternal load balancerin thecluster resource groupthat has 3load balancingrules, one for eachmasterforssh accessto master nodes. This is what allows in particularARO SREsto access master nodes in case of emergencies. -
There are
heath probeson thisinternal load balancerthat will initiate connections to eachmasternode onport 22every 5 seconds without key or whatever, so nosshconnection will be established in the end, and then will interrupt the connection. This generates an interrupted connection message from master node perspective that is seen in themasternode'ssshdjournal logs.
Diagnostic Steps
- Check the journal logs for
sshdservice.
# journalctl -u sshd.service -f
This solution is part of Red Hat’s fast-track publication program, providing a huge library of solutions that Red Hat engineers have created while supporting our customers. To give you the knowledge you need the instant it becomes available, these articles may be presented in a raw and unedited form.
Comments