Networking and Router Problems in Red Hat OpenShift Container Platform
Issue
- Our ha-proxies have a pretty high restart count (> 300 in a few days). As they do not log much, I can just provide you those outputs. Any ideas how to debug this further? In the logs we see:
1201 08:42:28.395691 1 ratelimiter.go:50] error reloading router: wait: no child processes
- Occasionally we report
pods
which are failing to connect or resolve a service name within the OpenShift cluster. After a few seconds it starts to work again and we have no idea where this is coming from. - I'm having weird networking problems in my OpenShift cluster and I see the following messages reported on my nodes
Dec 06 09:39:11 ose3-node1 kernel: net_ratelimit: 119 callbacks suppressed
Dec 06 09:39:17 ose3-node1 kernel: net_ratelimit: 154 callbacks suppressed
Environment
- Red Hat OpenShift Container Platform 3.x
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.