JGroups locked for ~15 minutes during high load and network glitch
Issue
- During network connection failure the failure detection (FD) of JGroups is blocked and the cluster communication is badly affected
- If the network connection is lost for some reason the cluster will 'hang' for 15 minutes until the instance is expelled, but from the JGroups settings it is expected to fail after about 40sec
- If a node is blocked or disconnected during high traffic and should be expelled from the cluster this take longer than expected and the cluster will not work properly
Environment
- Red Hat JBoss Enterprise Application Platform (EAP)
- 7
- Red Hat Data Grid (RHDG)
- 7
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.