JGroups locked for ~15 minutes during high load and network glitch

Solution Verified - Updated -

Issue

  • During network connection failure the failure detection (FD) of JGroups is blocked and the cluster communication is badly affected
  • If the network connection is lost for some reason the cluster will 'hang' for 15 minutes until the instance is expelled, but from the JGroups settings it is expected to fail after about 40sec
  • If a node is blocked or disconnected during high traffic and should be expelled from the cluster this take longer than expected and the cluster will not work properly

Environment

  • Red Hat JBoss Enterprise Application Platform (EAP)
    • 7
  • Red Hat Data Grid (RHDG)
    • 7

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content