RH-SSO Authentication Fails When Cluster Node Loses Connection

Solution Verified - Updated -

Issue

  • Authentication Fails
  • Log repeatedly shows errors like

    ERROR [org.keycloak.services.error.KeycloakErrorHandler] (default task-1) Uncaught server error: org.infinispan.util.concurrent.TimeoutException: ISPN000476: Timed out waiting for responses for request 116 from RHSSO_N
    
    ERROR [org.infinispan.interceptors.impl.InvocationContextInterceptor] (timeout-thread--p14-t1) ISPN000136: Error executing command PutKeyValueCommand, writing keys [c6bba6e2-ca6d-426a-8230-33ad9712848b]: org.infinispan.util.concurrent.TimeoutException: ISPN000476: Timed out waiting for responses for request 116 from RHSSO_N
    
  • After repeated timeout errors, the log still shows the failed member in the cluster topology

    [Context=sessions] ISPN100002: Starting rebalance with members [RHSSO_1.... RHSSO_N], phase READ_OLD_WRITE_ALL, topology id 22 
    

Environment

  • Red Hat Single Sign-On (RH-SSO) 7
  • Standalone Clustered Mode or Domain Clustered Mode
  • Distributed Caches have multiple owners:

            <distributed-cache name="sessions" owners="2"/>
            <distributed-cache name="authenticationSessions" owners="2"/>
            <distributed-cache name="offlineSessions" owners="2"/>
            <distributed-cache name="clientSessions" owners="2"/>
            <distributed-cache name="offlineClientSessions" owners="2"/>
            <distributed-cache name="loginFailures" owners="2"/>
    
  • JGroups TCP clustering stack

  • Connection times-out rather than failing when a the cluster member fails.

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content