Why does neutron take so long to restart ?

Solution In Progress - Updated -

Environment

  • Red Hat Openstack Platform 7

Issue

During the process of scaling out the number of compute nodes in our RHOSP7, the stack update has failed, it can be seen that the neutron-server service takes>180 seconds to start up.

Resolution

Using HA and ‘auto_delete’ queues, the system gets eventually to a state where some channels don’t have associated connections (shown in the management UI as ‘unknown’) and calls to ‘rabbitmqctl list_channels’ do not return. This is caused by the queues blocking the channels during the termination, when the rabbit_mirror_queue_master does not return.

Currently the issue is being tracked in the following bugzilla:

This solution is part of Red Hat’s fast-track publication program, providing a huge library of solutions that Red Hat engineers have created while supporting our customers. To give you the knowledge you need the instant it becomes available, these articles may be presented in a raw and unedited form.