Can't launch any instances after new OSP10 Install
Issue
- Unable to launch instances, observe errors in nova-compute.log:
2017-01-19 14:45:17.800 745011 ERROR nova.compute.manager MessagingTimeout: Timed out waiting for a reply to message ID <uuid>
- There are corresponding errors in nova-conductor.log:
2017-01-19 15:22:29.666 728782 WARNING oslo_messaging._drivers.amqpdriver [req-797c738d-0219-4686-9d95-ff41545498b2 - - - - -] reply_<id> doesn't exists, drop reply to <uuid>
2017-01-19 15:22:29.667 728782 INFO oslo_messaging._drivers.amqpdriver [req-797c738d-0219-4686-9d95-ff41545498b2 - - - - -] The reply <uuid> cannot be sent reply_<id> reply queue don't exist after 60 sec abandoning...
- Errors and failure continue despite restarting nova-conductor service
- One of the controllers is not in the rabbitmq cluster:
From controller-0:
[root@overcloud-controller-0 ~]# rabbitmqctl cluster_status
Cluster status of node 'rabbit@overcloud-controller-0' ...
[{nodes,[{disc,['rabbit@overcloud-controller-0',
'rabbit@overcloud-controller-2']}]},
{running_nodes,['rabbit@overcloud-controller-2',
'rabbit@overcloud-controller-0']},
{cluster_name,<<"rabbit@overcloud-controller-2.localdomain">>},
{partitions,[]},
{alarms,[{'rabbit@overcloud-controller-2',[]},
{'rabbit@overcloud-controller-0',[]}]}]
From controller-1:
[root@overcloud-controller-1 ~]# rabbitmqctl cluster_status
Cluster status of node 'rabbit@overcloud-controller-1' ...
[{nodes,[{disc,['rabbit@overcloud-controller-1']}]},
{running_nodes,['rabbit@overcloud-controller-1']},
{cluster_name,<<"rabbit@overcloud-controller-1.localdomain">>},
{partitions,[]},
{alarms,[{'rabbit@overcloud-controller-1',[]}]}]
Environment
- Red Hat OpenStack Platform 10.0
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.