Overcloud deploy fails intermittently

Solution Unverified - Updated -

Issue

  • Stack deployment fails intermittently with one of the controllers hanging during the early network setup phases. You can't ping it or SSH to it even though you can ssh/ping the 2 other controllers.
  • This error is seen in the journal of the affected controller:
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: [2017/06/14 12:16:43 PM] [INFO] running ifup on interface: vlan1042
Jun 14 12:16:43 overcloud-controller-0.localdomain ovs-vsctl[43036]: ovs|00001|vsctl|INFO|Called as ovs-vsctl -t 10 -- --if-exists del-port br-ex vlan1042 -- add-port br-ex vlan1042 tag=1042 -- set Interface vlan1042 type=internal
Jun 14 12:16:43 overcloud-controller-0.localdomain kernel: device vlan1042 entered promiscuous mode
Jun 14 12:16:43 overcloud-controller-0.localdomain NetworkManager[1914]: <info>  [1497457003.3219] manager: (vlan1042): new Generic device (/org/freedesktop/NetworkManager/Devices/185)
Jun 14 12:16:43 overcloud-controller-0.localdomain NetworkManager[1914]: <info>  [1497457003.3415] device (vlan1042): link connected
Jun 14 12:16:43 overcloud-controller-0.localdomain /etc/sysconfig/network-scripts/ifup-eth[43065]: Error, some other host already uses address 10.0.0.3.
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: Traceback (most recent call last):
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: File "/usr/bin/os-net-config", line 10, in <module>
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: sys.exit(main())
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: File "/usr/lib/python2.7/site-packages/os_net_config/cli.py", line 187, in main
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: activate=not opts.no_activate)
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: File "/usr/lib/python2.7/site-packages/os_net_config/impl_ifcfg.py", line 931, in apply
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: self.ifup(interface)
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: File "/usr/lib/python2.7/site-packages/os_net_config/__init__.py", line 244, in ifup
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: self.execute(msg, '/sbin/ifup', interface)
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: File "/usr/lib/python2.7/site-packages/os_net_config/__init__.py", line 224, in execute
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: processutils.execute(cmd, *args, **kwargs)
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: File "/usr/lib/python2.7/site-packages/oslo_concurrency/processutils.py", line 389, in execute
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: cmd=sanitized_cmd)
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: oslo_concurrency.processutils.ProcessExecutionError: Unexpected error while running command.
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: Command: /sbin/ifup vlan1042
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: Exit code: 1
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: Stdout: u'ERROR    : [/etc/sysconfig/network-scripts/ifup-eth] Error, some other host already uses address 10.0.0.3.\n'
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: Stderr: u''
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: + RETVAL=1
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: + [[ 1 == 2 ]]
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: + [[ 1 != 0 ]]
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: + echo 'ERROR: os-net-config configuration failed.'
Jun 14 12:16:43 overcloud-controller-0.localdomain os-collect-config[4012]: ERROR: os-net-config configuration failed.

Environment

  • Red Hat OpenStack Platform 10.0

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content