After a recent OpenStack-Ansible (OSA) deployment on CentOS, I found that keepalived was not starting properly at boot time:
Keepalived_vrrp: Cant find interface br-mgmt for vrrp_instance internal !!! Keepalived_vrrp: Truncating auth_pass to 8 characters Keepalived_vrrp: VRRP is trying to assign ip address 172.29.236.11/32 to unknown br-mgmt interface !!! go out and fix your conf !!! Keepalived_vrrp: Cant find interface br-mgmt for vrrp_instance external !!! Keepalived_vrrp: Truncating auth_pass to 8 characters Keepalived_vrrp: VRRP is trying to assign ip address 192.168.250.11/32 to unknown br-mgmt interface !!! go out and fix your conf !!! Keepalived_vrrp: VRRP_Instance(internal) Unknown interface ! systemd: Started LVS and VRRP High Availability Monitor. Keepalived_vrrp: Stopped Keepalived: Keepalived_vrrp exited with permanent error CONFIG. Terminating
OSA deployments have a management bridge for traffic between containers. These containers run the OpenStack APIs and other support services. By default, this bridge is called
The keepalived daemon is starting before NetworkManager can bring up the
br-mgmt bridge and that is causing keepalived to fail. We need a way to tell systemd to wait on the network before bringing up keepalived.
Waiting on NetworkManager
There is a special systemd target,
network-online.target, that is not reached until all networking is properly configured. NetworkManager comes with a handy service called
NetworkManager-wait-online.service that must be complete before the
network-online target can be reached:
# rpm -ql NetworkManager | grep network-online /usr/lib/systemd/system/network-online.target.wants /usr/lib/systemd/system/network-online.target.wants/NetworkManager-wait-online.service
Start by ensuring that the
NetworkManager-wait-online service starts at boot time:
systemctl enable NetworkManager-wait-online.service
Next, we tell the keepalived service to wait on
network-online.target. Bring up an editor for overriding the
systemctl edit keepalived.service
Once the editor appears, add the following text:
[Unit] Wants=network-online.target After=network-online.target
Save the file in the editor and reboot the server. The keepalived service should come up successfully after NetworkManager signals that all of the network devices are online.
Learn more by reading the upstream NetworkTarget documentation.