One of the tricker parts of productionalizing a modern OpenStack cloud is how to minimize downtime at the tenant logical router level. Much progress is being made, but the grizzly and havana codebases are still heavily used and have very little mindshare surrounding how to best provide a highly available logical router agent environment.
In this talk I will discuss some of the strategies I have implemented, solicit suggestions or insight from others, and dive into what I believe are the biggest hurdles keeping stock L3 agent logical routers from being truly highly available.