We were wondering if the failover script can have a check that makes sure codfw -> eqiad replication is working and if not, stop.
Considering that eqiad is always the active DC (in an active-passive model as we have now) and codfw is the passive, replication codfw -> eqiad is normally disconnected.
This is usually fine, but it should not be the case if we are going to failover to codfw, in which case, replication needs to be enabled again, so eqiad receives the new keys (and purge the old ones) so once we switch back we don't run into incidents:
T206841
T206740
https://wikitech.wikimedia.org/wiki/Incident_documentation/20181016-eqiad_parsercache_empty_post-switchover