
Migrate etcd ganeti VMs to plain disk template
Closed, Resolved (Public)

Description

etcd is sensitive to I/O latency, and DRBD protocol C (which ganeti uses to keep disks consistent between primary and secondary) increases that latency by definition, because a write is only acknowledged once it has reached both nodes. On top of that, a full DRBD re-sync from primary to secondary is sometimes required, and writes to etcd are throttled while it runs. The Kubernetes cluster could suffer as a result. Since etcd is a replicated datastore with HA built in, there is really no reason to add the extra layer of complexity and latency that DRBD brings. Switching all etcd VMs to the plain disk template would solve this issue. The drawback is that ganeti cluster maintenance operations become a bit more complicated: operators need to know that the etcd VMs are plain and leave them out of migrations.
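As a rough illustration of what the switch involves per VM, the sketch below builds the command sequence for converting instances one at a time, so that at most one etcd member is offline at any moment. It assumes the usual `gnt-instance shutdown`, `gnt-instance modify -t plain`, `gnt-instance start` sequence (the disk template can only be changed while the instance is stopped). The instance names are hypothetical, and the commands are only printed, not executed.

```python
from typing import List


def plain_conversion_plan(instances: List[str]) -> List[List[str]]:
    """Return the gnt-instance commands to convert each instance to 'plain'.

    Instances are handled strictly one after another so that at most one
    etcd member is offline at a time. The commands are only built here,
    not executed.
    """
    plan = []
    for name in instances:
        # The disk template can only be changed on a stopped instance.
        plan.append(["gnt-instance", "shutdown", name])
        plan.append(["gnt-instance", "modify", "-t", "plain", name])
        plan.append(["gnt-instance", "start", name])
    return plan


if __name__ == "__main__":
    # Hypothetical instance names, for illustration only.
    for cmd in plain_conversion_plan(["etcd-a.example.org", "etcd-b.example.org"]):
        print(" ".join(cmd))
```

Before moving on to the next member, an operator would want to confirm that the etcd cluster is healthy again; that check is deliberately not part of this sketch.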

Event Timeline

akosiaris claimed this task.
akosiaris updated the task description.

https://wikitech.wikimedia.org/wiki/Ganeti#VMs_without_DRBD_disk_template has been added to document and communicate the drawback.

All etcd VMs have been migrated to the plain disk template. Note that one more VM also uses the plain template, namely d-i-test, for which reserving the extra disk space makes no sense.
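For maintenance, the VMs to leave out of migrations can be found by asking ganeti for each instance's disk template. A minimal sketch, assuming the output of `gnt-instance list --no-headers --separator=, -o name,disk_template` has been captured as text; the function name and the sample instance names are hypothetical.

```python
from typing import List


def plain_instances(listing: str) -> List[str]:
    """Return the names of instances whose disk template is 'plain'.

    `listing` is expected to hold one "name,disk_template" pair per line.
    Blank lines are skipped.
    """
    names = []
    for line in listing.splitlines():
        line = line.strip()
        if not line:
            continue
        name, _, template = line.partition(",")
        if template.strip() == "plain":
            names.append(name.strip())
    return names


if __name__ == "__main__":
    # Hypothetical sample output, for illustration only.
    sample = (
        "etcd-a.example.org,plain\n"
        "web1.example.org,drbd\n"
        "d-i-test.example.org,plain\n"
    )
    print(plain_instances(sample))
```

Instances returned here have no DRBD secondary, so they need to be handled separately when a node is emptied for maintenance.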

Resolving this.