Breakdown of servers and their config
I'll file a bug against the Debian OpenSSH package; this seems like a suitable candidate for a point release, as the patch is small and fixes a genuine bug.
Once T201342 is done, it seems like the best candidate for this.
Fri, Jun 21
Thu, Jun 20
Or maybe use one of cp1071-cp1074, the servers which were used for the original ATS tests? These were bought in 2015 and are currently unused.
Can we please move forward with ordering a replacement disk? The broken disk causes subtle errors for all fleet-wide Cumin/debdeploy runs that touch e.g. dpkg, as it stalls I/O almost indefinitely.
Wed, Jun 19
JFTR, this is implemented using the +needchange flag, e.g.
Add Puppet automation to bootstrap a KDC service from scratch on a node (caveat: this might only allow partial automation, since the KDC packages currently require manual input during installation)
Are you using component/node10? This should be fixed already; see https://phabricator.wikimedia.org/T215562#5066711 and followups.
Tue, Jun 18
Can we please move forward with the decom steps for at least 1013? This host has been down due to hardware trouble for nearly two months (T220907) and always shows up as failing in fleet-wide Cumin runs.
Tue, Jun 11
Fri, Jun 7
Note that the Ganeti reboots are a little different here, as we also need to load the new QEMU for the new instruction. To reboot a Ganeti instance, one needs to log in to the respective Ganeti master (ganeti2003.codfw.wmnet for codfw, ganeti1001.eqiad.wmnet for eqiad) and run
Thu, Jun 6
There are two angles to address:
Wed, Jun 5
Tue, Jun 4
Mon, Jun 3
Actually, this is probably entirely unused; Fabián pointed me to T212934.
prometheus-pdns-rec-exporter should be available for Stretch; it's used on the production recursors, which run Stretch:
If the server is pooled, wait for a lock on poolcounter (we can tune the allowed concurrency appropriately)
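The locking idea can be sketched locally as follows. This is only an illustration under stated assumptions: a `threading.BoundedSemaphore` stands in for poolcounter (it is not the poolcounter protocol or client), and `ConcurrencyLimiter`, `restart`, `run_demo` and the limit of 2 are hypothetical names and values chosen for the example.

```python
import threading
import time


class ConcurrencyLimiter:
    """Local stand-in for a poolcounter-style lock: at most `limit` holders."""

    def __init__(self, limit):
        self._slots = threading.BoundedSemaphore(limit)
        self._guard = threading.Lock()
        self._active = 0
        self.peak = 0  # highest number of simultaneous holders observed

    def __enter__(self):
        self._slots.acquire()  # blocks until a slot is free
        with self._guard:
            self._active += 1
            self.peak = max(self.peak, self._active)
        return self

    def __exit__(self, exc_type, exc, tb):
        with self._guard:
            self._active -= 1
        self._slots.release()
        return False  # do not swallow exceptions from the action


def restart(server, pooled, limiter, action):
    """Run action(server); pooled servers wait for a slot first."""
    if pooled:
        with limiter:
            action(server)
    else:
        action(server)


def run_demo(limit=2, servers=6):
    """Restart `servers` fake pooled hosts with at most `limit` in flight."""
    limiter = ConcurrencyLimiter(limit)
    done = []

    def fake_restart(name):
        time.sleep(0.01)  # placeholder for the real restart work
        done.append(name)

    threads = [
        threading.Thread(
            target=restart, args=(f"srv{i}", True, limiter, fake_restart)
        )
        for i in range(servers)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return limiter.peak, sorted(done)
```

Calling `run_demo(limit=2, servers=6)` completes all six fake restarts while never holding more than two slots at once; the `limit` argument is the knob that corresponds to the tunable concurrency.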
If TensorFlow works fine without hsa-ext-rocr-dev, we also have a third option, which seems cleaner and easier:
- Import the existing repository (sans hsa-ext-rocr-dev) to a new thirdparty/rocm component
- Create a dummy hsa-ext-rocr-dev deb using https://packages.debian.org/stable/equivs and import that to component/rocm
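For the dummy package, a rough sketch of generating the equivs control file could look like the following. The version string and description are placeholders (assumptions, not taken from the actual ROCm repository), and the version would presumably need to satisfy whatever the other ROCm packages depend on. The resulting file would then be passed to `equivs-build`.

```python
def equivs_control(package, version, description):
    """Render a minimal control file in the layout produced by equivs-control."""
    lines = [
        "Section: misc",
        "Priority: optional",
        "Standards-Version: 3.9.2",
        "",
        f"Package: {package}",
        f"Version: {version}",
        f"Description: {description}",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical values: only the package name comes from the discussion above.
control_text = equivs_control(
    package="hsa-ext-rocr-dev",
    version="1.0~dummy1",
    description="dummy package replacing hsa-ext-rocr-dev",
)

if __name__ == "__main__":
    # Write the file, then build the .deb with: equivs-build hsa-ext-rocr-dev.control
    with open("hsa-ext-rocr-dev.control", "w") as fh:
        fh.write(control_text)
```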
Wed, May 29
Apache should be harmless; it's just different versions of Apache 2.4. I vaguely remember an issue with something requiring PHP5, though, but I might be completely off track here.