- dbstore1008
- db2222
- db2221
- db2220
- db2218
- db2208
- db2200 backup source (Jaime will do it)
- db2198 backup source (Jaime will do it)
- db2187
- db2182
- db2168
- db2159
- db2150
- db1236
- db1227
- db1202
- db1194
- db1191
- db1181
- db1174
- db1171
- db1170
- db1158
- db1155
- clouddb1018
- clouddb1014
- an-redacteddb1001
Description
| Status | Subtype | Assigned | Task | ||
|---|---|---|---|---|---|
| Resolved | Marostegui | T382842 Upgrade to 10.6.20 and rebuild recentchanges and pagelinks tables | |||
| Resolved | Marostegui | T385550 Upgrade and rebuild s7 | |||
| Resolved | Marostegui | T387216 Switchover s7 master (db1181 -> db1236) | |||
| Resolved | Marostegui | T387270 Switchover s7 master (db2218 -> db2220) |
Event Timeline
Icinga downtime and Alertmanager silence (ID=f10b64ca-2317-4be0-8c0c-8c35e9d6e6c3) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db1236.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=5c4d1fa9-2ef4-47eb-b26c-fd569e00f9b7) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db2220.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=b270fdd1-ce8f-40a2-80b1-6a3bd47ff600) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db1236.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=rLMAS8fd61b9d5e3b-19ac-42de-8036-6bced9ba4555) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db1227.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=3be8e339-d92d-451f-afa7-bab775f89da2) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db2222.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=f9c4baf3-3bd9-4341-abde-76a3bd605895) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db2221.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=17c1b9b4-eac9-486a-b8dd-749dd8fcb051) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db1202.eqiad.wmnet
Mentioned in SAL (#wikimedia-operations) [2025-02-06T06:58:00Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2208 db1194 T385550', diff saved to https://phabricator.wikimedia.org/P73275 and previous config saved to /var/cache/conftool/dbconfig/20250206-065759-marostegui.json
Icinga downtime and Alertmanager silence (ID=37147450-48c5-4574-8497-d49abab5ac37) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db2208.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=29e48ba7-187c-480c-bdd7-9508f1fc6874) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db1194.eqiad.wmnet
Mentioned in SAL (#wikimedia-operations) [2025-02-06T12:57:14Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2159 db1191 T385550', diff saved to https://phabricator.wikimedia.org/P73312 and previous config saved to /var/cache/conftool/dbconfig/20250206-125713-marostegui.json
Icinga downtime and Alertmanager silence (ID=8fc176d3-1ac2-44d1-8d2d-d3d7172243ed) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db1191.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=88f2b604-9411-4170-a7ce-6b73465e5c27) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db2159.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=366e04db-d729-476f-b441-5fc93f056a2c) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db2150.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=e74f6823-c7ba-4085-9cab-3396df3d7e6f) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db1174.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=f9845655-7483-4179-a6fb-a5ed88d6308a) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db1170.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=2f80616d-811c-4cec-9bb9-4be68c5eed42) set by jynus@cumin1002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: upgrade and rebuild tables
db2200.codfw.wmnet
Mentioned in SAL (#wikimedia-operations) [2025-02-21T07:22:39Z] <jynus> rebuilding tables for db2200 T385550
Icinga downtime and Alertmanager silence (ID=f86e9ddb-675e-4f4c-bac3-2f568013928a) set by jynus@cumin1002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: upgrade and rebuild tables
db2198.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=0eb5c53d-4cad-442c-9972-328355758ff2) set by jynus@cumin1002 for 4 days, 0:00:00 on 2 host(s) and their services with reason: Table rebuilding ongoing
db[2198,2200].codfw.wmnet
Once these 2 finish (technically already upgraded), only db2199 will be missing to upgrade from 10.6.17 to 10.6.20 of the backup sources (not s7).
Backup sources are done, reassigning to @Marostegui for him to proceed or close (only db2199 is missing 10.6.20, which I will upgrade now, but that is out of scope of this ticket).
All innodb tables were rebuilt:
Icinga downtime and Alertmanager silence (ID=70c6ecbd-6a05-4a34-925b-e609312bb73d) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db1158.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=3e9877ea-2d47-4259-904b-9c710818c63d) set by root@cumin1002 for 12:00:00 on 1 host(s) and their services with reason: Index rebuild
db2218.codfw.wmnet