eqiad
- db1255 (Dell - ready) - cloned in s8
- db1256 (Dell - ready) - cloned in s8
- db1257 (Supermicro - ready) - cloned in s8
codfw
- db2241 (Dell - ready) - cloned in s8
- db2242 (Dell - ready) - cloned in s8
- db2243 (Supermicro - ready) - cloned in s8
eqiad
codfw
| Status | Subtype | Assigned | Task | ||
|---|---|---|---|---|---|
| Unknown Object (Task) | |||||
| Resolved | Jhancock.wm | T379757 Q2:rack/setup/install db224[12] | |||
| Unknown Object (Task) | |||||
| Unknown Object (Task) | |||||
| Resolved | Jhancock.wm | T382425 Q2:rack/setup/install db2243 | |||
| Unknown Object (Task) | |||||
| Resolved | None | T351820 Move Wikidata term store to separate database cluster | |||
| Resolved | Jclark-ctr | T384979 Q3:rack/setup/install db1257 | |||
| Resolved | • Marostegui | T381475 Productionize x3 hosts | |||
| Resolved | • Marostegui | T388684 Test hot disk swap on Supermicro database hosts | |||
| Resolved | elukey | T377853 RAID monitoring on new hardware spec requires new or updated user space cli tool | |||
| Resolved | elukey | T383300 Alert in need of triage: Dell PowerEdge RAID Controller (instance ms-be1091) | |||
| Resolved | elukey | T383301 Alert in need of triage: Dell PowerEdge RAID Controller (instance thanos-be1005) | |||
| Restricted Task | |||||
| Resolved | • Marostegui | T390530 Create topology for x3 hosts | |||
| Resolved | • Marostegui | T393989 Productionize new x3 hosts |
db2243 is ready, which means all the hosts in codfw are ready. Only missing db1257 in eqiad to unblock this T384979
Change #1125029 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] installserver: Do not reimage db1255
Change #1125029 merged by Marostegui:
[operations/puppet@production] installserver: Do not reimage db1255
Change #1128746 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] valid_section.pp: Add x3
Change #1128746 merged by Marostegui:
[operations/puppet@production] valid_section.pp: Add x3
Change #1130976 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] db2243: Productionize
Change #1130976 merged by Marostegui:
[operations/puppet@production] db2243: Productionize
Mentioned in SAL (#wikimedia-operations) [2025-03-26T06:50:38Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2181 T381475', diff saved to https://phabricator.wikimedia.org/P74423 and previous config saved to /var/cache/conftool/dbconfig/20250326-065037-marostegui.json
Change #1131219 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] db2241: Productionize
Change #1131219 merged by Marostegui:
[operations/puppet@production] db2241: Productionize
Change #1131278 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] mariadb: Productionize db2242
Change #1131278 merged by Marostegui:
[operations/puppet@production] mariadb: Productionize db2242
Change #1131500 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] installserver: Do not reimage db2242
Change #1131500 merged by Marostegui:
[operations/puppet@production] installserver: Do not reimage db2242
Change #1131626 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] mariadb: Productionize db1255
Change #1131626 merged by Marostegui:
[operations/puppet@production] mariadb: Productionize db1255
Completed depool of db1211 - Depool db1211.eqiad.wmnet to then clone it to db1255.eqiad.wmnet - marostegui@cumin1002 - marostegui@cumin1002
Change #1132542 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] mariadb: Productionize db1256
Change #1132542 merged by Marostegui:
[operations/puppet@production] mariadb: Productionize db1256
Completed depool of db1211 - Depool db1211.eqiad.wmnet to then clone it to db1256.eqiad.wmnet - marostegui@cumin1002 - marostegui@cumin1002
Change #1132815 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] mariadb: Productionize db1257
Change #1132815 merged by Marostegui:
[operations/puppet@production] mariadb: Productionize db1257
Start pool of db1211 slowly with 10 steps - Pool db1211.eqiad.wmnet in after cloning - marostegui@cumin1002
Completed pool of db1211 slowly with 10 steps - Pool db1211.eqiad.wmnet in after cloning - marostegui@cumin1002
Change #1133345 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] instances.yaml: Add db1257
Change #1133345 merged by Marostegui:
[operations/puppet@production] instances.yaml: Add db1257
Mentioned in SAL (#wikimedia-operations) [2025-04-02T09:41:10Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Add db1257 to dbctl depooled T381475', diff saved to https://phabricator.wikimedia.org/P74555 and previous config saved to /var/cache/conftool/dbconfig/20250402-094109-marostegui.json
Change #1133347 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] db1257: Enable notifications
Change #1133347 merged by Marostegui:
[operations/puppet@production] db1257: Enable notifications
Change #1133348 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] db2243: Enable notifications
Change #1133348 merged by Marostegui:
[operations/puppet@production] db2243: Enable notifications
Change #1133349 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] instances.yaml: Add db2243
Change #1133349 merged by Marostegui:
[operations/puppet@production] instances.yaml: Add db2243
Mentioned in SAL (#wikimedia-operations) [2025-04-02T09:52:13Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Add db2243 to dbctl depooled T381475', diff saved to https://phabricator.wikimedia.org/P74557 and previous config saved to /var/cache/conftool/dbconfig/20250402-095213-marostegui.json
I am pooling db2243 and db1257 (both supermicro) into s8 to start also testing their performance.
I am going to close this. The hosts have data and further, db1257 and db2243 are serving reads in production.
Next steps is build the topology and adapt puppet for them, that will be tracked at T390530