
Jclark-ctr (John Clark)
User

User Details

User Since
Jul 24 2019, 8:11 PM (358 w, 5 d)
Availability
Available
LDAP User
Jclark-ctr
MediaWiki User
Jclark-ctr

Recent Activity

Today

Jclark-ctr claimed T418928: Q3:rack/setup/install frdev1003.
Tue, Jun 9, 6:29 PM · SRE, fundraising-tech-ops, ops-eqiad, DC-Ops
Jclark-ctr added a comment to T421484: decommission parsoidtest1001.eqiad.wmnet.

@MoritzMuehlenhoff if the current location is OK, could it just be renamed and reimaged by the service owner?

Tue, Jun 9, 2:58 PM · Infrastructure-Foundations, decommission-hardware
Jclark-ctr added a comment to T428542: db1237 not rebooting.

@Marostegui. Dell did just respond.

Tue, Jun 9, 1:24 PM · SRE, DC-Ops, ops-eqiad, DBA
Jclark-ctr closed Unknown Object (Task), a subtask of T418012: eqiad row A/B switch upgrade, as Resolved.
Tue, Jun 9, 12:44 PM · Infrastructure-Foundations, netops, DC-Ops, SRE, ops-eqiad
Jclark-ctr added a comment to T428542: db1237 not rebooting.

Yeah, unfortunately we haven't had any luck obtaining replacement parts. Some of the earlier failures may not have had Dell support tickets submitted when this issue first occurred, so Dell has treated these cases as first-time incidents.

Tue, Jun 9, 11:51 AM · SRE, DC-Ops, ops-eqiad, DBA
Jclark-ctr closed T428361: Alert for device ps1-d1-eqiad.mgmt.eqiad.wmnet - PDU sensor over limit as Resolved.

No faults for the last 2 days; resolving ticket.

Tue, Jun 9, 11:46 AM · SRE, DC-Ops, ops-eqiad
Jclark-ctr closed T427852: hw troubleshooting: CPU1 thermal fault for wdqs1015.eqiad.wmnet as Resolved.

Closing this ticket. Opened decom ticket T428582 for Data Platform.

Tue, Jun 9, 11:45 AM · Data-Platform-SRE (2026-06-05 - 2026-06-26), SRE, ops-eqiad, DC-Ops
Jclark-ctr created T428582: decommission wdqs1015.eqiad.wmnet.
Tue, Jun 9, 11:44 AM · Data-Platform-SRE (2026-06-05 - 2026-06-26), decommission-hardware
Jclark-ctr added a comment to T428542: db1237 not rebooting.

@Marostegui you may want to leave the ticket open, at least until Dell responds.

Tue, Jun 9, 11:39 AM · SRE, DC-Ops, ops-eqiad, DBA
Jclark-ctr moved T428571: Degraded RAID on an-worker1201 from Backlog - project to In Progress on the Data-Platform-SRE (2026-06-05 - 2026-06-26) board.
Tue, Jun 9, 11:24 AM · Data-Platform-SRE (2026-06-05 - 2026-06-26), DC-Ops, SRE, ops-eqiad
Jclark-ctr added a project to T428571: Degraded RAID on an-worker1201: Data-Platform-SRE (2026-06-05 - 2026-06-26).
Tue, Jun 9, 11:24 AM · Data-Platform-SRE (2026-06-05 - 2026-06-26), DC-Ops, SRE, ops-eqiad
Jclark-ctr updated subscribers of T428571: Degraded RAID on an-worker1201.

Dell SR227538194

Tue, Jun 9, 11:23 AM · Data-Platform-SRE (2026-06-05 - 2026-06-26), DC-Ops, SRE, ops-eqiad
Jclark-ctr moved T428571: Degraded RAID on an-worker1201 from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Tue, Jun 9, 10:47 AM · Data-Platform-SRE (2026-06-05 - 2026-06-26), DC-Ops, SRE, ops-eqiad
Jclark-ctr moved T428542: db1237 not rebooting from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Tue, Jun 9, 10:46 AM · SRE, DC-Ops, ops-eqiad, DBA
Jclark-ctr claimed T428571: Degraded RAID on an-worker1201.
Tue, Jun 9, 10:33 AM · Data-Platform-SRE (2026-06-05 - 2026-06-26), DC-Ops, SRE, ops-eqiad
Jclark-ctr claimed T428542: db1237 not rebooting.
Tue, Jun 9, 6:40 AM · SRE, DC-Ops, ops-eqiad, DBA

Yesterday

Jclark-ctr added a project to T427852: hw troubleshooting: CPU1 thermal fault for wdqs1015.eqiad.wmnet: Data-Platform-SRE.
Mon, Jun 8, 3:30 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), SRE, ops-eqiad, DC-Ops
Jclark-ctr added a comment to T428361: Alert for device ps1-d1-eqiad.mgmt.eqiad.wmnet - PDU sensor over limit.

Rebalanced the PDU. I will leave the ticket open to monitor for any additional alerts

Mon, Jun 8, 11:48 AM · SRE, DC-Ops, ops-eqiad
Jclark-ctr added a comment to T427852: hw troubleshooting: CPU1 thermal fault for wdqs1015.eqiad.wmnet.

Discussed with @RKemper via IRC. He mentioned that we should decommission this one if the replacement is already here. The replacement (T423314) is racked and cabled, pending a Puppet fix to image the server.

Mon, Jun 8, 11:40 AM · Data-Platform-SRE (2026-06-05 - 2026-06-26), SRE, ops-eqiad, DC-Ops
Jclark-ctr added a comment to T428240: db1274 is not booting up.

Dell advised performing the same steps that had already been completed: a flea-power drain, firmware updates, and hardware diagnostic testing.

Mon, Jun 8, 11:36 AM · SRE, ops-eqiad, DC-Ops, DBA
Jclark-ctr added a comment to T428260: C/D refresh Nokia switches Exhaust direction is reversed.
Mon, Jun 8, 11:35 AM · DC-Ops, SRE, ops-eqiad

Sun, Jun 7

Jclark-ctr moved T428260: C/D refresh Nokia switches Exhaust direction is reversed from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Sun, Jun 7, 5:13 PM · DC-Ops, SRE, ops-eqiad
Jclark-ctr claimed T428361: Alert for device ps1-d1-eqiad.mgmt.eqiad.wmnet - PDU sensor over limit.
#1:
Sensor: Phase, BA:L1-L2, Active Power
Value: 1.749 kW (power)
Thresholds: High: 1650
Sun, Jun 7, 5:08 PM · SRE, DC-Ops, ops-eqiad
Jclark-ctr moved T428361: Alert for device ps1-d1-eqiad.mgmt.eqiad.wmnet - PDU sensor over limit from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Sun, Jun 7, 5:08 PM · SRE, DC-Ops, ops-eqiad

Fri, Jun 5

Jclark-ctr added a comment to T428260: C/D refresh Nokia switches Exhaust direction is reversed.

Screenshot 2026-06-05 at 9.12.03 AM.png (842×436 px, 709 KB)

The fans currently installed are F2B (front to back). They should be B2F (back to front).

Fri, Jun 5, 1:12 PM · DC-Ops, SRE, ops-eqiad
Jclark-ctr claimed T428260: C/D refresh Nokia switches Exhaust direction is reversed.
Fri, Jun 5, 1:08 PM · DC-Ops, SRE, ops-eqiad
Jclark-ctr updated the task description for T428260: C/D refresh Nokia switches Exhaust direction is reversed.
Fri, Jun 5, 1:07 PM · DC-Ops, SRE, ops-eqiad
Jclark-ctr added a comment to T428240: db1274 is not booting up.

I ran the CPU stress test for approximately 30 minutes and did not encounter any issues. I think the server is good to be repooled.

Fri, Jun 5, 1:06 PM · SRE, ops-eqiad, DC-Ops, DBA
Jclark-ctr created T428260: C/D refresh Nokia switches Exhaust direction is reversed.
Fri, Jun 5, 1:04 PM · DC-Ops, SRE, ops-eqiad
Jclark-ctr added a comment to T428240: db1274 is not booting up.

Dell SR 227400671

Fri, Jun 5, 12:16 PM · SRE, ops-eqiad, DC-Ops, DBA
Jclark-ctr added a comment to T428240: db1274 is not booting up.

Performed a flea power drain, and the server came back up. I am currently updating the BIOS, then I will pull a TSR report and open a Dell support ticket for documentation and tracking in case the issue continues.

Fri, Jun 5, 11:54 AM · SRE, ops-eqiad, DC-Ops, DBA
Jclark-ctr moved T428240: db1274 is not booting up from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Fri, Jun 5, 11:38 AM · SRE, ops-eqiad, DC-Ops, DBA
Jclark-ctr claimed T428240: db1274 is not booting up.
Fri, Jun 5, 10:41 AM · SRE, ops-eqiad, DC-Ops, DBA

Thu, Jun 4

Jclark-ctr renamed T428161: document Old line cards in eqiad Storage. and removal of MPC-3D-16XGE-SFPP line cards from CR1 and CR2 from document Old line cards in eqiad Storage to document Old line cards in eqiad Storage. and removal of MPC-3D-16XGE-SFPP line cards from CR1 and CR2.
Thu, Jun 4, 2:17 PM · ops-eqiad, SRE, DC-Ops
Jclark-ctr closed T428161: document Old line cards in eqiad Storage. and removal of MPC-3D-16XGE-SFPP line cards from CR1 and CR2 as Resolved.

Thank you for checking, @ayounsi. We will add them with the servers.

Thu, Jun 4, 2:12 PM · ops-eqiad, SRE, DC-Ops
Jclark-ctr moved T428161: document Old line cards in eqiad Storage. and removal of MPC-3D-16XGE-SFPP line cards from CR1 and CR2 from Backlog to Remote Work on the ops-eqiad board.
Thu, Jun 4, 1:25 PM · ops-eqiad, SRE, DC-Ops
Jclark-ctr updated subscribers of T428161: document Old line cards in eqiad Storage. and removal of MPC-3D-16XGE-SFPP line cards from CR1 and CR2.

@cmooney @ayounsi Following the NetOps sync on Tuesday, I verified the serial numbers of the fabric cards in storage and documented them. Can you advise whether these can be added to the next recycling event at eqiad?

PIDRevisionSerialAssemblyPCB
SCB-MX960-S-GAG72400361REVAABBH8547750-021524R15710-021523R09
SCB-MX960-S-GAG72400361REVAABBH2700750-021524R15710-021523R09
SCB-MX960-S-GAG72400361REVAABBH2635750-021524R15710-021523R09
SCB-MX960-S-GAG72400361REVAABBH8423750-021524R15710-021523R09
Thu, Jun 4, 1:03 PM · ops-eqiad, SRE, DC-Ops
Jclark-ctr created T428161: document Old line cards in eqiad Storage. and removal of MPC-3D-16XGE-SFPP line cards from CR1 and CR2.
Thu, Jun 4, 12:59 PM · ops-eqiad, SRE, DC-Ops

Wed, Jun 3

Jclark-ctr closed T426303: decommission mc10[37-54] as Resolved.
Wed, Jun 3, 2:07 PM · SRE, DC-Ops, ops-eqiad, User-jijiki, ServiceOps-Upgrades-Hardware, ServiceOps new, decommission-hardware
Jclark-ctr updated the task description for T426303: decommission mc10[37-54].
Wed, Jun 3, 2:06 PM · SRE, DC-Ops, ops-eqiad, User-jijiki, ServiceOps-Upgrades-Hardware, ServiceOps new, decommission-hardware
Jclark-ctr updated the task description for T426303: decommission mc10[37-54].
Wed, Jun 3, 1:14 PM · SRE, DC-Ops, ops-eqiad, User-jijiki, ServiceOps-Upgrades-Hardware, ServiceOps new, decommission-hardware

Tue, Jun 2

Jclark-ctr added a comment to T427748: Degraded RAID on centrallog1002.

New drive has been attached, @colewhite. Ready to be rebuilt.

Tue, Jun 2, 9:11 PM · Observability-Logging, DC-Ops, SRE, ops-eqiad
Jclark-ctr added a comment to T427748: Degraded RAID on centrallog1002.

Removed the failed drive. Verified sdb has been removed.

Tue, Jun 2, 9:09 PM · Observability-Logging, DC-Ops, SRE, ops-eqiad
Jclark-ctr added a comment to T427748: Degraded RAID on centrallog1002.

I was double-checking and realized I was looking at the model, not the serial. Verified again: it is actually slot 5.

Tue, Jun 2, 9:08 PM · Observability-Logging, DC-Ops, SRE, ops-eqiad
Jclark-ctr moved T426303: decommission mc10[37-54] from Hardware Failure / Troubleshoot to Decommission on the ops-eqiad board.
Tue, Jun 2, 8:23 PM · SRE, DC-Ops, ops-eqiad, User-jijiki, ServiceOps-Upgrades-Hardware, ServiceOps new, decommission-hardware
Jclark-ctr moved T426303: decommission mc10[37-54] from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Tue, Jun 2, 8:23 PM · SRE, DC-Ops, ops-eqiad, User-jijiki, ServiceOps-Upgrades-Hardware, ServiceOps new, decommission-hardware
Jclark-ctr claimed T426303: decommission mc10[37-54].
Tue, Jun 2, 8:23 PM · SRE, DC-Ops, ops-eqiad, User-jijiki, ServiceOps-Upgrades-Hardware, ServiceOps new, decommission-hardware
Jclark-ctr added a comment to T427748: Degraded RAID on centrallog1002.

@colewhite can this be swapped at any time? Would you be able to rebuild after swapping?

Tue, Jun 2, 1:38 PM · Observability-Logging, DC-Ops, SRE, ops-eqiad
Jclark-ctr added a comment to T423314: Q4:rack/setup/install dse-k8s-wdqs100[1-3] (formerly wdqs103[6-8]).

Screenshot 2026-06-02 at 9.25.28 AM.png (1,096×608 px, 201 KB)
These are failing to image due to the preseed file.

Tue, Jun 2, 1:26 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review, Wikidata Platform Team, ops-eqiad, SRE, DC-Ops
Jclark-ctr added a comment to T427852: hw troubleshooting: CPU1 thermal fault for wdqs1015.eqiad.wmnet.

I did attempt the firmware updates, but after rebooting, the server became unresponsive and will not boot.

Tue, Jun 2, 1:01 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), SRE, ops-eqiad, DC-Ops
Jclark-ctr updated subscribers of T427852: hw troubleshooting: CPU1 thermal fault for wdqs1015.eqiad.wmnet.

@RKemper @wiki_willy I have gone through all decommissioned servers and do not have a matching Intel(R) Xeon(R) Silver 4215 CPU @ 2.50GHz available to replace CPU1.

Tue, Jun 2, 12:46 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), SRE, ops-eqiad, DC-Ops
Jclark-ctr added a comment to T423314: Q4:rack/setup/install dse-k8s-wdqs100[1-3] (formerly wdqs103[6-8]).

Dedicated dse-k8s workers for production WDQS in codfw - See #T425653

node /^dse-k8s-wdqs200[1-4]\.codfw\./ {
    role(insetup::data_platform_ferm)
}

Tue, Jun 2, 12:08 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review, Wikidata Platform Team, ops-eqiad, SRE, DC-Ops

Mon, Jun 1

Jclark-ctr added a comment to T427852: hw troubleshooting: CPU1 thermal fault for wdqs1015.eqiad.wmnet.

This server is out of warranty, @RKemper, but I am looking at it right now.

Mon, Jun 1, 11:30 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), SRE, ops-eqiad, DC-Ops
Jclark-ctr moved T427852: hw troubleshooting: CPU1 thermal fault for wdqs1015.eqiad.wmnet from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Mon, Jun 1, 11:22 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), SRE, ops-eqiad, DC-Ops
Jclark-ctr added a comment to T427748: Degraded RAID on centrallog1002.

Errors are on sdb, which has failed in the md1 array. Matching serials, according to iDRAC it is in slot 4.

Mon, Jun 1, 1:26 PM · Observability-Logging, DC-Ops, SRE, ops-eqiad
Jclark-ctr claimed T427748: Degraded RAID on centrallog1002.

This server is out of warranty. Will check to see what is available from decom servers.

Mon, Jun 1, 1:55 AM · Observability-Logging, DC-Ops, SRE, ops-eqiad

Fri, May 29

Jclark-ctr closed T427451: decommission lvs1016.eqiad.wmnet, a subtask of T421421: Revert lvs1017 Mellanox NIC to Broadcom, as Resolved.
Fri, May 29, 6:08 PM · SRE, Traffic
Jclark-ctr closed T427451: decommission lvs1016.eqiad.wmnet as Resolved.
Fri, May 29, 6:08 PM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Jclark-ctr updated the task description for T427451: decommission lvs1016.eqiad.wmnet.
Fri, May 29, 6:08 PM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Jclark-ctr updated the task description for T427451: decommission lvs1016.eqiad.wmnet.
Fri, May 29, 1:01 PM · SRE, ops-eqiad, DC-Ops, decommission-hardware

Thu, May 28

Jclark-ctr added a comment to T427451: decommission lvs1016.eqiad.wmnet.

A7 U27

Thu, May 28, 5:33 PM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Jclark-ctr claimed T427451: decommission lvs1016.eqiad.wmnet.
Thu, May 28, 5:32 PM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Jclark-ctr removed a member for ops-eqiad: Jclark-ctr.
Thu, May 28, 4:34 PM
Jclark-ctr added a watcher for ops-eqiad: Jclark-ctr.
Thu, May 28, 4:33 PM
Jclark-ctr moved T427535: db1224 is unreachable from Hardware Failure / Troubleshoot to Backlog on the ops-eqiad board.
Thu, May 28, 4:28 PM · SRE, DC-Ops, ops-eqiad, DBA
Jclark-ctr moved T427535: db1224 is unreachable from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Thu, May 28, 4:26 PM · SRE, DC-Ops, ops-eqiad, DBA
Jclark-ctr added a comment to T423314: Q4:rack/setup/install dse-k8s-wdqs100[1-3] (formerly wdqs103[6-8]).

@bking @RKemper can puppet be updated for new host names?

Thu, May 28, 3:32 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review, Wikidata Platform Team, ops-eqiad, SRE, DC-Ops
Jclark-ctr added a member for ops-eqiad: Jclark-ctr.
Thu, May 28, 3:28 PM
Jclark-ctr added a comment to T427451: decommission lvs1016.eqiad.wmnet.

@BCornwall I see cookbook failed. Is it still good for us to proceed with onsite work?

Thu, May 28, 1:31 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware

Wed, May 27

Jclark-ctr closed T427408: Power Supply - PS Redundancy - issue on dbproxy1024:9290 as Resolved.
Wed, May 27, 4:35 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr claimed T427408: Power Supply - PS Redundancy - issue on dbproxy1024:9290.
Wed, May 27, 4:22 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr added a comment to T423314: Q4:rack/setup/install dse-k8s-wdqs100[1-3] (formerly wdqs103[6-8]).

I have updated the server names and switch ports and provisioned the servers. Pending Puppet being updated, @BTullis.

Wed, May 27, 12:23 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review, Wikidata Platform Team, ops-eqiad, SRE, DC-Ops
Jclark-ctr added a comment to T425088: Q3 :rack/setup/install cloudvirt refresh.

@Jclark-ctr once T426180 is resolved and hosts can be reimaged, please rack as follows

1077 -> C8
1078 -> D5
1079 -> E4
1080 -> F4

Wed, May 27, 12:21 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr closed T427270: decommission pc1014.eqiad.wmnet as Resolved.
Wed, May 27, 11:39 AM · SRE, ops-eqiad, DC-Ops, DBA, decommission-hardware
Jclark-ctr closed T427270: decommission pc1014.eqiad.wmnet, a subtask of T418973: Productionize pc20[21-24] and pc10[21-24], as Resolved.
Wed, May 27, 11:39 AM · DBA
Jclark-ctr moved T427353: Repurpose ganeti102[3456] for Zuul migration from Racking Tasks to Remote Work on the ops-eqiad board.
Wed, May 27, 11:32 AM · DC-Ops, ops-eqiad, collaboration-services, SRE
Jclark-ctr moved T427353: Repurpose ganeti102[3456] for Zuul migration from Backlog to Racking Tasks on the ops-eqiad board.
Wed, May 27, 11:31 AM · DC-Ops, ops-eqiad, collaboration-services, SRE
Jclark-ctr updated the task description for T427270: decommission pc1014.eqiad.wmnet.
Wed, May 27, 11:31 AM · SRE, ops-eqiad, DC-Ops, DBA, decommission-hardware
Jclark-ctr added a comment to T427270: decommission pc1014.eqiad.wmnet.

D6 U36

Wed, May 27, 11:30 AM · SRE, ops-eqiad, DC-Ops, DBA, decommission-hardware
Jclark-ctr updated the task description for T427270: decommission pc1014.eqiad.wmnet.
Wed, May 27, 11:30 AM · SRE, ops-eqiad, DC-Ops, DBA, decommission-hardware
Jclark-ctr closed T427190: decommission pc1013.eqiad.wmnet as Resolved.
Wed, May 27, 11:29 AM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr closed T427190: decommission pc1013.eqiad.wmnet, a subtask of T418973: Productionize pc20[21-24] and pc10[21-24], as Resolved.
Wed, May 27, 11:28 AM · DBA
Jclark-ctr moved T427270: decommission pc1014.eqiad.wmnet from Remote Work to Decommission on the ops-eqiad board.
Wed, May 27, 11:28 AM · SRE, ops-eqiad, DC-Ops, DBA, decommission-hardware
Jclark-ctr moved T427270: decommission pc1014.eqiad.wmnet from Backlog to Remote Work on the ops-eqiad board.
Wed, May 27, 11:28 AM · SRE, ops-eqiad, DC-Ops, DBA, decommission-hardware
Jclark-ctr claimed T427270: decommission pc1014.eqiad.wmnet.
Wed, May 27, 9:30 AM · SRE, ops-eqiad, DC-Ops, DBA, decommission-hardware

Tue, May 26

Jclark-ctr closed T426503: Alert for device ps1-c4-eqiad.mgmt.eqiad.wmnet - PDU sensor over limit as Resolved.
Tue, May 26, 5:42 PM · SRE, DC-Ops, ops-eqiad
Jclark-ctr updated the task description for T427190: decommission pc1013.eqiad.wmnet.
Tue, May 26, 2:51 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr added a comment to T427190: decommission pc1013.eqiad.wmnet.

pc1013 C5 U26

Tue, May 26, 2:51 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr claimed T427190: decommission pc1013.eqiad.wmnet.
Tue, May 26, 2:50 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr moved T427190: decommission pc1013.eqiad.wmnet from Remote Work to Decommission on the ops-eqiad board.
Tue, May 26, 2:50 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr moved T427190: decommission pc1013.eqiad.wmnet from Backlog to Remote Work on the ops-eqiad board.
Tue, May 26, 2:50 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware

Wed, May 20

Jclark-ctr closed T418916: Q3:rack/setup/install rdb101[56] as Resolved.
Wed, May 20, 11:40 AM · Patch-For-Review, ServiceOps-Upgrades-Hardware, SRE, ServiceOps new, ops-eqiad, DC-Ops
Jclark-ctr claimed T418916: Q3:rack/setup/install rdb101[56].

@MLechvien-WMF I believe so. It looks like @Papaul noticed the missing part in Puppet and updated both.

Wed, May 20, 11:04 AM · Patch-For-Review, ServiceOps-Upgrades-Hardware, SRE, ServiceOps new, ops-eqiad, DC-Ops

Tue, May 19

Jclark-ctr reassigned T418916: Q3:rack/setup/install rdb101[56] from jijiki to Clement_Goubert.
Tue, May 19, 10:09 PM · Patch-For-Review, ServiceOps-Upgrades-Hardware, SRE, ServiceOps new, ops-eqiad, DC-Ops
Jclark-ctr reassigned T418916: Q3:rack/setup/install rdb101[56] from Effib to jijiki.
Tue, May 19, 10:08 PM · Patch-For-Review, ServiceOps-Upgrades-Hardware, SRE, ServiceOps new, ops-eqiad, DC-Ops
Jclark-ctr reassigned T418916: Q3:rack/setup/install rdb101[56] from Jclark-ctr to Effib.
Tue, May 19, 10:07 PM · Patch-For-Review, ServiceOps-Upgrades-Hardware, SRE, ServiceOps new, ops-eqiad, DC-Ops
Jclark-ctr added a comment to T423314: Q4:rack/setup/install dse-k8s-wdqs100[1-3] (formerly wdqs103[6-8]).

wdqs1037 is failing to provision. Will check cabling next time on site.

Tue, May 19, 9:42 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review, Wikidata Platform Team, ops-eqiad, SRE, DC-Ops
Jclark-ctr updated the task description for T423314: Q4:rack/setup/install dse-k8s-wdqs100[1-3] (formerly wdqs103[6-8]).
Tue, May 19, 9:42 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review, Wikidata Platform Team, ops-eqiad, SRE, DC-Ops
Jclark-ctr updated the task description for T423314: Q4:rack/setup/install dse-k8s-wdqs100[1-3] (formerly wdqs103[6-8]).
Tue, May 19, 9:25 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review, Wikidata Platform Team, ops-eqiad, SRE, DC-Ops
Jclark-ctr closed Unknown Object (Task), a subtask of T422038: WDQS: write puppet code/investigate performance optimizations for new hardware, as Resolved.
Tue, May 19, 8:55 PM · Wikidata Platform Team, Data-Platform-SRE