Page MenuHomePhabricator

wiki_willy
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
Apr 16 2019, 9:00 PM (262 w, 54 m)
Availability
Available
LDAP User
Wpao
MediaWiki User
WPao (WMF) [ Global Accounts ]

Recent Activity

Mon, Apr 15

wiki_willy closed T296966: eqiad: Master Tracking Ticket for eqiad expansion cage as Resolved.

Since the only thing remaining in this task is bringing up the Dell switches in racks E8 and F8 (which I believe the Network SRE team is working on), I'm going to go ahead and resolve the main tracking ticket. Thanks, Willy

Mon, Apr 15, 5:21 PM · SRE, ops-eqiad, DC-Ops

Wed, Apr 3

wiki_willy added a comment to T336320: scrape RT ticket HTML files.

Sure, no prob @LSobanski. Here's the list of the 24 active devices that still reference RT tasks in Netbox, along with their purchase dates (network equipment usually EOLs every 8yrs):

Wed, Apr 3, 7:17 PM · collaboration-services

Tue, Apr 2

wiki_willy added a comment to T336320: scrape RT ticket HTML files.

Thanks for checking @LSobanski. It's definitely rare that we need to refer back to RT. In the last 5 years, the 2-3 cases that we've had to reference RT was typically due to tracking down information about core routers that we had purchased back then. In Netbox, we only have 24 active devices left that still reference RT tasks. As long as we're able to access these in someway (ideally quickly and easily) on the rare occasions that it's needed, you should be able to proceed with moving forward.

Tue, Apr 2, 7:22 PM · collaboration-services

Mar 19 2024

wiki_willy added a comment to T360297: Take advantage of 10Gb NICs in the new network stack.

Hi @elukey - do you want me to change the Lift Wing expansion requests for 16x servers in FY24-25 to 10g? Thanks, Willy

Mar 19 2024, 4:41 PM · Infrastructure-Foundations, DC-Ops, netops

Mar 13 2024

wiki_willy updated subscribers of T359940: hw troubleshooting: Unidentified for db1246.eqiad.wmnet.

++ @VRiley-WMF & @Jclark-ctr for troubleshooting the hardware. (host was installed a few quarters ago)

Mar 13 2024, 2:09 PM · DBA, SRE, ops-eqiad, DC-Ops

Mar 5 2024

wiki_willy added a comment to T358542: Netbox errors caused by system board replacement .

Sounds good. @Jhancock.wm - I created a new sheet below, with the following fields. I entered in the hostnames and asset tag, but can you fill in the remaining items for old S/N, new S/N, and Phabricator Task?

Mar 5 2024, 12:06 AM · SRE, ops-codfw

Mar 4 2024

wiki_willy added a comment to T358542: Netbox errors caused by system board replacement .

Thanks for confirming, @Volans. If everyone else is ok with making the correlation on the accounting spreadsheet, my vote is that we go with that route. Thanks, Willy

Mar 4 2024, 10:06 PM · SRE, ops-codfw

Mar 1 2024

wiki_willy added a comment to T358542: Netbox errors caused by system board replacement .

Thanks @Volans, that makes sense. My preference would be to leave Netbox as is, and use the accounting spreadsheet to make the S/N connection to each other. Would we be adding a different tab on the accounting spreadsheet for that?

Mar 1 2024, 12:39 AM · SRE, ops-codfw

Feb 29 2024

wiki_willy added a comment to T358542: Netbox errors caused by system board replacement .

If we change the serial number, I think it would create an error for S/N / Asset tag mismatch. (related to Riccardo's points earlier) We also reference the original chassis S/N when dealing with vendors for recycling servers (estimates, official documentation, etc) and purchasing replacement parts, so I'm still a bit hesitant with editing the S/N in Netbox as the solution. Since it doesn't sound like we receive any Netbox alerts when we replacing with a new motherboard, is there something that we could tweak to replicate the same thing? (ie: change the status or something of the donor server) Or worse case, just suppress these alerts somehow, until they eventually decommission?

Feb 29 2024, 9:26 PM · SRE, ops-codfw

Feb 28 2024

wiki_willy added a comment to T358542: Netbox errors caused by system board replacement .

Hey @Volans - much appreciated for your feedback and for the suggestions. I was wondering since the physical serial number listed on the chassis doesn't change (it's only from a Puppet perspective that the serial number changes), is there anything on the Puppet side that could be modified to reflect the MB replacement? If there's something easy that could be done in Puppet to prevent the Netbox error from alerting, I kind of feel like it would be a more accurate representation.

Feb 28 2024, 11:18 PM · SRE, ops-codfw
wiki_willy updated subscribers of T358727: Reclaim recently-decommed CP host for WDQS (see T352253).

++ @VRiley-WMF and @Jclark-ctr - can one of you pick up this request? We'll be repurposing one of the previously decommissioned cp servers to set up a temp server for Adam to use. Thanks, Willy

Feb 28 2024, 10:02 PM · Discovery-Search (Current work), Data-Platform-SRE (2024.03.04 - 2024.03.24), wmde-wikidata-tech, Wikidata, SRE, ops-eqiad
wiki_willy added a comment to T352253: Decommission task for old cp hosts (cp1075-1090).

Sounds good @bking, thanks!

Feb 28 2024, 8:59 PM · SRE, ops-eqiad, DC-Ops, Traffic
wiki_willy added a comment to T358533: Hardware requests for Search Platform FY2024-2025.

Hi @bking - thanks for coming up with the list. I have the following refreshes already on the CapEx doc, so you just have to fill in the missing columns for "Hardware Config", "Network Speed" and "Total Equipment Cost" (for custom configs)

Feb 28 2024, 5:17 PM · Data-Platform-SRE (2024.03.25 - 2024.04.14)

Feb 27 2024

wiki_willy added a comment to T358421: db2118 crashed and rebooted due to HW.

Thanks for picking this up @Jhancock.wm. @Marostegui - since this host looks like it's close to being refreshed in T355350, do you want to just wait for the refreshed server to be setup instead of fixing this one? Thanks, Willy

Feb 27 2024, 2:10 AM · Wikimedia-Incident, DBA, SRE

Feb 26 2024

wiki_willy updated subscribers of T358421: db2118 crashed and rebooted due to HW.

@wiki_willy can we contact the vendor about this issue which caused a reboot?

Record:      27
Date/Time:   02/24/2024 10:08:18
Source:      system
Severity:    Critical
Description: CPU 1 machine check error detected.
Feb 26 2024, 6:19 PM · Wikimedia-Incident, DBA, SRE

Feb 23 2024

wiki_willy added a comment to T352253: Decommission task for old cp hosts (cp1075-1090).

Hi @ssingh - the hardware should still be around, and we should be able to reallocate one of them for testing purposes. Can you shoot open a new Phabricator for us with all the necessary details (hostname, racking info, network setup, raid/partitioning, OS, and main poc)? Also, do you know how long Adam would need it for?

Feb 23 2024, 5:51 PM · SRE, ops-eqiad, DC-Ops, Traffic

Feb 21 2024

wiki_willy added a project to T357951: db2137 and es2026 don't get an IP via PXE boot: ops-codfw.

++ @Jhancock.wm for visibility and in case any onsite support is needed

Feb 21 2024, 3:56 PM · SRE, ops-codfw, DC-Ops

Feb 8 2024

wiki_willy assigned T357015: Degraded RAID on db2194 to Jhancock.wm.

++ @Jhancock.wm

Feb 8 2024, 5:14 PM · DBA, SRE, ops-codfw

Jan 10 2024

wiki_willy added a comment to T354606: Investigate memory increase for Prometheus hosts in codfw/eqiad.

Thanks @VRiley-WMF. I have T354684 assigned over to you, so you can work with @fgiunchedi on coordinating downtime for the upgrades. Thanks, Willy

Jan 10 2024, 9:35 PM · SRE, ops-codfw, ops-eqiad, Observability-Metrics
wiki_willy assigned T354684: RAM upgrade for prometheus100[56] to VRiley-WMF.
Jan 10 2024, 9:32 PM · SRE, ops-eqiad, Observability-Metrics

Jan 9 2024

wiki_willy added a comment to T354606: Investigate memory increase for Prometheus hosts in codfw/eqiad.

Awesome, thanks @Jhancock.wm. Here's the codfw upgrade ticket for you to coordinate with @fgiunchedi on the downtime - T354685. Thanks, Willy

Jan 9 2024, 6:36 PM · SRE, ops-codfw, ops-eqiad, Observability-Metrics
wiki_willy assigned T354685: RAM upgrade for prometheus200[56] to Jhancock.wm.
Jan 9 2024, 6:34 PM · SRE, ops-codfw, Observability-Metrics
wiki_willy updated subscribers of T354591: db1224 crashed - hardware error.

++ @Jclark-ctr & @VRiley-WMF

Jan 9 2024, 5:24 PM · SRE, DC-Ops, ops-eqiad, DBA
wiki_willy updated subscribers of T354606: Investigate memory increase for Prometheus hosts in codfw/eqiad.

@Papaul / @Jhancock.wm and @Jclark-ctr / @VRiley-WMF - can you see if you have any spare memory onsite for Filippo? I think it's for prometheus100[5,6] and prometheus200[5,6]. (cc @RobH in case we have to order them)

Jan 9 2024, 5:22 PM · SRE, ops-codfw, ops-eqiad, Observability-Metrics

Dec 15 2023

wiki_willy updated subscribers of T353503: ps1-e8-eqiad down.

@Jclark-ctr or @VRiley-WMF - can one of you take a look at this one?

Dec 15 2023, 9:00 PM · SRE, ops-eqiad

Dec 7 2023

wiki_willy updated subscribers of T353020: Degraded RAID on db1168.

Definitely. @Jclark-ctr & @VRiley-WMF - can you check if we have any spare drives from a decommissioned host? If not, we'll purchase one via @RobH). Thanks, Willy

Dec 7 2023, 8:50 PM · DBA, SRE, ops-eqiad
wiki_willy updated subscribers of T351891: Abstract a bit more the server provisioning process.
Dec 7 2023, 6:05 PM · Infrastructure-Foundations, SRE-tools

Dec 1 2023

wiki_willy closed Unknown Object (Task), a subtask of T329219: Main Tracking Task for ESAMS Migration to KNAMS, as Resolved.
Dec 1 2023, 10:04 PM · Patch-For-Review, SRE, ops-esams, DC-Ops

Nov 29 2023

wiki_willy updated subscribers of T352238: Degraded RAID on db1199.

++ @Jclark-ctr & @VRiley-WMF - can one of you two work on getting the drive RMA'd for this one? Thanks, Willy

Nov 29 2023, 8:36 AM · DBA, SRE, ops-eqiad

Nov 23 2023

wiki_willy closed Unknown Object (Task), a subtask of T346722: Sao Paulo, Brazil, South America POP tracking task, as Resolved.
Nov 23 2023, 12:06 AM · ops-magru, Patch-For-Review

Nov 22 2023

wiki_willy assigned T350179: Reimage cookbook on new eqiad hosts stuck at PXE booting to Jclark-ctr.
Nov 22 2023, 8:09 PM · SRE, Traffic, SRE-swift-storage, ops-codfw, DC-Ops, ops-eqiad

Nov 10 2023

wiki_willy added a comment to T350885: Project future physical host usage for Search Platform-owned services.

Thanks for working on this @bking. I'm mainly looking to see how much future growth you're looking at (a rough estimate is fine), if you have any requests for the type of servers we provide (ie: ARM, GPU, etc), or just have any feedback for us in general. We're getting pretty full at codfw, so when we purchase additional data center space, we want to ensure we're adding enough capacity for everyone's future needs over the next 3-5yrs. Thanks, Willy

Nov 10 2023, 1:24 AM · Data-Platform-SRE

Oct 30 2023

wiki_willy added a comment to T349756: Audit of WMCS Servers Using Single & Dual Switchports.

Awesome, thanks for working on this @VRiley-WMF. @nskaggs & @cmooney - since we have some discrepancies with the number of ports being used on these cloudvirts, should we come up with a plan/process to help us free up the second switchport on them? This will help us reclaim some switchports for new installs and server migrations. Thanks, Willy

Oct 30 2023, 7:48 PM · SRE, ops-eqiad, DC-Ops

Oct 25 2023

wiki_willy created T349756: Audit of WMCS Servers Using Single & Dual Switchports.
Oct 25 2023, 7:58 PM · SRE, ops-eqiad, DC-Ops

Oct 17 2023

wiki_willy updated subscribers of T308339: eqiad: move non WMCS servers out of rack C8.

@Jclark-ctr or @VRiley-WMF - can one of you follow up on Ben's question above on an-tool1010, along with Alex's comment on deploy1102? Thanks, Willy

Oct 17 2023, 9:02 PM · SRE, DBA, ops-eqiad

Oct 3 2023

wiki_willy updated subscribers of T306007: Avoid ghost hosts on the network.

++ @Papaul , who's going to dig around a bit and provide some feedback

Oct 3 2023, 9:49 PM · SRE, Infrastructure-Foundations, netbox, netops, DC-Ops

Aug 30 2023

wiki_willy assigned T344597: Decommission thumbor200[34] to Jhancock.wm.
Aug 30 2023, 5:12 PM · SRE, serviceops, ops-codfw

Aug 11 2023

wiki_willy updated subscribers of T344076: Increase VM size for wikitech-static.

Hi @Andrew - I don't have Rackspace under my budget. I think that one falls under the SRE budget, so you may to reach out to @mark on that one.

Aug 11 2023, 9:45 PM · Sustainability (Incident Followup), cloud-services-team

Aug 2 2023

wiki_willy updated subscribers of T343254: codfw: es2025 lost System Board Fan6.

It's not on the refresh list for this fiscal year; looks like it'll be due for a refresh in FY24-25. If the firmware upgrade on the iDrac doesn't work, we can try sourcing the fan if you want. (cc @RobH)

Aug 2 2023, 4:01 PM · SRE, ops-codfw, DBA

Jul 31 2023

wiki_willy updated the task description for T329219: Main Tracking Task for ESAMS Migration to KNAMS.
Jul 31 2023, 10:09 PM · Patch-For-Review, SRE, ops-esams, DC-Ops

Jul 19 2023

wiki_willy added a comment to T342198: Relocate one of the mx480 from esams to knams.

Cool, thanks for confirming @Papaul. Hopefully Iron Mountain will come back with the same confirmation as well.

Jul 19 2023, 5:41 PM · SRE, ops-esams, DC-Ops
wiki_willy reassigned T342224: decommission dbproxy1016.eqiad.wmnet from wiki_willy to Jclark-ctr.
Jul 19 2023, 3:04 PM · SRE, ops-eqiad, DBA, decommission-hardware

Jul 18 2023

wiki_willy reassigned T342103: decommission dbproxy1015.eqiad.wmnet from wiki_willy to Jclark-ctr.
Jul 18 2023, 3:00 PM · SRE, ops-eqiad, DBA, decommission-hardware

Jul 13 2023

wiki_willy added a project to T340128: decommission frpig1001.frack.eqiad.wmnet: ops-eqiad.
Jul 13 2023, 4:38 PM · SRE, ops-eqiad, fundraising-tech-ops, decommission-hardware
wiki_willy added a project to T340433: decommission krb2001.codfw.wmnet: ops-codfw.
Jul 13 2023, 4:37 PM · SRE, ops-codfw, decommission-hardware
wiki_willy reassigned T341782: decommission dbproxy1014.eqiad.wmnet from wiki_willy to Jclark-ctr.
Jul 13 2023, 4:37 PM · SRE, ops-eqiad, DBA, decommission-hardware

Jul 12 2023

wiki_willy reassigned T341711: decommission dbproxy1013.eqiad.wmnet from wiki_willy to Jclark-ctr.
Jul 12 2023, 4:59 PM · SRE, ops-eqiad, decommission-hardware

Jul 11 2023

wiki_willy assigned T341494: cloud @ eqiad: hardware re-racking plan to Jclark-ctr.

Hi @Jclark-ctr - can you work with @aborrero on the timeframe and migration plan for these servers? Thanks, Willy

Jul 11 2023, 8:11 PM · cloud-services-team (FY2023/2024-Q1-Q2), SRE, ops-eqiad, User-aborrero, Goal

Jul 10 2023

wiki_willy added a subtask for T329219: Main Tracking Task for ESAMS Migration to KNAMS: Unknown Object (Task).
Jul 10 2023, 9:38 PM · Patch-For-Review, SRE, ops-esams, DC-Ops
wiki_willy updated the task description for T329219: Main Tracking Task for ESAMS Migration to KNAMS.
Jul 10 2023, 9:38 PM · Patch-For-Review, SRE, ops-esams, DC-Ops
wiki_willy reassigned T341510: decommission dbproxy1012.eqiad.wmnet from wiki_willy to Jclark-ctr.
Jul 10 2023, 7:23 PM · SRE, ops-eqiad, DBA, decommission-hardware

Jun 27 2023

wiki_willy added a member for WMF-NDA: Jhancock.wm.
Jun 27 2023, 5:26 PM
wiki_willy moved T340501: Inbound interface errors from Backlog to Hardware Failure / Troubleshoot on the ops-codfw board.
Jun 27 2023, 5:17 PM · ops-codfw

Jun 23 2023

wiki_willy moved T340077: decommission gerrit1001.wikimedia.org (dcops, netbox) from Backlog to Decommission on the ops-eqiad board.
Jun 23 2023, 10:01 PM · SRE, Infrastructure-Foundations, ops-eqiad, decommission-hardware, collaboration-services
wiki_willy assigned T340077: decommission gerrit1001.wikimedia.org (dcops, netbox) to Jclark-ctr.
Jun 23 2023, 10:01 PM · SRE, Infrastructure-Foundations, ops-eqiad, decommission-hardware, collaboration-services
wiki_willy assigned T339340: hw troubleshooting: CPU machine check failure for parse1002.eqiad.wmnet to Jclark-ctr.
Jun 23 2023, 10:00 PM · serviceops, SRE, ops-eqiad, DC-Ops

Jun 20 2023

wiki_willy removed a watcher for ops-codfw: Cmjohnson.
Jun 20 2023, 7:31 PM
wiki_willy removed a member for ops-codfw: Cmjohnson.
Jun 20 2023, 7:31 PM
wiki_willy added a member for ops-codfw: Jhancock.wm.
Jun 20 2023, 7:30 PM
wiki_willy added a subtask for T329219: Main Tracking Task for ESAMS Migration to KNAMS: Unknown Object (Task).
Jun 20 2023, 4:57 PM · Patch-For-Review, SRE, ops-esams, DC-Ops
wiki_willy updated the task description for T329219: Main Tracking Task for ESAMS Migration to KNAMS.
Jun 20 2023, 4:57 PM · Patch-For-Review, SRE, ops-esams, DC-Ops

Jun 16 2023

wiki_willy updated the task description for T329219: Main Tracking Task for ESAMS Migration to KNAMS.
Jun 16 2023, 10:35 PM · Patch-For-Review, SRE, ops-esams, DC-Ops
wiki_willy updated the task description for T329219: Main Tracking Task for ESAMS Migration to KNAMS.
Jun 16 2023, 10:28 PM · Patch-For-Review, SRE, ops-esams, DC-Ops

Jun 15 2023

wiki_willy assigned T338326: Relabel: puppetserver1005 to puppetserver1001 to Jclark-ctr.
Jun 15 2023, 11:01 PM · SRE, DC-Ops, ops-eqiad
wiki_willy assigned T339100: decommission ms-be104[0-3].eqiad.wmnet to Jclark-ctr.
Jun 15 2023, 11:00 PM · SRE, SRE-swift-storage, DC-Ops, ops-eqiad, decommission-hardware

Jun 8 2023

wiki_willy raised the priority of T326684: Q4:rack/setup/install backup1010, backup1011 from Medium to High.
Jun 8 2023, 6:11 PM · bacula, Data-Persistence-Backup, Data-Persistence, SRE, ops-eqiad, DC-Ops
wiki_willy updated the task description for T329219: Main Tracking Task for ESAMS Migration to KNAMS.
Jun 8 2023, 3:55 PM · Patch-For-Review, SRE, ops-esams, DC-Ops
wiki_willy updated the task description for T329219: Main Tracking Task for ESAMS Migration to KNAMS.
Jun 8 2023, 7:30 AM · Patch-For-Review, SRE, ops-esams, DC-Ops

Jun 6 2023

wiki_willy assigned T338236: PowerSupplyFailure to Jclark-ctr.
Jun 6 2023, 4:57 PM · ops-eqiad
wiki_willy updated subscribers of T329219: Main Tracking Task for ESAMS Migration to KNAMS.
Jun 6 2023, 4:55 PM · Patch-For-Review, SRE, ops-esams, DC-Ops

Jun 1 2023

wiki_willy updated the task description for T329219: Main Tracking Task for ESAMS Migration to KNAMS.
Jun 1 2023, 10:05 PM · Patch-For-Review, SRE, ops-esams, DC-Ops
wiki_willy added a subtask for T329219: Main Tracking Task for ESAMS Migration to KNAMS: Unknown Object (Task).
Jun 1 2023, 10:04 PM · Patch-For-Review, SRE, ops-esams, DC-Ops

May 31 2023

wiki_willy assigned T337705: Inbound interface errors to Jhancock.wm.
May 31 2023, 5:33 PM · ops-codfw
wiki_willy removed a watcher for ops-eqiad: Cmjohnson.
May 31 2023, 3:09 PM
wiki_willy removed a member for ops-eqiad: Cmjohnson.
May 31 2023, 3:09 PM

May 26 2023

wiki_willy assigned T337451: Inbound interface errors to Jclark-ctr.
May 26 2023, 5:19 PM · ops-eqiad

May 25 2023

wiki_willy assigned T337276: Inbound interface errors to Jhancock.wm.
May 25 2023, 8:05 PM · ops-codfw
wiki_willy assigned T337247: ManagementSSHDown to Jhancock.wm.
May 25 2023, 8:05 PM · ops-codfw
wiki_willy assigned T337445: db2110 crashed to Jhancock.wm.

Hi @Marostegui - Papaul is on paternity leave for another week, so I'm going to pass this over to @Jhancock.wm to check out. The server is about 4yrs old, so it's out of warranty, but there might be parts that could be pulled from a decommissioned server if we're able to isolate the issue. Thanks, Willy

May 25 2023, 4:51 PM · SRE, ops-codfw, DBA

May 18 2023

wiki_willy assigned T336949: PowerSupplyFailure to Jhancock.wm.
May 18 2023, 11:33 PM · ops-codfw

May 17 2023

wiki_willy reassigned T336332: decommission db1112.eqiad.wmnet from wiki_willy to Jclark-ctr.
May 17 2023, 9:07 PM · SRE, ops-eqiad, decommission-hardware
wiki_willy added a comment to T336826: Degraded RAID on analytics1068.

Thanks @Jclark-ctr. Feel free to pull the drives from a server that's already been decommissoned.

May 17 2023, 2:57 PM · Data-Platform-SRE, SRE, ops-eqiad

May 16 2023

wiki_willy assigned T335588: Decommission prometheus6001 to RobH.
May 16 2023, 9:06 PM · ops-drmrs, DC-Ops, SRE Observability (FY2022/2023-Q4), decommission-hardware
wiki_willy added a comment to T335587: Decommission prometheus5001.

@RobH - this might be something we could add to the recycle pickup

May 16 2023, 9:05 PM · SRE, DC-Ops, ops-eqsin, SRE Observability (FY2022/2023-Q4), decommission-hardware
wiki_willy assigned T335587: Decommission prometheus5001 to RobH.
May 16 2023, 9:04 PM · SRE, DC-Ops, ops-eqsin, SRE Observability (FY2022/2023-Q4), decommission-hardware
wiki_willy assigned T335585: Decommission prometheus4001 to RobH.
May 16 2023, 9:04 PM · SRE, ops-ulsfo, DC-Ops, SRE Observability (FY2022/2023-Q4), decommission-hardware
wiki_willy assigned T336538: PowerSupplyFailure to Jhancock.wm.
May 16 2023, 9:03 PM · ops-codfw
wiki_willy assigned T336720: Inbound interface errors to Jhancock.wm.
May 16 2023, 9:02 PM · ops-codfw
wiki_willy updated subscribers of T336623: Netbox device's platform field inconsistency.

Agreed, I don't think there's any need to continue using "platform" in Netbox, especially since more than half the devices don't have it currently filled out. @Papaul, @RobH, @Jclark-ctr, @Jhancock.wm - feel free to chime in if you have any other thoughts.

May 16 2023, 4:30 PM · Infrastructure-Foundations, DC-Ops, netbox

May 15 2023

wiki_willy reassigned T334910: decommission db1123.eqiad.wmnet from wiki_willy to Jclark-ctr.
May 15 2023, 4:43 PM · SRE, ops-eqiad, decommission-hardware

May 11 2023

wiki_willy reassigned T335011: decommission db1110.eqiad.wmnet from wiki_willy to Jclark-ctr.
May 11 2023, 4:05 PM · SRE, ops-eqiad, DBA, decommission-hardware

May 10 2023

wiki_willy assigned T336326: db1225 crashed (CPU 1 machine check error detected) to Jclark-ctr.
May 10 2023, 5:36 PM · SRE, DC-Ops, ops-eqiad, Data-Persistence-Backup, DBA
wiki_willy reassigned T336029: decommission db1113.eqiad.wmnet from wiki_willy to Jclark-ctr.
May 10 2023, 3:10 PM · SRE, ops-eqiad, decommission-hardware

May 3 2023

wiki_willy reassigned T335836: decommission db1111.eqiad.wmnet from wiki_willy to Jclark-ctr.
May 3 2023, 2:59 PM · SRE, ops-eqiad, decommission-hardware

May 2 2023

wiki_willy assigned T335722: Inbound interface errors to Papaul.
May 2 2023, 8:05 PM · ops-codfw
wiki_willy reassigned T330930: Port with no description on access switch from Cmjohnson to Jclark-ctr.
May 2 2023, 7:24 PM · ops-eqiad
wiki_willy assigned T289882: Q1:(Need By: TBD) rack/setup/install cloudswift100[12] to Papaul.
May 2 2023, 7:21 PM · SRE, Infrastructure-Foundations, ops-eqiad, netops, cloud-services-team (Hardware), DC-Ops
wiki_willy added a comment to T324998: Q3:rack/setup/install cloudcephosd10(3[5-9]|40).

@Jclark-ctr - can you take a peak at this one to see if it's pending on anything from our side? Thanks, Willy

May 2 2023, 7:20 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
wiki_willy reassigned T324998: Q3:rack/setup/install cloudcephosd10(3[5-9]|40) from Cmjohnson to Jclark-ctr.
May 2 2023, 7:20 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops

Apr 28 2023

wiki_willy updated subscribers of T333007: validate what we need from the check_eth check.
Apr 28 2023, 11:24 PM · SRE Observability (FY2023/2024-Q2), Patch-For-Review, Infrastructure-Foundations, netbox, DC-Ops, Observability-Alerting