Page MenuHomePhabricator

fnegri (Francesco Negri)
Site Reliability Engineer

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Sunday

  • Clear sailing ahead.

User Details

User Since
Jul 18 2022, 2:39 PM (100 w, 4 d)
Availability
Available
IRC Nick
dhinus
LDAP User
FNegri
MediaWiki User
FNegri-WMF [ Global Accounts ]

Recent Activity

Today

fnegri added a comment to T344599: wikireplicas root access.

replication password is shared between clouddb and production hosts

This is not a super big deal, you cannot really do much with it.

Fri, Jun 21, 10:46 AM · Data-Services, cloud-services-team, Infrastructure Security
fnegri updated the task description for T368136: [wikireplicas] Make sure there is no sensitive data in clouddb hosts.
Fri, Jun 21, 10:27 AM · SRE, Data-Services, cloud-services-team
fnegri created T368136: [wikireplicas] Make sure there is no sensitive data in clouddb hosts.
Fri, Jun 21, 10:26 AM · SRE, Data-Services, cloud-services-team

Yesterday

fnegri moved T367464: [bug] Quarry queries not completing from Backlog to Bugs on the Quarry board.
Thu, Jun 20, 5:06 PM · Quarry
fnegri added a comment to T367464: [bug] Quarry queries not completing .

@Liz I'm sorry that you're still having issues, I suspect that sometimes your queries take a bit longer to complete, and when that happens you run into the ConnectionResetError described above.

Thu, Jun 20, 5:05 PM · Quarry
fnegri closed T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1) as Resolved.

it was agreed that it was a best effort and it was never guaranteed the hosts would have 0 lag.

Thu, Jun 20, 4:56 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri updated subscribers of T344599: wikireplicas root access.

I think that members of wmcs-roots can now circumvent this by using the cloudcumin hosts, and run a command as root through Cumin.

Thu, Jun 20, 2:41 PM · Data-Services, cloud-services-team, Infrastructure Security
fnegri created T368050: [wikireplicas] Automated tests for views.
Thu, Jun 20, 1:51 PM · Data-Services
fnegri added a comment to T300427: Automate maintain-views replica depooling.

@taavi I was wondering what's the status of this task. I see you pushed a few patches to maintain-views in February, what's left?

Thu, Jun 20, 9:02 AM · Data-Platform-SRE, cloud-services-team, Data-Services

Wed, Jun 19

fnegri added a comment to T367464: [bug] Quarry queries not completing .

But hey, a few of my queries just issued reports! I don't know what happened since I posted this message but something has changed for the better.

Wed, Jun 19, 3:23 PM · Quarry
fnegri added a comment to T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1).

Query plans on clouddb1013:

Wed, Jun 19, 10:06 AM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri added a comment to T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1).

To verify if my theory is correct, I repooled clouddb1017, let's see if the lag starts increasing again.

Wed, Jun 19, 10:02 AM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri added a comment to T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1).

As suggested by @taavi I tried depooling s1 on clouddb1017, so that all s1 wikireplica traffic will go to the other host (clouddb1013).

Wed, Jun 19, 9:45 AM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)

Tue, Jun 18

fnegri awarded T353891: https://lists.wikimedia.org is often slow to load a Love token.
Tue, Jun 18, 10:49 PM · Upstream, SRE, Performance Issue, Wikimedia-Mailing-lists
fnegri added a comment to T367464: [bug] Quarry queries not completing .

If you look through Execution time column on Recent queries list, it actually seems like that results of virtually any query with execution time longer than ~120s will never make it back

Tue, Jun 18, 5:04 PM · Quarry
fnegri added a comment to T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1).

Out of the total 170 queries killed, 69 include /* pollcats.rs SLOW_OK */

Tue, Jun 18, 4:47 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri updated subscribers of T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1).

As suggested by @taavi I tried depooling s1 on clouddb1017, so that all s1 wikireplica traffic will go to the other host (clouddb1013).

Tue, Jun 18, 4:40 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri updated subscribers of T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1).

The lag grows until about 3 hours, then starts decreasing. This is consistent with wmf-pt-kill that is configured to kill queries taking longer than 3 hours to complete (--busy-time 10800).

Tue, Jun 18, 4:23 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri added a comment to T367499: hw troubleshooting: server fails to reboot for clouddb1018.eqiad.wmnet.

The host is now repooled.

Tue, Jun 18, 3:54 PM · cloud-services-team (Hardware), SRE, ops-eqiad, DC-Ops
fnegri updated the task description for T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1).
Tue, Jun 18, 3:21 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri awarded T337570: Get GitLab to render `T{\d}+` in MR overviews, comments, etc. as links to Phabricator a Love token.
Tue, Jun 18, 1:12 PM · Phabricator, GitLab (Integrations), User-brennen, Release-Engineering-Team (Priority Backlog 📥)
fnegri added a comment to T367464: [bug] Quarry queries not completing .

@Liz it is getting attention by multiple people, but it's not clear what the problem is. :)

Tue, Jun 18, 8:35 AM · Quarry
fnegri changed the status of T367464: [bug] Quarry queries not completing from Open to In Progress.
Tue, Jun 18, 8:33 AM · Quarry

Mon, Jun 17

fnegri changed the status of T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1) from Open to In Progress.
Mon, Jun 17, 4:06 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri moved T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1) from Backlog to Wiki replicas on the Data-Services board.
Mon, Jun 17, 4:05 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri created T367778: [wikireplicas] frequent replag spikes in clouddb1017 (s1).
Mon, Jun 17, 4:05 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri moved T347977: cloudcumin: allow wmcs-admin to run wikireplicas cookbooks and scripts from Backlog to Wiki replicas on the Data-Services board.
Mon, Jun 17, 3:56 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
fnegri updated the task description for T367772: [toolsdb] Clean up users and manage as code.
Mon, Jun 17, 3:19 PM · Data-Services
fnegri created T367772: [toolsdb] Clean up users and manage as code.
Mon, Jun 17, 3:19 PM · Data-Services
fnegri awarded T367725: Get rid of cloud-cumin VMs in cloudinfra project a Like token.
Mon, Jun 17, 10:31 AM · Cloud-VPS
fnegri triaged T367393: Allow Superset to query ToolsDB public databases as Medium priority.
Mon, Jun 17, 10:27 AM · cloud-services-team (FY2023/2024-Q3-Q4), superset.wmcloud.org
fnegri changed the status of T367393: Allow Superset to query ToolsDB public databases, a subtask of T151158: Support queries against Quarry's own database and ToolsDB, from Open to In Progress.
Mon, Jun 17, 10:27 AM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri changed the status of T367393: Allow Superset to query ToolsDB public databases from Open to In Progress.
Mon, Jun 17, 10:27 AM · cloud-services-team (FY2023/2024-Q3-Q4), superset.wmcloud.org
fnegri updated subscribers of T367415: Allow Quarry to query its own database.
Mon, Jun 17, 10:20 AM · Quarry
fnegri added a comment to T351457: [toolsdb] Replication stopped because of invalid event.

This happened again yesterday. Similar to the previous occurrences, START SLAVE; was enough to resume replication.

Mon, Jun 17, 9:28 AM · cloud-services-team (FY2023/2024-Q1-Q2), Data-Services

Fri, Jun 14

fnegri closed T365975: [cloud-vps] migrate DNS zones away from deprecated clouddb-services project as Resolved.
Fri, Jun 14, 11:24 PM · Data-Services
fnegri closed T365975: [cloud-vps] migrate DNS zones away from deprecated clouddb-services project, a subtask of T359810: Are clouddb-wikireplicas-query-1 and the cloudb-services project still useful?, as Resolved.
Fri, Jun 14, 11:23 PM · Cloud-VPS (Debian Buster Deprecation), VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
fnegri added a comment to T365975: [cloud-vps] migrate DNS zones away from deprecated clouddb-services project.

But the extra ones are just standard openstack zones that are not being used.

Fri, Jun 14, 11:17 PM · Data-Services
fnegri added a comment to T365975: [cloud-vps] migrate DNS zones away from deprecated clouddb-services project.

There are actually 7 zones in total in the project:

Fri, Jun 14, 11:11 PM · Data-Services
fnegri renamed T365975: [cloud-vps] migrate DNS zones away from deprecated clouddb-services project from [cloud-vps] Deprecate clouddb-services project to [cloud-vps] migrate DNS zones away from deprecated clouddb-services project.
Fri, Jun 14, 10:07 PM · Data-Services
fnegri closed T367523: Cloud VPS "clouddb-services" project Buster deprecation as Resolved.

The only instance in this project was deleted in T359810.

Fri, Jun 14, 10:07 PM · Cloud-VPS (Debian Buster Deprecation)
fnegri updated the task description for T367523: Cloud VPS "clouddb-services" project Buster deprecation.
Fri, Jun 14, 10:06 PM · Cloud-VPS (Debian Buster Deprecation)
fnegri updated the task description for T367523: Cloud VPS "clouddb-services" project Buster deprecation.
Fri, Jun 14, 10:06 PM · Cloud-VPS (Debian Buster Deprecation)
fnegri added a comment to T365975: [cloud-vps] migrate DNS zones away from deprecated clouddb-services project.

The project was deleted today (T359810) but the DNS names are still listed as belonging to that project:

Fri, Jun 14, 10:04 PM · Data-Services
fnegri added a subtask for T359810: Are clouddb-wikireplicas-query-1 and the cloudb-services project still useful?: T365975: [cloud-vps] migrate DNS zones away from deprecated clouddb-services project.
Fri, Jun 14, 10:04 PM · Cloud-VPS (Debian Buster Deprecation), VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
fnegri added a parent task for T365975: [cloud-vps] migrate DNS zones away from deprecated clouddb-services project: T359810: Are clouddb-wikireplicas-query-1 and the cloudb-services project still useful?.
Fri, Jun 14, 10:04 PM · Data-Services
fnegri added a comment to T359810: Are clouddb-wikireplicas-query-1 and the cloudb-services project still useful?.

I didn't see this task, but I had left a comment at https://wikitech.wikimedia.org/wiki/News/Cloud_VPS_2024_Purge and T365975: [cloud-vps] migrate DNS zones away from deprecated clouddb-services project.

Fri, Jun 14, 9:55 PM · Cloud-VPS (Debian Buster Deprecation), VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
fnegri moved T367499: hw troubleshooting: server fails to reboot for clouddb1018.eqiad.wmnet from Inbox to Hardware on the cloud-services-team board.
Fri, Jun 14, 3:48 PM · cloud-services-team (Hardware), SRE, ops-eqiad, DC-Ops
fnegri added a project to T367499: hw troubleshooting: server fails to reboot for clouddb1018.eqiad.wmnet: cloud-services-team.
Fri, Jun 14, 3:47 PM · cloud-services-team (Hardware), SRE, ops-eqiad, DC-Ops
fnegri created T367499: hw troubleshooting: server fails to reboot for clouddb1018.eqiad.wmnet.
Fri, Jun 14, 12:05 PM · cloud-services-team (Hardware), SRE, ops-eqiad, DC-Ops
fnegri added a comment to T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO).

@Liz are the queries that never finish different from other queries, or are they similar but sometimes they randomly fail? Did similar queries work fine in the past?

Fri, Jun 14, 9:10 AM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry

Thu, Jun 13

fnegri renamed T348407: Allow Quarry to query ToolsDB public databases from Create db user for Quarry with readonly access to public ToolsDB databases to Allow Quarry to query ToolsDB public databases.
Thu, Jun 13, 2:49 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Quarry
fnegri added a subtask for T151158: Support queries against Quarry's own database and ToolsDB: T367393: Allow Superset to query ToolsDB public databases.
Thu, Jun 13, 2:44 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri added a parent task for T367393: Allow Superset to query ToolsDB public databases: T151158: Support queries against Quarry's own database and ToolsDB.
Thu, Jun 13, 2:44 PM · cloud-services-team (FY2023/2024-Q3-Q4), superset.wmcloud.org
fnegri updated the task description for T151158: Support queries against Quarry's own database and ToolsDB.
Thu, Jun 13, 2:39 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri created T367415: Allow Quarry to query its own database.
Thu, Jun 13, 2:36 PM · Quarry
fnegri removed a project from T348407: Allow Quarry to query ToolsDB public databases: Patch-For-Review.
Thu, Jun 13, 12:50 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Quarry
fnegri added a comment to T151158: Support queries against Quarry's own database and ToolsDB.

We are finally close to resolving this task! After the work in T348407 I can successfully query ToolsDB from Quarry. Access at the moment is limited to one database for testing (s55771__wsstats_p). As discussed in T348407 I will send an email to cloud-announce to inform everyone that we're opening this type of access to the ToolsDB databases (only for the databases ending with _p).

Thu, Jun 13, 10:34 AM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
KCVelaga_WMF awarded T367393: Allow Superset to query ToolsDB public databases a 100 token.
Thu, Jun 13, 10:27 AM · cloud-services-team (FY2023/2024-Q3-Q4), superset.wmcloud.org
fnegri created T367393: Allow Superset to query ToolsDB public databases.
Thu, Jun 13, 10:26 AM · cloud-services-team (FY2023/2024-Q3-Q4), superset.wmcloud.org
fnegri changed the status of T151158: Support queries against Quarry's own database and ToolsDB from Open to In Progress.
Thu, Jun 13, 10:04 AM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri claimed T151158: Support queries against Quarry's own database and ToolsDB.
Thu, Jun 13, 10:03 AM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri raised the priority of T151158: Support queries against Quarry's own database and ToolsDB from Low to Medium.
Thu, Jun 13, 10:02 AM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri added a comment to T348407: Allow Quarry to query ToolsDB public databases.

After the fixes in T365374 (thanks to @SD0001 for the fix and thanks to @rook for redeploying Quarry) I can now query ToolsDB successfully!

Thu, Jun 13, 10:01 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Quarry
fnegri moved T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO) from In progress to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Thu, Jun 13, 8:42 AM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri closed T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO), a subtask of T348407: Allow Quarry to query ToolsDB public databases, as Resolved.
Thu, Jun 13, 8:42 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Quarry
fnegri closed T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO) as Resolved.

The specific error described in this bug report (Access denied for user 'quarry'@'172.16.2.72' (using password: NO)) is no longer happening, so I'm marking this task as Resolved.

Thu, Jun 13, 8:42 AM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry

Wed, Jun 12

fnegri added a comment to T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO).

@rook redeployed Quarry including the latest fixes https://github.com/toolforge/quarry/pull/46 and https://github.com/toolforge/quarry/pull/47.

Wed, Jun 12, 4:38 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri added a comment to T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO).

I'm confused because that directory has a checkout of a non-main branch:

Wed, Jun 12, 3:18 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri added a comment to T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO).

https://github.com/toolforge/quarry/pull/46 and https://github.com/toolforge/quarry/pull/47 should probably fix the issue, but I'm not sure how to deploy those after they are merged.

Wed, Jun 12, 3:12 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri changed the status of T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO) from Open to In Progress.
Wed, Jun 12, 2:46 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri moved T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO) from Backlog to In progress on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Wed, Jun 12, 2:46 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri moved T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO) from Backlog to Bugs on the Quarry board.
Wed, Jun 12, 2:46 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri changed the status of T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO), a subtask of T348407: Allow Quarry to query ToolsDB public databases, from Open to In Progress.
Wed, Jun 12, 2:46 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Quarry
fnegri added a comment to T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO).

Ah well, looks like the deployment of https://github.com/toolforge/quarry/pull/40 didn't fully go through. The keys TOOLS_DB_USER and TOOLS_DB_PASSWORD are missing in config.yaml on the pods

Wed, Jun 12, 1:31 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry

Tue, Jun 11

fnegri added a comment to T365374: [bug] Access denied for user 'quarry'@'172.16.2.72' (using password: NO).

I suspect they are related yes. Maybe Quarry is trying to connect to another database but using ToolsDB credentials, or viceversa.

Tue, Jun 11, 2:29 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
fnegri added a comment to T348407: Allow Quarry to query ToolsDB public databases.

We could also try with Superset in the meantime, maybe that will be easier. I will have a look.

Tue, Jun 11, 1:46 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Quarry
fnegri added a comment to T348407: Allow Quarry to query ToolsDB public databases.

@KCVelaga_WMF it's not working but I'm struggling to understand why. I had a quick look into the Quarry source code but I plan on investigating more this week. If anyone has any hints on what could be the problem please let me know.

Tue, Jun 11, 1:43 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Quarry

Mon, Jun 10

fnegri moved T361105: create and deploy new Elastic Curator deb package from Backlog to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:27 PM · Data-Platform-SRE (2024.03.25 - 2024.04.14), cloud-services-team (FY2023/2024-Q3-Q4), Infrastructure-Foundations, SRE-tools, Spicerack
fnegri moved T351450: Migrate Cloud VPS puppet infrastructure to Puppet 7 from Backlog to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:27 PM · Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), Goal, Puppet (Puppet 7.0), Cloud-VPS
fnegri moved T356287: Upgrade cloud-vps openstack to version 'Bobcat' from Backlog to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:27 PM · cloud-services-team (FY2023/2024-Q3-Q4), Goal, Cloud-VPS
fnegri moved T335978: openstack: consider removing references to old hardware from the database from Backlog to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:27 PM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
fnegri moved T361647: Remove elasticsearch-curator dependency from Spicerack/Elastic cookbooks from Backlog to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:27 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05), Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), Infrastructure-Foundations, SRE-tools, Spicerack
fnegri moved T361563: [cloudinfra] puppet CA cert expired from Backlog to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:27 PM · Cloud-Services-Worktype-Maintenance, Cloud-Services-Origin-Alert, cloud-services-team (FY2023/2024-Q3-Q4), User-dcaro
fnegri moved T362515: [tools,harbor] failing to push images to harbor from Backlog to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:26 PM · Cloud-Services-Worktype-Unplanned, Cloud-Services-Origin-User, cloud-services-team (FY2023/2024-Q3-Q4), User-dcaro
fnegri moved T363696: [tf-infra-test] Authentication failed from Backlog to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:26 PM · Cloud-VPS, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri moved T364459: Migrate eqiad1 cloudnets to Neutron OVS agent from In progress to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:26 PM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
fnegri moved T320973: [wmcs][alerting] Allow silencing alerts metricsinfra alerts on alerts.wikimedia.org from In progress to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:26 PM · cloud-services-team (FY2023/2024-Q3-Q4), Patch-For-Review, User-fgiunchedi, User-dcaro
fnegri moved T345337: spicerack: tox fails to install PyYAML using python 3.11 on bookworm from Blocked to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jun 10, 2:25 PM · cloud-services-team (FY2023/2024-Q3-Q4), Patch-For-Review, Infrastructure-Foundations, SRE-tools, Spicerack
fnegri added a comment to T365164: [wikireplicas] clouddb* free memory decreases over time.

The alerts are visible again. I will restart the services.

Mon, Jun 10, 2:17 PM · Data-Services
fnegri awarded T239378: Disable parent task metadata by default for new sub tasks a Like token.
Mon, Jun 10, 2:13 PM · Patch-For-Review, User-brennen, Release-Engineering-Team, Phabricator, Developer Productivity
fnegri added a comment to T364492: Ownership confusion on cloud-local puppet servers.

Do aliases/bashrc apply when running just sudo git foo (without -i)?

Mon, Jun 10, 2:05 PM · cloud-services-team (FY2023/2024-Q3-Q4), Patch-For-Review, Puppet-Infrastructure
fnegri added a comment to T364492: Ownership confusion on cloud-local puppet servers.

Maybe we could prevent the root user from running git with something like alias git="echo run git with the gitpuppet user"?

Mon, Jun 10, 1:43 PM · cloud-services-team (FY2023/2024-Q3-Q4), Patch-For-Review, Puppet-Infrastructure
fnegri added a comment to T364492: Ownership confusion on cloud-local puppet servers.

Cherry-picking is also working:

Mon, Jun 10, 1:24 PM · cloud-services-team (FY2023/2024-Q3-Q4), Patch-For-Review, Puppet-Infrastructure
fnegri added a comment to T364492: Ownership confusion on cloud-local puppet servers.

The permissions look similar on tools-puppetserver-01:

Mon, Jun 10, 1:16 PM · cloud-services-team (FY2023/2024-Q3-Q4), Patch-For-Review, Puppet-Infrastructure

Tue, Jun 4

fnegri added a comment to T365164: [wikireplicas] clouddb* free memory decreases over time.

I did not restart the services, but the alerts disappeared from alerts.wikimedia.org. I can see they are still in status WARNING in Icinga though, I'm not sure why they are no longer visible in alerts.wikimedia.org.

Tue, Jun 4, 4:43 PM · Data-Services
fnegri added a comment to T318479: Intermittent redis connection timeouts in Toolforge.

I wonder if it could be something that's not related to Redis at all, but instead something else that blocks the application thread for a long time. Just a guess, I might be completely wrong.

Tue, Jun 4, 4:34 PM · Toolforge (Toolforge iteration 11), cloud-services-team (FY2023/2024-Q3-Q4)
fnegri triaged T362390: [docs] update READMEs as Medium priority.
Tue, Jun 4, 2:01 PM · Documentation, Toolforge
fnegri triaged T353762: Python buildpack does not detect requirements from pyproject.toml as Low priority.
Tue, Jun 4, 2:01 PM · Toolforge, Upstream
fnegri triaged T366365: [toolforge] [redis] Improve Puppet config as Medium priority.
Tue, Jun 4, 1:42 PM · Toolforge